The corpus of Basque simplified texts (CBST)
In this paper we present the corpus of Basque simplified texts. This corpus compiles 227 original sentences of science popularisation domain and two simplified versions of each sentence. The simplified versions have been created following different approaches: the structural, by a court translator w...
Saved in:
Published in: | Language resources and evaluation Vol. 52; no. 1; pp. 217 - 247 |
---|---|
Main Authors: | , , |
Format: | Journal Article |
Language: | English |
Published: |
Dordrecht
Springer Netherlands
01-03-2018
Springer Nature B.V |
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Abstract | In this paper we present the corpus of Basque simplified texts. This corpus compiles 227 original sentences of science popularisation domain and two simplified versions of each sentence. The simplified versions have been created following different approaches: the structural, by a court translator who considers easy-to-read guidelines and the intuitive, by a teacher based on her experience. The aim of this corpus is to make a comparative analysis of simplified text. To that end, we also present the annotation scheme we have created to annotate the corpus. The annotation scheme is divided into eight macro-operations: delete, merge, split, transformation, insert, reordering, no operation and other. These macro-operations can be classified into different operations. We also relate our work and results to other languages. This corpus will be used to corroborate the decisions taken and to improve the design of the automatic text simplification system for Basque. |
---|---|
AbstractList | In this paper we present the corpus of Basque simplified texts. This corpus compiles 227 original sentences of science popularisation domain and two simplified versions of each sentence. The simplified versions have been created following different approaches: the structural, by a court translator who considers easy-to-read guidelines and the intuitive, by a teacher based on her experience. The aim of this corpus is to make a comparative analysis of simplified text. To that end, we also present the annotation scheme we have created to annotate the corpus. The annotation scheme is divided into eight macro-operations: delete, merge, split, transformation, insert, reordering, no operation and other. These macro-operations can be classified into different operations. We also relate our work and results to other languages. This corpus will be used to corroborate the decisions taken and to improve the design of the automatic text simplification system for Basque. |
Author | Gonzalez-Dios, Itziar Aranzabe, María Jesús Díaz de Ilarraza, Arantza |
Author_xml | – sequence: 1 givenname: Itziar orcidid: 0000-0003-1048-5403 surname: Gonzalez-Dios fullname: Gonzalez-Dios, Itziar email: itziar.gonzalezd@ehu.eus organization: Ixa NLP Group, University of the Basque Country (UPV/EHU) – sequence: 2 givenname: María Jesús orcidid: 0000-0002-0401-1087 surname: Aranzabe fullname: Aranzabe, María Jesús organization: Ixa NLP Group, University of the Basque Country (UPV/EHU) – sequence: 3 givenname: Arantza orcidid: 0000-0003-3317-8561 surname: Díaz de Ilarraza fullname: Díaz de Ilarraza, Arantza organization: Ixa NLP Group, University of the Basque Country (UPV/EHU) |
BookMark | eNp1kE1Lw0AQhhepYFv9Ad4CXhRcnc1-H23wCwoejOBtidtdTWmTuJtA_fduiYgXTzMMzzszPDM0adrGIXRK4IoAyOtIgEuNgUisGUgsDtCUcMkw5ERNfnt4PUKzGNcALGdSTdFl-eEy24ZuiFnrs0UVPweXxXrbbWpfu1XWu10fs_Ni8VxeHKNDX22iO_mpc_Ryd1sWD3j5dP9Y3CyxpVz3mImKyhWnXBHtAAQnnknwbz5NNFVSek8tt4pIARUwnoDcW6Xy3AlFU3COzsa9XWjTO7E363YITTppiBZcUs5AJ4qMlA1tjMF504V6W4UvQ8DspZhRiklSzF6KESmTj5mY2ObdhT-b_w19A4S4Ymc |
CitedBy_id | crossref_primary_10_3389_fpsyg_2022_707630 crossref_primary_10_3366_word_2020_0172 crossref_primary_10_1017_S1351324918000384 |
Cites_doi | 10.1016/j.cognition.2009.11.012 10.1016/S0010-0277(02)00087-2 10.1177/1362168811423456 10.1162/tacl_a_00139 10.3115/v1/W14-1206 10.1007/s10648-011-9181-8 10.1016/j.jml.2004.02.003 10.1037/h0057532 10.4304/tpls.2.1.43-53 10.1080/23273798.2014.994009 10.3115/v1/W14-5604 10.1075/itl.165.2.06sid 10.1007/s10579-014-9265-4 10.1075/ijcl.14.1.02lu 10.1017/S0142716400000047 10.3115/v1/W14-1210 10.3115/v1/W15-1604 10.21437/SLaTE.2007-20 |
ContentType | Journal Article |
Copyright | The Author(s) 2017 Language Resources and Evaluation is a copyright of Springer, (2017). All Rights Reserved. |
Copyright_xml | – notice: The Author(s) 2017 – notice: Language Resources and Evaluation is a copyright of Springer, (2017). All Rights Reserved. |
DBID | C6C AAYXX CITATION 3V. 7SC 7T9 7XB 8AL 8FD 8FE 8FG 8FK 8G5 ABUWG AFKRA AIMQZ ALSLI ARAPS AVQMV AZQEC BENPR BGLVJ CCPQU CPGLG CRLPW DWQXO GB0 GNUQQ GUQSH HCIFZ JQ2 K50 K7- L7M LIQON L~C L~D M0N M1D M2O MBDVC P5Z P62 PQEST PQQKQ PQUKI PRINS Q9U |
DOI | 10.1007/s10579-017-9407-6 |
DatabaseName | Springer Nature OA Free Journals CrossRef ProQuest Central (Corporate) Computer and Information Systems Abstracts Linguistics and Language Behavior Abstracts (LLBA) ProQuest Central (purchase pre-March 2016) Computing Database (Alumni Edition) Technology Research Database ProQuest SciTech Collection ProQuest Technology Collection ProQuest Central (Alumni) (purchase pre-March 2016) Research Library (Alumni Edition) ProQuest Central (Alumni) ProQuest Central ProQuest One Literature Social Science Premium Collection (Proquest) (PQ_SDU_P3) Advanced Technologies & Aerospace Collection Arts Premium Collection ProQuest Central Essentials ProQuest Central Technology Collection ProQuest One Community College Linguistics Collection Linguistics Database ProQuest Central DELNET Social Sciences & Humanities Collection ProQuest Central Student Research Library Prep SciTech Premium Collection (Proquest) (PQ_SDU_P3) ProQuest Computer Science Collection Art, Design & Architecture Collection (Proquest) (PQ_SDU_P3) Computer Science Database Advanced Technologies Database with Aerospace ProQuest One Literature - U.S. Customers Only Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional Computing Database ProQuest Arts & Humanities Database ProQuest Research Library Research Library (Corporate) Advanced Technologies & Aerospace Database ProQuest Advanced Technologies & Aerospace Collection ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China ProQuest Central Basic |
DatabaseTitle | CrossRef ProQuest DELNET Social Sciences and Humanities Collection Research Library Prep Computer Science Database ProQuest Central Student Technology Collection Technology Research Database Computer and Information Systems Abstracts – Academic ProQuest Advanced Technologies & Aerospace Collection ProQuest Central Essentials ProQuest Computer Science Collection Computer and Information Systems Abstracts ProQuest Central (Alumni Edition) SciTech Premium Collection ProQuest One Community College Research Library (Alumni Edition) ProQuest Central China ProQuest Central Linguistics Collection Arts Premium Collection ProQuest Central Korea ProQuest Research Library ProQuest Art, Design and Architecture Collection Advanced Technologies Database with Aerospace Advanced Technologies & Aerospace Collection Social Science Premium Collection ProQuest Computing ProQuest One Literature - U.S. Customers Only ProQuest Central Basic ProQuest One Literature ProQuest Computing (Alumni Edition) ProQuest One Academic Eastern Edition Linguistics and Language Behavior Abstracts (LLBA) ProQuest Technology Collection ProQuest SciTech Collection Computer and Information Systems Abstracts Professional Advanced Technologies & Aerospace Database ProQuest One Academic UKI Edition Linguistics Database Arts & Humanities Full Text ProQuest One Academic ProQuest Central (Alumni) |
DatabaseTitleList | ProQuest DELNET Social Sciences and Humanities Collection |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Library & Information Science Computer Science |
EISSN | 1574-0218 |
EndPage | 247 |
ExternalDocumentID | 10_1007_s10579_017_9407_6 |
GrantInformation_xml | – fundername: Ministerio de Economía y Competitividad grantid: TIN2013-46616-C2-1-R funderid: http://dx.doi.org/10.13039/501100003329 – fundername: Universidad del País Vasco (UPV/EHU) grantid: Grant for the new doctors from the Vice-rectory of Research – fundername: Eusko Jaurlaritza grantid: Ph.D. grant BFI-2011- 392; IT344-10 funderid: http://dx.doi.org/10.13039/501100003086 |
GroupedDBID | -51 -5C -5G -BR -DZ -EM -Y2 -~C .4H .4S .86 .DC 06D 07C 0R~ 0VY 199 2.D 203 29L 2J2 2JN 2JY 2KG 2LR 2P1 2VQ 2~H 30V 3EH 3V. 4.4 406 408 409 40E 5GY 5VS 67Z 6NX 78A 8FE 8FG 8G5 8TC 8UJ 95- 95. 95~ 96X AAAVM AABHQ AABYN AAFGU AAGAY AAGJQ AAHNG AAIAL AAJKR AANTL AANZL AAPBV AARHV AARTL AATNV AATVU AAUYE AAWCG AAXYU AAYFA AAYIU AAYOK AAYQN AAYTO ABBBX ABBHK ABBXA ABDZT ABECU ABECW ABFGW ABFTV ABHLI ABHQN ABJNI ABJOX ABKAS ABKCH ABKTR ABLJU ABMNI ABMQK ABNWP ABPTK ABQBU ABSXP ABTEG ABTHY ABTKH ABTMW ABULA ABUWG ABWNU ABXPI ACBMV ACBRV ACBXY ACBYP ACGFO ACGFS ACHSB ACHXU ACIGE ACIPQ ACKNC ACMDZ ACMLO ACNXV ACOKC ACOMO ACREN ACTTH ACVWB ACVYN ACWMK ADHIR ADINQ ADKNI ADKPE ADMDM ADOXG ADPTO ADRFC ADSWE ADTPH ADULT ADURQ ADYFF ADYOE ADZKW AEBTG AEEQQ AEFTE AEGAL AEGNC AEJHL AEJRE AEKMD AENEX AEOHA AEPYU AESKC AESTI AETLH AEUPB AEVLU AEVTX AEXYK AFEXP AFFNX AFGCZ AFKRA AFLOW AFNRJ AFQWF AFWTZ AFYQB AFZKB AGAYW AGDGC AGGBP AGHSJ AGJBK AGMZJ AGQMX AGWIL AGWZB AGYKE AHAVH AHBYD AHKAY AHSBF AHYZX AIAKS AIIXL AILAN AIMQZ AIMYW AITGF AJBLW AJDOV AJRNO AJZVZ AKQUC ALMA_UNASSIGNED_HOLDINGS ALSLI ALWAN AMKLP AMTXH AMXSW AMYLF AOCGG ARAPS ARCSS ARMRJ AVQMV AXYYD AYQZM AZFZN AZQEC AZRUE B-. BA0 BDATZ BENPR BGLVJ BGNMA BHNFS BPHCQ C6C CAG CCPQU COF CPGLG CRLPW CS3 CSCUP DDRTE DL5 DNIVK DPUIP DWQXO EBLON EBS EDO EHI EIOEI EJD ESBYG FEDTE FERAY FFXSO FIGPU FINBP FNLPD FRRFC FSGXE FWDCC GB0 GGCAI GGRSB GJIRD GNUQQ GNWQR GPZZG GQ6 GQ7 GQ8 GUQSH GXS HCIFZ HF~ HG5 HG6 HLICF HMHOC HMJXF HQYDN HRMNR HVGLF HZ~ I-F I09 IHE IJ- IKXTQ ITM IWAJR IXC IZIGR IZQ I~X I~Z J-C J0Z JAAYA JAB JBMMH JBSCW JCJTX JENOY JHFFW JKQEH JLEZI JLXEF JPL JSODD JST JZLTJ K50 K6V K7- KDC KOV LIQON LLZTM M0N M1D M2O M4Y MA- MQGED N2Q NB0 NDZJH NF0 NPVJJ NQJWS NU0 O9- O93 O9G O9I O9J OAM P-O P19 P62 P9Q PF- PQQKQ PROAC PT4 Q2X QF4 QN3 QN7 QOS R89 R9I RHV RIG ROL RPX RSV S16 S1Z S26 S27 S28 S3B SA0 SAP SCLPG SDA SDH SDM SHS SHX SISQX SJYHP SNE SNPRN SNX SOHCF SOJ SPISZ SRMVM SSLCW STPWE SZN T13 T16 TN5 TSG TSK TSV TUC TUS U2A UG4 UNUBA UOJIU UTJUX UZXMN VC2 VFIZW VQA W23 W48 WK8 YLTOR Z45 Z7X Z83 Z88 Z8R Z8W Z92 ZMTXR ZWUKE ~EX AACDK AAEOY AAJBT AASML AAYXX ABAKF ABXSQ ACAOD ACDTI ACZOJ ADACV AEFQL AEMSY AFBBN AGQEE AGRTI AGZLP AHEXP AIGIU CITATION H13 IPSME 7SC 7T9 7XB 8AL 8FD 8FK AAHCP AAYZH JQ2 L7M L~C L~D MBDVC PQEST PQUKI PRINS Q9U |
ID | FETCH-LOGICAL-c359t-46a37d535819e00651f470fbf35893877ff3c5c81760a0456512fc8822e683d53 |
IEDL.DBID | AEJHL |
ISSN | 1574-020X |
IngestDate | Tue Nov 19 05:37:04 EST 2024 Thu Sep 26 21:41:02 EDT 2024 Sat Dec 16 12:00:07 EST 2023 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 1 |
Keywords | Basque Monolingual parallel corpora Text simplification Annotation scheme |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c359t-46a37d535819e00651f470fbf35893877ff3c5c81760a0456512fc8822e683d53 |
ORCID | 0000-0002-0401-1087 0000-0003-3317-8561 0000-0003-1048-5403 |
OpenAccessLink | http://link.springer.com/10.1007/s10579-017-9407-6 |
PQID | 1965735409 |
PQPubID | 28740 |
PageCount | 31 |
ParticipantIDs | proquest_journals_1965735409 crossref_primary_10_1007_s10579_017_9407_6 springer_journals_10_1007_s10579_017_9407_6 |
PublicationCentury | 2000 |
PublicationDate | 2018-03-01 |
PublicationDateYYYYMMDD | 2018-03-01 |
PublicationDate_xml | – month: 03 year: 2018 text: 2018-03-01 day: 01 |
PublicationDecade | 2010 |
PublicationPlace | Dordrecht |
PublicationPlace_xml | – name: Dordrecht – name: Dordrect |
PublicationTitle | Language resources and evaluation |
PublicationTitleAbbrev | Lang Resources & Evaluation |
PublicationYear | 2018 |
Publisher | Springer Netherlands Springer Nature B.V |
Publisher_xml | – name: Springer Netherlands – name: Springer Nature B.V |
References | Gonzalez-DiosIAranzabeMJDíaz de IlarrazaATestuen sinplifikazio automatikoa: arloaren egungo egoera [Automatic text simplification: State of art]Linguamática2013524363 Hancke, J., Vajjala, S., & Meurers, D. (2012). Readability classification for German using lexical, syntactic, and morphological features. In Proceedings of COLING 2012, the 24th international conference on computational linguistics: Technical papers (pp. 1063–1080). Stenetorp, P., Pyysalo, S., Topic, G., Ohta, T., Ananiadou, S., & Tsujii, J. (2012). BRAT: A web-based tool for NLP-assisted text annotation. In Proceedings of the demonstrations session at EACL 2012. Petersen, S. E., & Ostendorf, M. (2007). Text simplification for language learners: A corpus analysis. In Proceedings of workshop on speech and language technology for education. SLaTE, Citeseer (pp. 69–72). Brunato, D., Dell’Orletta, F., Venturi, G., & Montemagni, S. (2015). Design and annotation of the first Italian corpus for text simplification. In The 9th linguistic annotation workshop held in conjunction with NAACL 2015. Dell’Orletta, F., Montemagni, S., & Venturi, G. (2011). READ-IT: Assessing readability of Italian texts with a view to text simplification. In Proceedings of the second workshop on speech and language processing for assistive technologies, Association for Computational Linguistics, Stroudsburg, PA, USA, SLPAT ‘11 (pp. 73–83). GroszBJWeinsteinSJoshiAKCentering: A framework for modeling the local coherence of discourseComputational Linguistics1995212203225 ChallJSDaleEReadability revisited: The new Dale–Chall readability formula1995NorthamptonBrookline Books Mitkov, R., & Štajner, S. (2014). The fewer, the better? A contrastive study about ways to simplify. In Proceedings of the workshop on automatic text simplification-methods and applications in the multilingual society (ATS-MA 2014), Association for Computational Linguistics and Dublin University (pp. 30–40). GunningRThe technique of clear writing1968New YorkMcGraw-Hill XuWCallison-BurchCNapolesCProblems in current text simplification research: New data can helpTransactions of the Association for Computational Linguistics20153283297 Covington, M. A., He, C., Brown, C., Naçi, L., & Brown, J. (2006). How complex is that sentence? A proposed revision of the Rosenberg and Abbeduto D-Level Scale. CASPR Research Report 2006-01. Athens, GA: The University of Georgia, Artificial Intelligence Center. Gonzalez-Dios, I., Aranzabe, M. J., Díaz de Ilarraza, A., & Salaberri, H. (2014). Simple or complex? Assessing the readability of Basque texts. In Proceedings of COLING 2014, the 25th international conference on computational linguistics: Technical papers (pp. 334–344). Štajner, S. (2015). New data-driven approaches to text simplification. PhD Thesis, University of Wolverhampton. Gonzalez-Dios, I. (2016). Euskarazko egitura konplexuen analisirako eta testuen sinplifikazio automatikorako proposamena/Readability assessment and automatic text simplification. The analysis of Basque complex structures. PhD Thesis, University of the Basque Country (UPV/EHU). CarreirasMDuñabeitiaJAVergaraMde la Cruz-PavíaILakaISubject relative clauses are not universally easier to process: Evidence from BasqueCognition20101151799210.1016/j.cognition.2009.11.012 LuXAutomatic measurement of syntactic complexity in child language acquisitionInternational Journal of Corpus Linguistics200914132810.1075/ijcl.14.1.02lu WarrenTGibsonEThe influence of referential processing on sentence complexityCognition20028517911210.1016/S0010-0277(02)00087-2 ZamanianMHeydariPReadability of texts: State of the artTheory and Practice in Language Studies201221435310.4304/tpls.2.1.43-53 DuBayWHThe principles of readability2004Costa Mesa, CAImpact Information Pellow, D., & Eskenazi, M. (2014). An open corpus of everyday documents for simplification tasks. In Proceedings of the 3rd workshop on predicting and improving text readability for target reader populations (PITR), Association for Computational Linguistics, Gothenburg, Sweden (pp. 84–93). Coster, W., & Kauchak, D. (2011). Simple English Wikipedia: A new text simplification task. In Proceedings of the 49th annual meeting of the Association for Computational Linguistics: Human language technologies: Short papers (Vol. 2, pp. 665–669). Klaper, D., Ebling, S., & Volk, M. (2013). Building a German/simple German parallel corpus for automatic text simplification. In Proceedings of the second workshop on predicting and improving text readability for target reader populations, Association for Computational Linguistics, Sofia, Bulgaria (pp. 11–19). LakaIErdoziaKTorregoELinearization references given “Free Word Order”; Subject preferences given ergativity: A look at BasqueFestschrift for Professor Carlos Piera2010OxfordOxford University Press SiddharthanAA survey of research on text simplificationThe International Journal of Applied Linguistics20141652259298 Bott, S., & Saggion, H. (2011). An unsupervised alignment algorithm for text simplification corpus construction. In Proceedings of the workshop on monolingual text-to-text generation, Association for Computational Linguistics, Stroudsburg, PA, USA, MTTG ‘11 (pp. 20–26). FleschRA new readability yardstickJournal of Applied Psychology194832322123310.1037/h0057532 MannWCThompsonSARhetorical structure theory: Toward a functional theory of text organizationText19888324328110.1515/text.1.1988.8.3.243 BottSSaggionHText simplification resources for SpanishLanguage Resources and Evaluation20144819312010.1007/s10579-014-9265-4 Aranzabe, M. J., Díaz de Ilarraza, A., & Gonzalez-Dios, I. (2012). First approach to automatic text simplification in basque. In L. Rello, & H. Saggion (Eds.), Proceedings of the natural language processing for improving textual accesibility (NLP4ITA) workshop (LREC 2012) (pp. 1–8). ShardlowMA survey of automated text simplificationInternational Journal of Advanced Computer Science and Applications (IJACSA)2014415870 Caseli, H. M., Pereira, T. F., Specia, L., Pardo, T. A. S., Gasperin, C., & Aluísio, S. (2009). Building a Brazilian Portuguese parallel corpus of original and simplified texts. In Proceedings of CICLing (pp. 59–70). CrossleySAAllenDMcNamaraDSText simplification and comprehensible input: A case for an intuitive approachLanguage Teaching Research20121618910810.1177/1362168811423456 RosenbergSAbbedutoLIndicators of linguistic competence in the peer group conversational behavior of mildly retarded adultsApplied Psycholinguistics198781193210.1017/S0142716400000047 Brouwers, L., Bernhard, D., Ligozat, A. L., & Francois, T. (2014). Syntactic sentence simplification for French. In Proceedings of the 3rd workshop on predicting and improving text readability for target reader populations (PITR), Association for Computational Linguistics, Gothenburg, Sweden (pp. 47–56). RosISantestebanMFukumuraKLakaIAiming at shorter dependencies: The role of agreement morphologyLanguage, Cognition and Neuroscience20153091156117410.1080/23273798.2014.994009 ŠtajnerSDrndarevicBSaggionHCorpus-based sentence deletion and split decisions for Spanish text simplificationComputación y Sistemas2013172251262 GordonPCHendrickRJohnsonMEffects of noun phrase type on sentence complexityJournal of Memory and Language20045119711410.1016/j.jml.2004.02.003 Klerke, S., & Søgaard, A. (2012). DSim, a Danish parallel corpus for text simplification. In N. Calzolari (Conference Chair), K. Choukri, T. Declerck, M. Ugur Dogan, B. Maegaard, J. Mariani, et al. (Eds.),. Proceedings of the eight international conference on language resources and evaluation (LREC’12), European Language Resources Association (ELRA), Istanbul, Turkey (pp. 4015–4018). BenjaminRGReconstructing readability: Recent developments and recommendations in the analysis of text difficultyEducational Psychology Review2012241638810.1007/s10648-011-9181-8 9407_CR3 9407_CR1 S Bott (9407_CR4) 2014; 48 9407_CR6 9407_CR16 9407_CR37 9407_CR18 PC Gordon (9407_CR19) 2004; 51 9407_CR5 JS Chall (9407_CR9) 1995 BJ Grosz (9407_CR20) 1995; 21 9407_CR8 I Laka (9407_CR25) 2010 9407_CR30 X Lu (9407_CR26) 2009; 14 9407_CR10 WH DuBay (9407_CR14) 2004 9407_CR11 A Siddharthan (9407_CR34) 2014; 165 9407_CR13 9407_CR35 T Warren (9407_CR38) 2002; 85 M Zamanian (9407_CR40) 2012; 2 R Gunning (9407_CR21) 1968 SA Crossley (9407_CR12) 2012; 16 S Štajner (9407_CR36) 2013; 17 I Gonzalez-Dios (9407_CR17) 2013; 5 M Carreiras (9407_CR7) 2010; 115 9407_CR29 R Flesch (9407_CR15) 1948; 32 9407_CR28 I Ros (9407_CR31) 2015; 30 RG Benjamin (9407_CR2) 2012; 24 W Xu (9407_CR39) 2015; 3 S Rosenberg (9407_CR32) 1987; 8 9407_CR23 9407_CR22 9407_CR24 M Shardlow (9407_CR33) 2014; 4 WC Mann (9407_CR27) 1988; 8 |
References_xml | – ident: 9407_CR35 – ident: 9407_CR37 – volume: 115 start-page: 79 issue: 1 year: 2010 ident: 9407_CR7 publication-title: Cognition doi: 10.1016/j.cognition.2009.11.012 contributor: fullname: M Carreiras – ident: 9407_CR18 – volume: 85 start-page: 79 issue: 1 year: 2002 ident: 9407_CR38 publication-title: Cognition doi: 10.1016/S0010-0277(02)00087-2 contributor: fullname: T Warren – volume: 8 start-page: 243 issue: 3 year: 1988 ident: 9407_CR27 publication-title: Text contributor: fullname: WC Mann – volume: 5 start-page: 43 issue: 2 year: 2013 ident: 9407_CR17 publication-title: Linguamática contributor: fullname: I Gonzalez-Dios – volume: 16 start-page: 89 issue: 1 year: 2012 ident: 9407_CR12 publication-title: Language Teaching Research doi: 10.1177/1362168811423456 contributor: fullname: SA Crossley – ident: 9407_CR10 – ident: 9407_CR16 – volume: 3 start-page: 283 year: 2015 ident: 9407_CR39 publication-title: Transactions of the Association for Computational Linguistics doi: 10.1162/tacl_a_00139 contributor: fullname: W Xu – ident: 9407_CR5 doi: 10.3115/v1/W14-1206 – ident: 9407_CR24 – volume: 24 start-page: 63 issue: 1 year: 2012 ident: 9407_CR2 publication-title: Educational Psychology Review doi: 10.1007/s10648-011-9181-8 contributor: fullname: RG Benjamin – volume: 17 start-page: 251 issue: 2 year: 2013 ident: 9407_CR36 publication-title: Computación y Sistemas contributor: fullname: S Štajner – volume-title: The principles of readability year: 2004 ident: 9407_CR14 contributor: fullname: WH DuBay – volume: 21 start-page: 203 issue: 2 year: 1995 ident: 9407_CR20 publication-title: Computational Linguistics contributor: fullname: BJ Grosz – volume: 51 start-page: 97 issue: 1 year: 2004 ident: 9407_CR19 publication-title: Journal of Memory and Language doi: 10.1016/j.jml.2004.02.003 contributor: fullname: PC Gordon – ident: 9407_CR8 – volume: 32 start-page: 221 issue: 3 year: 1948 ident: 9407_CR15 publication-title: Journal of Applied Psychology doi: 10.1037/h0057532 contributor: fullname: R Flesch – ident: 9407_CR22 – volume: 2 start-page: 43 issue: 1 year: 2012 ident: 9407_CR40 publication-title: Theory and Practice in Language Studies doi: 10.4304/tpls.2.1.43-53 contributor: fullname: M Zamanian – ident: 9407_CR11 – volume: 30 start-page: 1156 issue: 9 year: 2015 ident: 9407_CR31 publication-title: Language, Cognition and Neuroscience doi: 10.1080/23273798.2014.994009 contributor: fullname: I Ros – volume: 4 start-page: 58 issue: 1 year: 2014 ident: 9407_CR33 publication-title: International Journal of Advanced Computer Science and Applications (IJACSA) contributor: fullname: M Shardlow – volume-title: Readability revisited: The new Dale–Chall readability formula year: 1995 ident: 9407_CR9 contributor: fullname: JS Chall – ident: 9407_CR28 doi: 10.3115/v1/W14-5604 – volume: 165 start-page: 259 issue: 2 year: 2014 ident: 9407_CR34 publication-title: The International Journal of Applied Linguistics doi: 10.1075/itl.165.2.06sid contributor: fullname: A Siddharthan – volume: 48 start-page: 93 issue: 1 year: 2014 ident: 9407_CR4 publication-title: Language Resources and Evaluation doi: 10.1007/s10579-014-9265-4 contributor: fullname: S Bott – ident: 9407_CR13 – volume-title: The technique of clear writing year: 1968 ident: 9407_CR21 contributor: fullname: R Gunning – ident: 9407_CR3 – volume: 14 start-page: 3 issue: 1 year: 2009 ident: 9407_CR26 publication-title: International Journal of Corpus Linguistics doi: 10.1075/ijcl.14.1.02lu contributor: fullname: X Lu – volume-title: Festschrift for Professor Carlos Piera year: 2010 ident: 9407_CR25 contributor: fullname: I Laka – volume: 8 start-page: 19 issue: 1 year: 1987 ident: 9407_CR32 publication-title: Applied Psycholinguistics doi: 10.1017/S0142716400000047 contributor: fullname: S Rosenberg – ident: 9407_CR29 doi: 10.3115/v1/W14-1210 – ident: 9407_CR6 doi: 10.3115/v1/W15-1604 – ident: 9407_CR1 – ident: 9407_CR30 doi: 10.21437/SLaTE.2007-20 – ident: 9407_CR23 |
SSID | ssj0042478 |
Score | 2.2553089 |
Snippet | In this paper we present the corpus of Basque simplified texts. This corpus compiles 227 original sentences of science popularisation domain and two simplified... |
SourceID | proquest crossref springer |
SourceType | Aggregation Database Publisher |
StartPage | 217 |
SubjectTerms | Annotations Basque language Computational Linguistics Computer Science Corpus analysis Corpus linguistics Language and Literature Linguistics Machine learning Sentences Simplified language Social Sciences Texts Translators |
Title | The corpus of Basque simplified texts (CBST) |
URI | https://link.springer.com/article/10.1007/s10579-017-9407-6 https://www.proquest.com/docview/1965735409 |
Volume | 52 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3NS8MwFH_odvHidCpON8lBxK_I2jRNetzmxhQZwibsVtq0ARG6Ybf_35eucVP0oKdCk4bw-vLeL-8T4FwrJtwklhR1a0q9KEiojJmxNkntKE8L5RZNbMdiNJX3fVMmx_00XWRvd9YjWQjqjVw3Lkxoj6ABXkKovw1VVD2cV6Da6T8On6z89VyvkL8OFx5FMDS1vsyfFvmqjdYQ85tXtFA2g9p_trkHuyW0JJ0VL-zDVprVoWbbNpDyFNehVeYqkAtSJiOZn2PHD-AWWYfgpXS-zMlMk26U43ZJ_mpizzUiVmKCRXJy2euOJ1eH8DLoT3pDWjZVoIrxYEE9P2Ii4absWZAaAOJoT7R1rPFNwKQQWjPFlXSE344KvOe4WiEOd1NfMvzwCCrZLEuPgbBYRNqRimuN1zJEeoGfpFFRkYcpp60acG2JG85XtTPCdZVkQ6cQ54aGTqHfgKYlf1geozw05Q6FsUwFDbix9N4Y_m2xkz_NPoUdhEFyFVnWhMrifZm2YDtPlmclb-Fz8DDqPn8AkvbF5g |
link.rule.ids | 315,782,786,27935,27936,41075,42144,48346,48349,48359,49651,49654,49664,52155 |
linkProvider | Springer Nature |
linkToHtml | http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3NT8IwFH8ROOhFFDWioD0Y41eTbe3W7shnMCIXMIFTs3Vr4gWIg__fdqwBjR70unZN89q-93t97_0KcKMkYV4Sc6xta4ppFCaYx8TcNnHlSqqY9PJHbMdsNOXdnqHJIbYWJs92tyHJXFPvFLv5zOT2MBxqLwQHJagYsnNahkprOpt1rQKmHs0VsOszijUamtpg5k-DfDVHW4z5LSyaW5t-9V_zPILDAlyi1mY3HMNeOq9B1T7cgIpzXINmUa2AblFRjmSWx7afwJPePEi7pct1hhYKtaNMzxdl7yb7XGnMiky6SIbuOu3x5P4U3vq9SWeAi2cVsCR-uMI0iAhLfEN8FqYGgriKMkfFSn8JCWdMKSJ9yV0WOFGO-FxPSY3EvTTgRP94BuX5Yp6eAyIxi5TLpa-Udsw01guDJI1yTh4iXUfW4cFKVyw37Bliy5Ns5CR0X2HkJII6NKz8RXGQMmEID5m5mwrr8GjlvdP822AXf-p9DfuDyetQDJ9HL5dwoEER3-SZNaC8-linTShlyfqq2GifoKHIhg |
linkToPdf | http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV3dS8MwED90A_HF6VScbpoHEb_C1qZt0ifZV3UoQ9iEvYU2bcCXbtjt_zdpGzZFH8TXJg3hcsn9krv7HcClFITaccSwsq0JdkI_xiwi-rWJSUs4kgo7L2I7oeMZGww1Tc6DyYXJo92NS7LIadAsTemyvYhleyPxzaU6zodiX91IsLcNVf0qplS82h1NHwNzGDu2kx_GlksdrJDRzDg2fxrkq2la481vLtLc8gS1f895H_ZK0Im6hZYcwFaS1qFmCjqgcn_XoVVmMaArVKYp6WUz7Ydwr5QKqevqYpWhuUS9MFNzR9m7jkqXCssiHUaSoet-bzK9OYK3YDjtP-Gy3AIWxPWX2PFCQmNXE6L5iYYmlnRoR0ZSffEJo1RKIlzBLOp1whwJWrYUCqHbiceI-vEYKuk8TU4AkYiG0mLClVJd2BQG9L04CXOuHiKsjmjArZE0XxSsGnzNn6zlxFVfruXEvQY0zVrwcoNlXBMhUv1m5Tfgzsh-o_m3wU7_1PsCdl4HAX8ZjZ_PYFdhJVaEnzWhsvxYJS3YzuLVealzn47m0R0 |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=The+corpus+of+Basque+simplified+texts+%28CBST%29&rft.jtitle=Language+resources+and+evaluation&rft.au=Gonzalez-Dios%2C+Itziar&rft.au=Aranzabe%2C+Mar%C3%ADa+Jes%C3%BAs&rft.au=D%C3%ADaz+de+Ilarraza%2C+Arantza&rft.date=2018-03-01&rft.pub=Springer+Netherlands&rft.issn=1574-020X&rft.eissn=1574-0218&rft.volume=52&rft.issue=1&rft.spage=217&rft.epage=247&rft_id=info:doi/10.1007%2Fs10579-017-9407-6&rft.externalDocID=10_1007_s10579_017_9407_6 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1574-020X&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1574-020X&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1574-020X&client=summon |