The corpus of Basque simplified texts (CBST)

In this paper we present the corpus of Basque simplified texts. This corpus compiles 227 original sentences of science popularisation domain and two simplified versions of each sentence. The simplified versions have been created following different approaches: the structural, by a court translator w...

Full description

Saved in:
Bibliographic Details
Published in:Language resources and evaluation Vol. 52; no. 1; pp. 217 - 247
Main Authors: Gonzalez-Dios, Itziar, Aranzabe, María Jesús, Díaz de Ilarraza, Arantza
Format: Journal Article
Language:English
Published: Dordrecht Springer Netherlands 01-03-2018
Springer Nature B.V
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract In this paper we present the corpus of Basque simplified texts. This corpus compiles 227 original sentences of science popularisation domain and two simplified versions of each sentence. The simplified versions have been created following different approaches: the structural, by a court translator who considers easy-to-read guidelines and the intuitive, by a teacher based on her experience. The aim of this corpus is to make a comparative analysis of simplified text. To that end, we also present the annotation scheme we have created to annotate the corpus. The annotation scheme is divided into eight macro-operations: delete, merge, split, transformation, insert, reordering, no operation and other. These macro-operations can be classified into different operations. We also relate our work and results to other languages. This corpus will be used to corroborate the decisions taken and to improve the design of the automatic text simplification system for Basque.
AbstractList In this paper we present the corpus of Basque simplified texts. This corpus compiles 227 original sentences of science popularisation domain and two simplified versions of each sentence. The simplified versions have been created following different approaches: the structural, by a court translator who considers easy-to-read guidelines and the intuitive, by a teacher based on her experience. The aim of this corpus is to make a comparative analysis of simplified text. To that end, we also present the annotation scheme we have created to annotate the corpus. The annotation scheme is divided into eight macro-operations: delete, merge, split, transformation, insert, reordering, no operation and other. These macro-operations can be classified into different operations. We also relate our work and results to other languages. This corpus will be used to corroborate the decisions taken and to improve the design of the automatic text simplification system for Basque.
Author Gonzalez-Dios, Itziar
Aranzabe, María Jesús
Díaz de Ilarraza, Arantza
Author_xml – sequence: 1
  givenname: Itziar
  orcidid: 0000-0003-1048-5403
  surname: Gonzalez-Dios
  fullname: Gonzalez-Dios, Itziar
  email: itziar.gonzalezd@ehu.eus
  organization: Ixa NLP Group, University of the Basque Country (UPV/EHU)
– sequence: 2
  givenname: María Jesús
  orcidid: 0000-0002-0401-1087
  surname: Aranzabe
  fullname: Aranzabe, María Jesús
  organization: Ixa NLP Group, University of the Basque Country (UPV/EHU)
– sequence: 3
  givenname: Arantza
  orcidid: 0000-0003-3317-8561
  surname: Díaz de Ilarraza
  fullname: Díaz de Ilarraza, Arantza
  organization: Ixa NLP Group, University of the Basque Country (UPV/EHU)
BookMark eNp1kE1Lw0AQhhepYFv9Ad4CXhRcnc1-H23wCwoejOBtidtdTWmTuJtA_fduiYgXTzMMzzszPDM0adrGIXRK4IoAyOtIgEuNgUisGUgsDtCUcMkw5ERNfnt4PUKzGNcALGdSTdFl-eEy24ZuiFnrs0UVPweXxXrbbWpfu1XWu10fs_Ni8VxeHKNDX22iO_mpc_Ryd1sWD3j5dP9Y3CyxpVz3mImKyhWnXBHtAAQnnknwbz5NNFVSek8tt4pIARUwnoDcW6Xy3AlFU3COzsa9XWjTO7E363YITTppiBZcUs5AJ4qMlA1tjMF504V6W4UvQ8DspZhRiklSzF6KESmTj5mY2ObdhT-b_w19A4S4Ymc
CitedBy_id crossref_primary_10_3389_fpsyg_2022_707630
crossref_primary_10_3366_word_2020_0172
crossref_primary_10_1017_S1351324918000384
Cites_doi 10.1016/j.cognition.2009.11.012
10.1016/S0010-0277(02)00087-2
10.1177/1362168811423456
10.1162/tacl_a_00139
10.3115/v1/W14-1206
10.1007/s10648-011-9181-8
10.1016/j.jml.2004.02.003
10.1037/h0057532
10.4304/tpls.2.1.43-53
10.1080/23273798.2014.994009
10.3115/v1/W14-5604
10.1075/itl.165.2.06sid
10.1007/s10579-014-9265-4
10.1075/ijcl.14.1.02lu
10.1017/S0142716400000047
10.3115/v1/W14-1210
10.3115/v1/W15-1604
10.21437/SLaTE.2007-20
ContentType Journal Article
Copyright The Author(s) 2017
Language Resources and Evaluation is a copyright of Springer, (2017). All Rights Reserved.
Copyright_xml – notice: The Author(s) 2017
– notice: Language Resources and Evaluation is a copyright of Springer, (2017). All Rights Reserved.
DBID C6C
AAYXX
CITATION
3V.
7SC
7T9
7XB
8AL
8FD
8FE
8FG
8FK
8G5
ABUWG
AFKRA
AIMQZ
ALSLI
ARAPS
AVQMV
AZQEC
BENPR
BGLVJ
CCPQU
CPGLG
CRLPW
DWQXO
GB0
GNUQQ
GUQSH
HCIFZ
JQ2
K50
K7-
L7M
LIQON
L~C
L~D
M0N
M1D
M2O
MBDVC
P5Z
P62
PQEST
PQQKQ
PQUKI
PRINS
Q9U
DOI 10.1007/s10579-017-9407-6
DatabaseName Springer Nature OA Free Journals
CrossRef
ProQuest Central (Corporate)
Computer and Information Systems Abstracts
Linguistics and Language Behavior Abstracts (LLBA)
ProQuest Central (purchase pre-March 2016)
Computing Database (Alumni Edition)
Technology Research Database
ProQuest SciTech Collection
ProQuest Technology Collection
ProQuest Central (Alumni) (purchase pre-March 2016)
Research Library (Alumni Edition)
ProQuest Central (Alumni)
ProQuest Central
ProQuest One Literature
Social Science Premium Collection (Proquest) (PQ_SDU_P3)
Advanced Technologies & Aerospace Collection
Arts Premium Collection
ProQuest Central Essentials
ProQuest Central
Technology Collection
ProQuest One Community College
Linguistics Collection
Linguistics Database
ProQuest Central
DELNET Social Sciences & Humanities Collection
ProQuest Central Student
Research Library Prep
SciTech Premium Collection (Proquest) (PQ_SDU_P3)
ProQuest Computer Science Collection
Art, Design & Architecture Collection (Proquest) (PQ_SDU_P3)
Computer Science Database
Advanced Technologies Database with Aerospace
ProQuest One Literature - U.S. Customers Only
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
Computing Database
ProQuest Arts & Humanities Database
ProQuest Research Library
Research Library (Corporate)
Advanced Technologies & Aerospace Database
ProQuest Advanced Technologies & Aerospace Collection
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
ProQuest Central Basic
DatabaseTitle CrossRef
ProQuest DELNET Social Sciences and Humanities Collection
Research Library Prep
Computer Science Database
ProQuest Central Student
Technology Collection
Technology Research Database
Computer and Information Systems Abstracts – Academic
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
ProQuest Central (Alumni Edition)
SciTech Premium Collection
ProQuest One Community College
Research Library (Alumni Edition)
ProQuest Central China
ProQuest Central
Linguistics Collection
Arts Premium Collection
ProQuest Central Korea
ProQuest Research Library
ProQuest Art, Design and Architecture Collection
Advanced Technologies Database with Aerospace
Advanced Technologies & Aerospace Collection
Social Science Premium Collection
ProQuest Computing
ProQuest One Literature - U.S. Customers Only
ProQuest Central Basic
ProQuest One Literature
ProQuest Computing (Alumni Edition)
ProQuest One Academic Eastern Edition
Linguistics and Language Behavior Abstracts (LLBA)
ProQuest Technology Collection
ProQuest SciTech Collection
Computer and Information Systems Abstracts Professional
Advanced Technologies & Aerospace Database
ProQuest One Academic UKI Edition
Linguistics Database
Arts & Humanities Full Text
ProQuest One Academic
ProQuest Central (Alumni)
DatabaseTitleList ProQuest DELNET Social Sciences and Humanities Collection

DeliveryMethod fulltext_linktorsrc
Discipline Library & Information Science
Computer Science
EISSN 1574-0218
EndPage 247
ExternalDocumentID 10_1007_s10579_017_9407_6
GrantInformation_xml – fundername: Ministerio de Economía y Competitividad
  grantid: TIN2013-46616-C2-1-R
  funderid: http://dx.doi.org/10.13039/501100003329
– fundername: Universidad del País Vasco (UPV/EHU)
  grantid: Grant for the new doctors from the Vice-rectory of Research
– fundername: Eusko Jaurlaritza
  grantid: Ph.D. grant BFI-2011- 392; IT344-10
  funderid: http://dx.doi.org/10.13039/501100003086
GroupedDBID -51
-5C
-5G
-BR
-DZ
-EM
-Y2
-~C
.4H
.4S
.86
.DC
06D
07C
0R~
0VY
199
2.D
203
29L
2J2
2JN
2JY
2KG
2LR
2P1
2VQ
2~H
30V
3EH
3V.
4.4
406
408
409
40E
5GY
5VS
67Z
6NX
78A
8FE
8FG
8G5
8TC
8UJ
95-
95.
95~
96X
AAAVM
AABHQ
AABYN
AAFGU
AAGAY
AAGJQ
AAHNG
AAIAL
AAJKR
AANTL
AANZL
AAPBV
AARHV
AARTL
AATNV
AATVU
AAUYE
AAWCG
AAXYU
AAYFA
AAYIU
AAYOK
AAYQN
AAYTO
ABBBX
ABBHK
ABBXA
ABDZT
ABECU
ABECW
ABFGW
ABFTV
ABHLI
ABHQN
ABJNI
ABJOX
ABKAS
ABKCH
ABKTR
ABLJU
ABMNI
ABMQK
ABNWP
ABPTK
ABQBU
ABSXP
ABTEG
ABTHY
ABTKH
ABTMW
ABULA
ABUWG
ABWNU
ABXPI
ACBMV
ACBRV
ACBXY
ACBYP
ACGFO
ACGFS
ACHSB
ACHXU
ACIGE
ACIPQ
ACKNC
ACMDZ
ACMLO
ACNXV
ACOKC
ACOMO
ACREN
ACTTH
ACVWB
ACVYN
ACWMK
ADHIR
ADINQ
ADKNI
ADKPE
ADMDM
ADOXG
ADPTO
ADRFC
ADSWE
ADTPH
ADULT
ADURQ
ADYFF
ADYOE
ADZKW
AEBTG
AEEQQ
AEFTE
AEGAL
AEGNC
AEJHL
AEJRE
AEKMD
AENEX
AEOHA
AEPYU
AESKC
AESTI
AETLH
AEUPB
AEVLU
AEVTX
AEXYK
AFEXP
AFFNX
AFGCZ
AFKRA
AFLOW
AFNRJ
AFQWF
AFWTZ
AFYQB
AFZKB
AGAYW
AGDGC
AGGBP
AGHSJ
AGJBK
AGMZJ
AGQMX
AGWIL
AGWZB
AGYKE
AHAVH
AHBYD
AHKAY
AHSBF
AHYZX
AIAKS
AIIXL
AILAN
AIMQZ
AIMYW
AITGF
AJBLW
AJDOV
AJRNO
AJZVZ
AKQUC
ALMA_UNASSIGNED_HOLDINGS
ALSLI
ALWAN
AMKLP
AMTXH
AMXSW
AMYLF
AOCGG
ARAPS
ARCSS
ARMRJ
AVQMV
AXYYD
AYQZM
AZFZN
AZQEC
AZRUE
B-.
BA0
BDATZ
BENPR
BGLVJ
BGNMA
BHNFS
BPHCQ
C6C
CAG
CCPQU
COF
CPGLG
CRLPW
CS3
CSCUP
DDRTE
DL5
DNIVK
DPUIP
DWQXO
EBLON
EBS
EDO
EHI
EIOEI
EJD
ESBYG
FEDTE
FERAY
FFXSO
FIGPU
FINBP
FNLPD
FRRFC
FSGXE
FWDCC
GB0
GGCAI
GGRSB
GJIRD
GNUQQ
GNWQR
GPZZG
GQ6
GQ7
GQ8
GUQSH
GXS
HCIFZ
HF~
HG5
HG6
HLICF
HMHOC
HMJXF
HQYDN
HRMNR
HVGLF
HZ~
I-F
I09
IHE
IJ-
IKXTQ
ITM
IWAJR
IXC
IZIGR
IZQ
I~X
I~Z
J-C
J0Z
JAAYA
JAB
JBMMH
JBSCW
JCJTX
JENOY
JHFFW
JKQEH
JLEZI
JLXEF
JPL
JSODD
JST
JZLTJ
K50
K6V
K7-
KDC
KOV
LIQON
LLZTM
M0N
M1D
M2O
M4Y
MA-
MQGED
N2Q
NB0
NDZJH
NF0
NPVJJ
NQJWS
NU0
O9-
O93
O9G
O9I
O9J
OAM
P-O
P19
P62
P9Q
PF-
PQQKQ
PROAC
PT4
Q2X
QF4
QN3
QN7
QOS
R89
R9I
RHV
RIG
ROL
RPX
RSV
S16
S1Z
S26
S27
S28
S3B
SA0
SAP
SCLPG
SDA
SDH
SDM
SHS
SHX
SISQX
SJYHP
SNE
SNPRN
SNX
SOHCF
SOJ
SPISZ
SRMVM
SSLCW
STPWE
SZN
T13
T16
TN5
TSG
TSK
TSV
TUC
TUS
U2A
UG4
UNUBA
UOJIU
UTJUX
UZXMN
VC2
VFIZW
VQA
W23
W48
WK8
YLTOR
Z45
Z7X
Z83
Z88
Z8R
Z8W
Z92
ZMTXR
ZWUKE
~EX
AACDK
AAEOY
AAJBT
AASML
AAYXX
ABAKF
ABXSQ
ACAOD
ACDTI
ACZOJ
ADACV
AEFQL
AEMSY
AFBBN
AGQEE
AGRTI
AGZLP
AHEXP
AIGIU
CITATION
H13
IPSME
7SC
7T9
7XB
8AL
8FD
8FK
AAHCP
AAYZH
JQ2
L7M
L~C
L~D
MBDVC
PQEST
PQUKI
PRINS
Q9U
ID FETCH-LOGICAL-c359t-46a37d535819e00651f470fbf35893877ff3c5c81760a0456512fc8822e683d53
IEDL.DBID AEJHL
ISSN 1574-020X
IngestDate Tue Nov 19 05:37:04 EST 2024
Thu Sep 26 21:41:02 EDT 2024
Sat Dec 16 12:00:07 EST 2023
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords Basque
Monolingual parallel corpora
Text simplification
Annotation scheme
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c359t-46a37d535819e00651f470fbf35893877ff3c5c81760a0456512fc8822e683d53
ORCID 0000-0002-0401-1087
0000-0003-3317-8561
0000-0003-1048-5403
OpenAccessLink http://link.springer.com/10.1007/s10579-017-9407-6
PQID 1965735409
PQPubID 28740
PageCount 31
ParticipantIDs proquest_journals_1965735409
crossref_primary_10_1007_s10579_017_9407_6
springer_journals_10_1007_s10579_017_9407_6
PublicationCentury 2000
PublicationDate 2018-03-01
PublicationDateYYYYMMDD 2018-03-01
PublicationDate_xml – month: 03
  year: 2018
  text: 2018-03-01
  day: 01
PublicationDecade 2010
PublicationPlace Dordrecht
PublicationPlace_xml – name: Dordrecht
– name: Dordrect
PublicationTitle Language resources and evaluation
PublicationTitleAbbrev Lang Resources & Evaluation
PublicationYear 2018
Publisher Springer Netherlands
Springer Nature B.V
Publisher_xml – name: Springer Netherlands
– name: Springer Nature B.V
References Gonzalez-DiosIAranzabeMJDíaz de IlarrazaATestuen sinplifikazio automatikoa: arloaren egungo egoera [Automatic text simplification: State of art]Linguamática2013524363
Hancke, J., Vajjala, S., & Meurers, D. (2012). Readability classification for German using lexical, syntactic, and morphological features. In Proceedings of COLING 2012, the 24th international conference on computational linguistics: Technical papers (pp. 1063–1080).
Stenetorp, P., Pyysalo, S., Topic, G., Ohta, T., Ananiadou, S., & Tsujii, J. (2012). BRAT: A web-based tool for NLP-assisted text annotation. In Proceedings of the demonstrations session at EACL 2012.
Petersen, S. E., & Ostendorf, M. (2007). Text simplification for language learners: A corpus analysis. In Proceedings of workshop on speech and language technology for education. SLaTE, Citeseer (pp. 69–72).
Brunato, D., Dell’Orletta, F., Venturi, G., & Montemagni, S. (2015). Design and annotation of the first Italian corpus for text simplification. In The 9th linguistic annotation workshop held in conjunction with NAACL 2015.
Dell’Orletta, F., Montemagni, S., & Venturi, G. (2011). READ-IT: Assessing readability of Italian texts with a view to text simplification. In Proceedings of the second workshop on speech and language processing for assistive technologies, Association for Computational Linguistics, Stroudsburg, PA, USA, SLPAT ‘11 (pp. 73–83).
GroszBJWeinsteinSJoshiAKCentering: A framework for modeling the local coherence of discourseComputational Linguistics1995212203225
ChallJSDaleEReadability revisited: The new Dale–Chall readability formula1995NorthamptonBrookline Books
Mitkov, R., & Štajner, S. (2014). The fewer, the better? A contrastive study about ways to simplify. In Proceedings of the workshop on automatic text simplification-methods and applications in the multilingual society (ATS-MA 2014), Association for Computational Linguistics and Dublin University (pp. 30–40).
GunningRThe technique of clear writing1968New YorkMcGraw-Hill
XuWCallison-BurchCNapolesCProblems in current text simplification research: New data can helpTransactions of the Association for Computational Linguistics20153283297
Covington, M. A., He, C., Brown, C., Naçi, L., & Brown, J. (2006). How complex is that sentence? A proposed revision of the Rosenberg and Abbeduto D-Level Scale. CASPR Research Report 2006-01. Athens, GA: The University of Georgia, Artificial Intelligence Center.
Gonzalez-Dios, I., Aranzabe, M. J., Díaz de Ilarraza, A., & Salaberri, H. (2014). Simple or complex? Assessing the readability of Basque texts. In Proceedings of COLING 2014, the 25th international conference on computational linguistics: Technical papers (pp. 334–344).
Štajner, S. (2015). New data-driven approaches to text simplification. PhD Thesis, University of Wolverhampton.
Gonzalez-Dios, I. (2016). Euskarazko egitura konplexuen analisirako eta testuen sinplifikazio automatikorako proposamena/Readability assessment and automatic text simplification. The analysis of Basque complex structures. PhD Thesis, University of the Basque Country (UPV/EHU).
CarreirasMDuñabeitiaJAVergaraMde la Cruz-PavíaILakaISubject relative clauses are not universally easier to process: Evidence from BasqueCognition20101151799210.1016/j.cognition.2009.11.012
LuXAutomatic measurement of syntactic complexity in child language acquisitionInternational Journal of Corpus Linguistics200914132810.1075/ijcl.14.1.02lu
WarrenTGibsonEThe influence of referential processing on sentence complexityCognition20028517911210.1016/S0010-0277(02)00087-2
ZamanianMHeydariPReadability of texts: State of the artTheory and Practice in Language Studies201221435310.4304/tpls.2.1.43-53
DuBayWHThe principles of readability2004Costa Mesa, CAImpact Information
Pellow, D., & Eskenazi, M. (2014). An open corpus of everyday documents for simplification tasks. In Proceedings of the 3rd workshop on predicting and improving text readability for target reader populations (PITR), Association for Computational Linguistics, Gothenburg, Sweden (pp. 84–93).
Coster, W., & Kauchak, D. (2011). Simple English Wikipedia: A new text simplification task. In Proceedings of the 49th annual meeting of the Association for Computational Linguistics: Human language technologies: Short papers (Vol. 2, pp. 665–669).
Klaper, D., Ebling, S., & Volk, M. (2013). Building a German/simple German parallel corpus for automatic text simplification. In Proceedings of the second workshop on predicting and improving text readability for target reader populations, Association for Computational Linguistics, Sofia, Bulgaria (pp. 11–19).
LakaIErdoziaKTorregoELinearization references given “Free Word Order”; Subject preferences given ergativity: A look at BasqueFestschrift for Professor Carlos Piera2010OxfordOxford University Press
SiddharthanAA survey of research on text simplificationThe International Journal of Applied Linguistics20141652259298
Bott, S., & Saggion, H. (2011). An unsupervised alignment algorithm for text simplification corpus construction. In Proceedings of the workshop on monolingual text-to-text generation, Association for Computational Linguistics, Stroudsburg, PA, USA, MTTG ‘11 (pp. 20–26).
FleschRA new readability yardstickJournal of Applied Psychology194832322123310.1037/h0057532
MannWCThompsonSARhetorical structure theory: Toward a functional theory of text organizationText19888324328110.1515/text.1.1988.8.3.243
BottSSaggionHText simplification resources for SpanishLanguage Resources and Evaluation20144819312010.1007/s10579-014-9265-4
Aranzabe, M. J., Díaz de Ilarraza, A., & Gonzalez-Dios, I. (2012). First approach to automatic text simplification in basque. In L. Rello, & H. Saggion (Eds.), Proceedings of the natural language processing for improving textual accesibility (NLP4ITA) workshop (LREC 2012) (pp. 1–8).
ShardlowMA survey of automated text simplificationInternational Journal of Advanced Computer Science and Applications (IJACSA)2014415870
Caseli, H. M., Pereira, T. F., Specia, L., Pardo, T. A. S., Gasperin, C., & Aluísio, S. (2009). Building a Brazilian Portuguese parallel corpus of original and simplified texts. In Proceedings of CICLing (pp. 59–70).
CrossleySAAllenDMcNamaraDSText simplification and comprehensible input: A case for an intuitive approachLanguage Teaching Research20121618910810.1177/1362168811423456
RosenbergSAbbedutoLIndicators of linguistic competence in the peer group conversational behavior of mildly retarded adultsApplied Psycholinguistics198781193210.1017/S0142716400000047
Brouwers, L., Bernhard, D., Ligozat, A. L., & Francois, T. (2014). Syntactic sentence simplification for French. In Proceedings of the 3rd workshop on predicting and improving text readability for target reader populations (PITR), Association for Computational Linguistics, Gothenburg, Sweden (pp. 47–56).
RosISantestebanMFukumuraKLakaIAiming at shorter dependencies: The role of agreement morphologyLanguage, Cognition and Neuroscience20153091156117410.1080/23273798.2014.994009
ŠtajnerSDrndarevicBSaggionHCorpus-based sentence deletion and split decisions for Spanish text simplificationComputación y Sistemas2013172251262
GordonPCHendrickRJohnsonMEffects of noun phrase type on sentence complexityJournal of Memory and Language20045119711410.1016/j.jml.2004.02.003
Klerke, S., & Søgaard, A. (2012). DSim, a Danish parallel corpus for text simplification. In N. Calzolari (Conference Chair), K. Choukri, T. Declerck, M. Ugur Dogan, B. Maegaard, J. Mariani, et al. (Eds.),. Proceedings of the eight international conference on language resources and evaluation (LREC’12), European Language Resources Association (ELRA), Istanbul, Turkey (pp. 4015–4018).
BenjaminRGReconstructing readability: Recent developments and recommendations in the analysis of text difficultyEducational Psychology Review2012241638810.1007/s10648-011-9181-8
9407_CR3
9407_CR1
S Bott (9407_CR4) 2014; 48
9407_CR6
9407_CR16
9407_CR37
9407_CR18
PC Gordon (9407_CR19) 2004; 51
9407_CR5
JS Chall (9407_CR9) 1995
BJ Grosz (9407_CR20) 1995; 21
9407_CR8
I Laka (9407_CR25) 2010
9407_CR30
X Lu (9407_CR26) 2009; 14
9407_CR10
WH DuBay (9407_CR14) 2004
9407_CR11
A Siddharthan (9407_CR34) 2014; 165
9407_CR13
9407_CR35
T Warren (9407_CR38) 2002; 85
M Zamanian (9407_CR40) 2012; 2
R Gunning (9407_CR21) 1968
SA Crossley (9407_CR12) 2012; 16
S Štajner (9407_CR36) 2013; 17
I Gonzalez-Dios (9407_CR17) 2013; 5
M Carreiras (9407_CR7) 2010; 115
9407_CR29
R Flesch (9407_CR15) 1948; 32
9407_CR28
I Ros (9407_CR31) 2015; 30
RG Benjamin (9407_CR2) 2012; 24
W Xu (9407_CR39) 2015; 3
S Rosenberg (9407_CR32) 1987; 8
9407_CR23
9407_CR22
9407_CR24
M Shardlow (9407_CR33) 2014; 4
WC Mann (9407_CR27) 1988; 8
References_xml – ident: 9407_CR35
– ident: 9407_CR37
– volume: 115
  start-page: 79
  issue: 1
  year: 2010
  ident: 9407_CR7
  publication-title: Cognition
  doi: 10.1016/j.cognition.2009.11.012
  contributor:
    fullname: M Carreiras
– ident: 9407_CR18
– volume: 85
  start-page: 79
  issue: 1
  year: 2002
  ident: 9407_CR38
  publication-title: Cognition
  doi: 10.1016/S0010-0277(02)00087-2
  contributor:
    fullname: T Warren
– volume: 8
  start-page: 243
  issue: 3
  year: 1988
  ident: 9407_CR27
  publication-title: Text
  contributor:
    fullname: WC Mann
– volume: 5
  start-page: 43
  issue: 2
  year: 2013
  ident: 9407_CR17
  publication-title: Linguamática
  contributor:
    fullname: I Gonzalez-Dios
– volume: 16
  start-page: 89
  issue: 1
  year: 2012
  ident: 9407_CR12
  publication-title: Language Teaching Research
  doi: 10.1177/1362168811423456
  contributor:
    fullname: SA Crossley
– ident: 9407_CR10
– ident: 9407_CR16
– volume: 3
  start-page: 283
  year: 2015
  ident: 9407_CR39
  publication-title: Transactions of the Association for Computational Linguistics
  doi: 10.1162/tacl_a_00139
  contributor:
    fullname: W Xu
– ident: 9407_CR5
  doi: 10.3115/v1/W14-1206
– ident: 9407_CR24
– volume: 24
  start-page: 63
  issue: 1
  year: 2012
  ident: 9407_CR2
  publication-title: Educational Psychology Review
  doi: 10.1007/s10648-011-9181-8
  contributor:
    fullname: RG Benjamin
– volume: 17
  start-page: 251
  issue: 2
  year: 2013
  ident: 9407_CR36
  publication-title: Computación y Sistemas
  contributor:
    fullname: S Štajner
– volume-title: The principles of readability
  year: 2004
  ident: 9407_CR14
  contributor:
    fullname: WH DuBay
– volume: 21
  start-page: 203
  issue: 2
  year: 1995
  ident: 9407_CR20
  publication-title: Computational Linguistics
  contributor:
    fullname: BJ Grosz
– volume: 51
  start-page: 97
  issue: 1
  year: 2004
  ident: 9407_CR19
  publication-title: Journal of Memory and Language
  doi: 10.1016/j.jml.2004.02.003
  contributor:
    fullname: PC Gordon
– ident: 9407_CR8
– volume: 32
  start-page: 221
  issue: 3
  year: 1948
  ident: 9407_CR15
  publication-title: Journal of Applied Psychology
  doi: 10.1037/h0057532
  contributor:
    fullname: R Flesch
– ident: 9407_CR22
– volume: 2
  start-page: 43
  issue: 1
  year: 2012
  ident: 9407_CR40
  publication-title: Theory and Practice in Language Studies
  doi: 10.4304/tpls.2.1.43-53
  contributor:
    fullname: M Zamanian
– ident: 9407_CR11
– volume: 30
  start-page: 1156
  issue: 9
  year: 2015
  ident: 9407_CR31
  publication-title: Language, Cognition and Neuroscience
  doi: 10.1080/23273798.2014.994009
  contributor:
    fullname: I Ros
– volume: 4
  start-page: 58
  issue: 1
  year: 2014
  ident: 9407_CR33
  publication-title: International Journal of Advanced Computer Science and Applications (IJACSA)
  contributor:
    fullname: M Shardlow
– volume-title: Readability revisited: The new Dale–Chall readability formula
  year: 1995
  ident: 9407_CR9
  contributor:
    fullname: JS Chall
– ident: 9407_CR28
  doi: 10.3115/v1/W14-5604
– volume: 165
  start-page: 259
  issue: 2
  year: 2014
  ident: 9407_CR34
  publication-title: The International Journal of Applied Linguistics
  doi: 10.1075/itl.165.2.06sid
  contributor:
    fullname: A Siddharthan
– volume: 48
  start-page: 93
  issue: 1
  year: 2014
  ident: 9407_CR4
  publication-title: Language Resources and Evaluation
  doi: 10.1007/s10579-014-9265-4
  contributor:
    fullname: S Bott
– ident: 9407_CR13
– volume-title: The technique of clear writing
  year: 1968
  ident: 9407_CR21
  contributor:
    fullname: R Gunning
– ident: 9407_CR3
– volume: 14
  start-page: 3
  issue: 1
  year: 2009
  ident: 9407_CR26
  publication-title: International Journal of Corpus Linguistics
  doi: 10.1075/ijcl.14.1.02lu
  contributor:
    fullname: X Lu
– volume-title: Festschrift for Professor Carlos Piera
  year: 2010
  ident: 9407_CR25
  contributor:
    fullname: I Laka
– volume: 8
  start-page: 19
  issue: 1
  year: 1987
  ident: 9407_CR32
  publication-title: Applied Psycholinguistics
  doi: 10.1017/S0142716400000047
  contributor:
    fullname: S Rosenberg
– ident: 9407_CR29
  doi: 10.3115/v1/W14-1210
– ident: 9407_CR6
  doi: 10.3115/v1/W15-1604
– ident: 9407_CR1
– ident: 9407_CR30
  doi: 10.21437/SLaTE.2007-20
– ident: 9407_CR23
SSID ssj0042478
Score 2.2553089
Snippet In this paper we present the corpus of Basque simplified texts. This corpus compiles 227 original sentences of science popularisation domain and two simplified...
SourceID proquest
crossref
springer
SourceType Aggregation Database
Publisher
StartPage 217
SubjectTerms Annotations
Basque language
Computational Linguistics
Computer Science
Corpus analysis
Corpus linguistics
Language and Literature
Linguistics
Machine learning
Sentences
Simplified language
Social Sciences
Texts
Translators
Title The corpus of Basque simplified texts (CBST)
URI https://link.springer.com/article/10.1007/s10579-017-9407-6
https://www.proquest.com/docview/1965735409
Volume 52
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3NS8MwFH_odvHidCpON8lBxK_I2jRNetzmxhQZwibsVtq0ARG6Ybf_35eucVP0oKdCk4bw-vLeL-8T4FwrJtwklhR1a0q9KEiojJmxNkntKE8L5RZNbMdiNJX3fVMmx_00XWRvd9YjWQjqjVw3Lkxoj6ABXkKovw1VVD2cV6Da6T8On6z89VyvkL8OFx5FMDS1vsyfFvmqjdYQ85tXtFA2g9p_trkHuyW0JJ0VL-zDVprVoWbbNpDyFNehVeYqkAtSJiOZn2PHD-AWWYfgpXS-zMlMk26U43ZJ_mpizzUiVmKCRXJy2euOJ1eH8DLoT3pDWjZVoIrxYEE9P2Ii4absWZAaAOJoT7R1rPFNwKQQWjPFlXSE344KvOe4WiEOd1NfMvzwCCrZLEuPgbBYRNqRimuN1zJEeoGfpFFRkYcpp60acG2JG85XtTPCdZVkQ6cQ54aGTqHfgKYlf1geozw05Q6FsUwFDbix9N4Y_m2xkz_NPoUdhEFyFVnWhMrifZm2YDtPlmclb-Fz8DDqPn8AkvbF5g
link.rule.ids 315,782,786,27935,27936,41075,42144,48346,48349,48359,49651,49654,49664,52155
linkProvider Springer Nature
linkToHtml http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3NT8IwFH8ROOhFFDWioD0Y41eTbe3W7shnMCIXMIFTs3Vr4gWIg__fdqwBjR70unZN89q-93t97_0KcKMkYV4Sc6xta4ppFCaYx8TcNnHlSqqY9PJHbMdsNOXdnqHJIbYWJs92tyHJXFPvFLv5zOT2MBxqLwQHJagYsnNahkprOpt1rQKmHs0VsOszijUamtpg5k-DfDVHW4z5LSyaW5t-9V_zPILDAlyi1mY3HMNeOq9B1T7cgIpzXINmUa2AblFRjmSWx7afwJPePEi7pct1hhYKtaNMzxdl7yb7XGnMiky6SIbuOu3x5P4U3vq9SWeAi2cVsCR-uMI0iAhLfEN8FqYGgriKMkfFSn8JCWdMKSJ9yV0WOFGO-FxPSY3EvTTgRP94BuX5Yp6eAyIxi5TLpa-Udsw01guDJI1yTh4iXUfW4cFKVyw37Bliy5Ns5CR0X2HkJII6NKz8RXGQMmEID5m5mwrr8GjlvdP822AXf-p9DfuDyetQDJ9HL5dwoEER3-SZNaC8-linTShlyfqq2GifoKHIhg
linkToPdf http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV3dS8MwED90A_HF6VScbpoHEb_C1qZt0ifZV3UoQ9iEvYU2bcCXbtjt_zdpGzZFH8TXJg3hcsn9krv7HcClFITaccSwsq0JdkI_xiwi-rWJSUs4kgo7L2I7oeMZGww1Tc6DyYXJo92NS7LIadAsTemyvYhleyPxzaU6zodiX91IsLcNVf0qplS82h1NHwNzGDu2kx_GlksdrJDRzDg2fxrkq2la481vLtLc8gS1f895H_ZK0Im6hZYcwFaS1qFmCjqgcn_XoVVmMaArVKYp6WUz7Ydwr5QKqevqYpWhuUS9MFNzR9m7jkqXCssiHUaSoet-bzK9OYK3YDjtP-Gy3AIWxPWX2PFCQmNXE6L5iYYmlnRoR0ZSffEJo1RKIlzBLOp1whwJWrYUCqHbiceI-vEYKuk8TU4AkYiG0mLClVJd2BQG9L04CXOuHiKsjmjArZE0XxSsGnzNn6zlxFVfruXEvQY0zVrwcoNlXBMhUv1m5Tfgzsh-o_m3wU7_1PsCdl4HAX8ZjZ_PYFdhJVaEnzWhsvxYJS3YzuLVealzn47m0R0
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=The+corpus+of+Basque+simplified+texts+%28CBST%29&rft.jtitle=Language+resources+and+evaluation&rft.au=Gonzalez-Dios%2C+Itziar&rft.au=Aranzabe%2C+Mar%C3%ADa+Jes%C3%BAs&rft.au=D%C3%ADaz+de+Ilarraza%2C+Arantza&rft.date=2018-03-01&rft.pub=Springer+Netherlands&rft.issn=1574-020X&rft.eissn=1574-0218&rft.volume=52&rft.issue=1&rft.spage=217&rft.epage=247&rft_id=info:doi/10.1007%2Fs10579-017-9407-6&rft.externalDocID=10_1007_s10579_017_9407_6
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1574-020X&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1574-020X&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1574-020X&client=summon