Search Results - "Zeldes, Amir"
1
A Collaborative Ecosystem for Digital Coptic Studies
Published in Journal of data mining and digital humanities (23-09-2020) “…Scholarship on underresourced languages bring with them a variety of challenges which make access to the full spectrum of source materials and their evaluation…”
Journal Article -
2
The GUM corpus: creating multilayer resources in the classroom
Published in Language Resources and Evaluation (01-09-2017) “…This paper presents the methodology, design principles and detailed evaluation of a new freely available multilayer corpus, collected and edited via classroom…”
Journal Article -
3
-
4
Opinion Piece: Can we Fix the Scope for Coreference?: Problems and Solutions for Benchmarks beyond OntoNotes
Published in Dialogue and discourse (15-04-2022) “…Current work on automatic coreference resolution has focused on the OntoNotes benchmark dataset, due to both its size and consistency. However many aspects of…”
Journal Article -
5
Opinion Piece: Can we Fix the Scope for Coreference?
Published in Dialogue and discourse (01-01-2022) “…Current work on automatic coreference resolution has focused on the OntoNotes benchmark dataset, due to both its size and consistency. However many aspects of…”
Journal Article -
6
ANNIS3: A new architecture for generic corpus query and visualization
Published in Digital Scholarship in the Humanities (01-04-2016)
Journal Article -
7
eRST: A Signaled Graph Theory of Discourse Relations and Organization
Published in Computational linguistics - Association for Computational Linguistics (15-11-2024) “…In this article we present Enhanced Rhetorical Structure Theory (eRST), a new theoretical framework for computational discourse analysis, based on an…”
Journal Article -
8
A Collaborative Ecosystem for Digital Coptic Studies
Published in Journal of data mining and digital humanities (01-09-2020) “…Scholarship on underresourced languages bring with them a variety of challenges which make access to the full spectrum of source materials and their evaluation…”
Journal Article -
9
Treebanking user-generated content: a UD based overview of guidelines, corpora and unified recommendations
Published in Language resources and evaluation (01-06-2023) “…This article presents a discussion on the main linguistic phenomena which cause difficulties in the analysis of user-generated texts found on the web and in…”
Journal Article -
10
A Neural Approach to Discourse Relation Signal Detection
Published in Dialogue and discourse (2020) “…Previous data-driven work investigating the types and distributions of discourse relation signals, including discourse markers such as 'however' or phrases…”
Journal Article -
11
RIDGES Herbology: designing a diachronic multi-layer corpus
Published in Language Resources and Evaluation (01-09-2017) “…This paper introduces a multi-layer corpus architecture with multiple tokenizations using the open source historical, diachronic corpus of German called…”
Journal Article -
12
Probabilistic pragmatics and probabilistic experience
Published in Zeitschrift für Sprachwissenschaft (01-06-2016)
Journal Article -
13
Can we Fix the Scope for Coreference? Problems and Solutions for Benchmarks beyond OntoNotes
Published 17-12-2021 “…Current work on automatic coreference resolution has focused on the OntoNotes benchmark dataset, due to both its size and consistency. However many aspects of…”
Journal Article -
14
Unifying the Scope of Bridging Anaphora Types in English: Bridging Annotations in ARRAU and GUM
Published 01-10-2024 “…Comparing bridging annotations across coreference resources is difficult, largely due to a lack of standardization across definitions and annotation schemas…”
Journal Article -
15
<tiger2/> – serialising the ISO SynAF syntactic object model
Published in Language Resources and Evaluation (01-03-2015) “…This paper introduces <tiger2/>, an XML format developed to serialise the object model defined by the ISO Syntactic Annotation Framework SynAF. Based on widespread best…”
Journal Article -
16
GUMsley: Evaluating Entity Salience in Summarization for 12 English Genres
Published 31-01-2024 “…As NLP models become increasingly capable of understanding documents in terms of coherent entities rather than strings, obtaining the most salient entities for…”
Journal Article -
17
CityU corpus of essay drafts of English language learners: a corpus of textual revision in second language writing
Published in Language Resources and Evaluation (01-09-2015) “…Learner corpora consist of texts produced by non-native speakers. In addition to these texts, some learner corpora also contain error annotations, which can…”
Journal Article -
18
Computational Methods for Coptic: Developing and Using Part-of-Speech Tagging for Digital Scholarship in the Humanities
Published in Digital Scholarship in the Humanities (01-12-2015)
Journal Article -
19
Are UD Treebanks Getting More Consistent? A Report Card for English UD
Published 01-02-2023 “…Recent efforts to consolidate guidelines and treebanks in the Universal Dependencies project raise the expectation that joint training and dataset comparison…”
Journal Article -
20
MicroBERT: Effective Training of Low-resource Monolingual BERTs through Parameter Reduction and Multitask Learning
Published 23-12-2022 “…Transformer language models (TLMs) are critical for most NLP tasks, but they are difficult to create for low-resource languages because of how much pretraining…”
Journal Article