Vagueness and referential ambiguity in a large-scale annotated corpus
In this paper, we argue that difficulties in the definition of coreference itself contribute to lower inter-annotator agreement in certain cases. Data from a large referentially annotated corpus corroborate this point, and a quantitative investigation assesses which effects or problems are likely to be the most prominent. Several examples where such problems occur are discussed in more detail. We then propose a generalisation of Poesio, Reyle and Stevenson’s Justified Sloppiness Hypothesis to provide a unified model for these cases of disagreement, and argue that a deeper understanding of the phenomena involved allows problematic cases to be tackled in a more principled fashion than would be possible using only pre-theoretic intuitions.
Author: | Yannick Versley |
---|---|
URN: | urn:nbn:de:hebis:30-1128402 |
URL: | http://www.versley.de/rolc-preprint.pdf |
DOI: | https://doi.org/10.1007/s11168-008-9059-1 |
ISSN: | 1572-8706 |
ISSN: | 1570-7075 |
Document Type: | Preprint |
Language: | English |
Year of Completion: | 2009 |
Year of first Publication: | 2009 |
Publishing Institution: | Universitätsbibliothek Johann Christian Senckenberg |
Release Date: | 2009/05/05 |
Tag: | Coreference annotation; Sloppiness; Vagueness |
Page Number: | 24 |
First Page: | 1 |
Last Page: | 24 |
Note: | Published in: Research on Language and Computation, 6 (2008), no. 3–4, pp. 333–353, doi:10.1007/s11168-008-9059-1 |
Source: | http://www.versley.de/rolc-preprint.pdf ; in: Massimo Poesio / Ron Artstein: Ambiguity in Anaphora: Special Issue of the Journal on Research in Language and Computation |
HeBIS-PPN: | 217391427 |
Institutes: | No department specified / External |
Dewey Decimal Classification: | 4 Language / 40 Language / 400 Language |
Collections: | Linguistics |
Linguistics Classification: | Computational linguistics |
Licence (German): | German copyright law (Deutsches Urheberrecht) |