Vagueness and referential ambiguity in a large-scale annotated corpus

In this paper, we argue that difficulties in the definition of coreference itself contribute to lower inter-annotator agreement in certain cases. Data from a large referentially annotated corpus serves to corroborate thi
In this paper, we argue that difficulties in the definition of coreference itself contribute to lower inter-annotator agreement in certain cases. Data from a large referentially annotated corpus serves to corroborate this point, using a quantitative investigation to assess which effects or problems are likely to be the most prominent. Several examples where such problems occur are discussed in more detail, and we then propose a generalisation of Poesio, Reyle and Stevenson’s Justified Sloppiness Hypothesis to provide a unified model for these cases of disagreement and argue that a deeper understanding of the phenomena involved allows to tackle problematic cases in a more principled fashion than would be possible using only pre-theoretic intuitions.
show moreshow less

Download full text files

Export metadata

  • Export Bibtex
  • Export RIS

Additional Services

    Share in Twitter Search Google Scholar
Metadaten
Author:Yannick Versley
URN:urn:nbn:de:hebis:30-1128402
Document Type:Preprint
Language:English
Date of Publication (online):2009/05/05
Year of first Publication:2009
Publishing Institution:Univ.-Bibliothek Frankfurt am Main
Release Date:2009/05/05
Source:http://www.versley.de/rolc-preprint.pdf ; (in:) Massimo Poesio / Ron Artstein : Ambiguity in Anaphora : Special Issue of the Journal on Research in Language and Computation
HeBIS PPN:217391427
Dewey Decimal Classification:400 Sprache
Sammlungen:Linguistik
Linguistic-Classification:Linguistik-Klassifikation: Computerlinguistik / Computational linguistics
Licence (German):License Logo Veröffentlichungsvertrag für Publikationen

$Rev: 11761 $