From phrase structure to dependencies, and back
- Transforming constituent-based annotation into dependency-based annotation has been shown to work for different treebanks and annotation schemes (e.g. Lin (1995) has transformed the Penn treebank, and Kübler and Telljohann (2002) the Tübinger Baumbank des Deutschen (TüBa-D/Z)). These ventures are usually triggered by the conflict between theory-neutral annotation, that targets most needs of a wider audience, and theory-specific annotation, that provides more fine-grained information for a smaller audience. As a compromise, it has been pointed out that treebanks can be designed to support more than one theory from the start (Nivre, 2003). We argue that information can also be added to an existing annotation scheme so that it supports additional theory-specific annotations. We also argue that such a transformation is useful for improving and extending the original annotation scheme with respect to both ambiguous annotation and annotation errors. We show this by analysing problems that arise when generating dependency information from the constituent-based TüBa-D/Z.
Author: | Tylman Ule, Sandra KüblerORCiDGND |
---|---|
URN: | urn:nbn:de:hebis:30-1110570 |
URL: | http://cl.indiana.edu/~skuebler/papers/uk04evid.pdf |
Document Type: | Article |
Language: | English |
Year of Completion: | 2004 |
Year of first Publication: | 2004 |
Publishing Institution: | Universitätsbibliothek Johann Christian Senckenberg |
Release Date: | 2008/10/21 |
GND Keyword: | Satzanalyse |
Page Number: | 2 |
First Page: | 1 |
Last Page: | 2 |
Note: | Erschienen in: Proceedings of the International Conference on Linguistic Evidence, Tübingen 2004 |
Source: | http://jones.ling.indiana.edu/~skuebler/papers/uk04evid.pdf ; Proceedings of the International Conference on Linguistic Evidence (Tübingen 2004). |
HeBIS-PPN: | 206763263 |
Institutes: | keine Angabe Fachbereich / Extern |
Dewey Decimal Classification: | 4 Sprache / 40 Sprache / 400 Sprache |
Sammlungen: | Linguistik |
Linguistik-Klassifikation: | Linguistik-Klassifikation: Computerlinguistik / Computational linguistics |
Licence (German): | Deutsches Urheberrecht |