Developing a TT-MCTAG for German with an RCG-based parser

  • Developing linguistic resources, grammars in particular, is known to be a complex task in itself because of, among other things, redundancy and consistency issues. Furthermore, some languages can prove hard to describe because of specific characteristics, e.g. the free word order of German. In this context, we present (i) a framework for describing tree-based grammars, and (ii) a fragment of a core multicomponent tree-adjoining grammar with tree tuples (TT-MCTAG) for German developed using this framework. The framework combines a metagrammar compiler and a parser based on range concatenation grammar (RCG) to check the consistency and the correctness of the grammar, respectively. The German grammar being developed within this framework already handles a wide range of scrambling and extraction phenomena.
Metadata
Author:Laura Kallmeyer, Timm Lichte, Wolfgang Maier, Yannick Parmentier, Johannes Dellert
URN:urn:nbn:de:hebis:30-1110235
URL:http://www.sfs.uni-tuebingen.de/emmy/papers/lrec08-mctag.pdf
ISBN:2-9517408-4-0
ISSN:2522-2686
Document Type:Preprint
Language:English
Year of Completion:2008
Year of first Publication:2008
Publishing Institution:Universitätsbibliothek Johann Christian Senckenberg
Release Date:2008/10/20
Tag:Multicomponent Tree Adjoining Grammar; Range Concatenation Grammar
GND Keyword:German (Deutsch); Syntactic analysis (Syntaktische Analyse)
Page Number:8
Note:
Published in: Nicoletta Calzolari; Khalid Choukri; Bente Maegaard; Joseph Mariani; Jan Odijk; Stelios Piperidis; Daniel Tapias (eds.): Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC-2008), May 28-30, 2008, Marrakech, Morocco. Paris: ELRA, 2008, pp. 782-789, ISBN: 2-9517408-4-0
Source:http://www.sfb441.uni-tuebingen.de/emmy-noether-kallmeyer/papers/lrec08-mctag.pdf , Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC-2008) (Marrakech 2008).
HeBIS-PPN:206692307
Institutes:No department specified / External
Dewey Decimal Classification:4 Language / 40 Language / 400 Language
Collections:Linguistics
Linguistics Classification:Computational linguistics
Licence (German):German copyright law (Deutsches Urheberrecht)