What linguists always wanted to know about German and did not know how to estimate

  • This paper profiles significant differences in syntactic distribution and in word class frequencies for two treebanks of spoken and written German: the TüBa-D/S, a treebank of transliterated spontaneous dialogues, and the TüBa-D/Z, a treebank of articles published in the German daily newspaper die tageszeitung (taz). The approach can be used more generally as a means of distinguishing and classifying language corpora of different genres.
Metadata
Author:Erhard Hinrichs, Sandra Kübler
URN:urn:nbn:de:hebis:30-1111319
URL:http://cl.indiana.edu/~skuebler/papers/karlsson.pdf
ISSN:1456-8438
ISSN:0785-3157
ISSN:1796-279X
Parent Title (English):SKY journal of linguistics
Publisher:Suomen Kielitieteellinen Yhdistys
Place of publication:Helsinki
Document Type:Article
Language:English
Year of Completion:2006
Year of first Publication:2006
Publishing Institution:Universitätsbibliothek Johann Christian Senckenberg
Release Date:2008/11/03
Volume:19
Issue:special supplement
Page Number:10
First Page:24
Last Page:33
Source:http://jones.ling.indiana.edu/~skuebler/papers/karlsson.pdf ; Special Supplement to SKY Journal of Linguistics 19.
HeBIS-PPN:206938268
Institutes:Department not specified / External
Dewey Decimal Classification:4 Language / 40 Language / 400 Language
Collections:Linguistics
Linguistics Classification:Computational linguistics
Licence (German):German copyright law (Deutsches Urheberrecht)