Treebank profiling of spoken and written German
This paper profiles significant differences in syntactic distribution and differences in word class frequencies for two treebanks of spoken and written German: the TüBa-D/S, a treebank of transliterated spontaneous dialogs, and the TüBa-D/Z treebank of newspaper articles published in the German daily newspaper ´die tageszeitung´(taz). The approach can be used more generally as a means of distinguishing and classifying language corpora of different genres.
| Author: | Erhard W. Hinrichs, Sandra Kübler |
|---|---|
| URN: | urn:nbn:de:hebis:30-1111304 |
| Document Type: | Article |
| Language: | German |
| Date of Publication (online): | 03.11.2008 |
| Year of first Publication: | 2005 |
| Publishing Institution: | Univ.-Bibliothek Frankfurt am Main |
| Source: | http://jones.ling.indiana.edu/~skuebler/papers/GermanEstimation.pdf ; Proceedings of the Fourth Workshop on Treebanks and Linguistic Theories - Barcelona, Spain. |
| HeBIS PPN: | 206937660 |
| Dewey Decimal Classification: | 400 Sprache |
| Sammlungen: | Linguistik |
| Linguistik-Klassifikation: | Linguistik-Klassifikation: Computerlinguistik / Computational linguistics |
| Licence (German): | Veröffentlichungsvertrag für Publikationen ohne Print on Demand |





