Heterogeneity and standardization in data, use, and annotation : a diachronic corpus of German

  • This paper describes the standardization problems that come up in a diachronic corpus: it has to cope with differing standards with regard to diplomaticity, annotation, and header information. Such highly heterogeneous texts must be standardized to allow for comparative research without (too much) loss of information.

Export metadata

Additional Services

Share in Twitter Search Google Scholar
Metadaten
Author:Anke LüdelingGND
URN:urn:nbn:de:hebis:30-1112337
URL:http://www.sfb632.uni-potsdam.de/publications/isis02_3luedeling.pdf
ISBN:978-3-937786-48-3
Parent Title (German):Heterogeneity in focus: creating and using linguistic databases / Dipper, Stefanie, M. Götze and M. Stede (eds.) ; Working Papers of the SFB 632, Interdisciplinary studies on information structure ; Vol. 2
Publisher:Univ.-Verl.
Place of publication:Potsdam
Document Type:Part of a Book
Language:English
Year of Completion:2005
Year of first Publication:2005
Publishing Institution:Universitätsbibliothek Johann Christian Senckenberg
Release Date:2008/11/06
Page Number:12
First Page:43
Last Page:54
Source:S. Dipper / M. Götze / M. Stede : Heterogeneity in Focus : Creating and Using Linguistic Databases, Interdisciplinary Studies on Information Structure (ISIS) 2, 2005, S. 43-54 ; http://www.sfb632.uni-potsdam.de/publications/isis02_3luedeling.pdf
HeBIS-PPN:207711070
Dewey Decimal Classification:4 Sprache / 40 Sprache / 400 Sprache
Sammlungen:Linguistik
Licence (German):License LogoDeutsches Urheberrecht