Part of a Book
Refine
Year of publication
- 2005 (1)
Document Type
- Part of a Book (1) (remove)
Language
- English (1)
Has Fulltext
- yes (1)
Is part of the Bibliography
- no (1)
Heterogeneity and standardization in data, use, and annotation : a diachronic corpus of German
(2005)
This paper describes the standardization problems that come up in a diachronic corpus: it has to cope with differing standards with regard to diplomaticity, annotation, and header information. Such highly heterogeneous texts must be standardized to allow for comparative research without (too much) loss of information.