• Treffer 2902 von 3001
Zurück zur Trefferliste

The TUSNELDA annotation standard : an XML encoding standard for multilingual corpora supporting various aspects of linguistic research

  • This paper proposes a corpus encoding standard that meets the needs of linguistic research using a variety of linguistic data structures. The standard was developed in SFB 441, a research project at the University of Tuebingen. The principal concern of SFB 441 are the empirical data structures which feed into linguistic theory building. SFB 441 consists of several projects, most of which are building corpora to empirically investigate various linguistic phenomena in various languages (e.g. modal verbs in German, forms of address and politeness in Russian). These corpora will form the components of the "Tuebingen collection of reusable, empirical, linguistic data structures (TUSNELDA)". The TUSNELDA annotation standard aims at providing a uniform encoding scheme for all subcorpora and texts of TUSNELDA such that they can be processed with uniform standardized tools. To guarantee maximal reusability we use XML for encoding. Previous SGML standards for text encoding were provided by the Text Encoding Initiative (TEI) and the Expert Advisory Group on Language Engineering Standards (Corpus Encoding Standard, CES). The TUSNELDA standard is based on TEI and XCES (XML version of CES) but takes into account the specific needs of the SFB projects, i.e. the peculiarities of the examined languages and linguistic phenomena.

Volltext Dateien herunterladen

Metadaten exportieren

Metadaten
Verfasserangaben:Laura KallmeyerORCiDGND, Andreas Wagner
URN:urn:nbn:de:hebis:30-1110436
URL:http://www.lingexp.uni-tuebingen.de/sfb441/c1/drh-abstract
ISBN:1897791151
Herausgeber*in:Marilyn Deegan, Michael Fraser, Nigel Williamson
Dokumentart:Preprint
Sprache:Englisch
Jahr der Fertigstellung:2000
Jahr der Erstveröffentlichung:2000
Veröffentlichende Institution:Universitätsbibliothek Johann Christian Senckenberg
Datum der Freischaltung:21.10.2008
Freies Schlagwort / Tag:TUSNELDA
Seitenzahl:4
Bemerkung:
Erschienen in: Marilyn Deegan ; Michael Fraser ; Nigel Williamson (Hrsg.): Digital evidence : selected papers from DRH2000, Digital Resources for the Humanities Conference, University of Sheffield, September 2000, London : Office for Humanities Communication, ISBN: 1897791151
Quelle:http://www.sfb441.uni-tuebingen.de/c1/drh-abstract ; Proceedings of the conference Digital Resources for the Humanities (Sheffield 2000).
HeBIS-PPN:20673378X
Institute:keine Angabe Fachbereich / Extern
DDC-Klassifikation:4 Sprache / 40 Sprache / 400 Sprache
Sammlungen:Linguistik
Linguistik-Klassifikation:Linguistik-Klassifikation: Computerlinguistik / Computational linguistics
Lizenz (Deutsch):License LogoDeutsches Urheberrecht