Linguistik
Refine
Year of publication
Document Type
- Conference Proceeding (166) (remove)
Has Fulltext
- yes (166) (remove)
Is part of the Bibliography
- no (166)
Keywords
- Computerlinguistik (20)
- Informationsstruktur (19)
- Deutsch (16)
- Phonetik (13)
- Japanisch (10)
- Maschinelle Übersetzung (9)
- Englisch (7)
- Grammatik (7)
- Nungisch (6)
- Tibetobirmanische Sprachen (6)
Institute
We present an effort for the development of multilingual named entity grammars in a unification-based finite-state formalism (SProUT). Following an extended version of the MUC7 standard, we have developed Named Entity Recognition grammars for German, Chinese, Japanese, French, Spanish, English, and Czech. The grammars recognize person names, organizations, geographical locations, currency, time and date expressions. Subgrammars and gazetteers are shared as much as possible for the grammars of the different languages. Multilingual corpora from the business domain are used for grammar development and evaluation. The annotation format (named entity and other linguistic information) is described. We present an evaluation tool which provides detailed statistics and diagnostics, allows for partial matching of annotations, and supports user-defined mappings between different annotation and grammar output formats.
Du fait de la traite négrière qui a vu des millions d’Africains être déportés aux Amériques, les langues européennes (anglais, espagnol, français, néerlandais, portugais) des colons qui y étaient déjà installés et qui avaient un fort besoin en main-d’oeuvre africaine, ont eu à intégrer à des degrés divers de nombreux mots africains. Les chercheurs qui travaillent sur ces africanismes sont d’accord pour dire que ces mots ont deux grandes origines africaines : bantoue et non-bantoue.
"Ich mag so Wasserpfeifeladen" : the interaction of grammar and information structure in Kiezdeutsch
(2008)
This article examines the expression of natural gender in Icelandic nouns denoting human beings. Particular attention will be paid to the system's symmetry with regards to nouns denoting women and men. Our society consists more or less exactly of half women and half men. One would therefore assume that systems for terms denoting persons would also be symmetrically organised. Yet this assumption could not be further from the truth, and not just in single isolated cases, but in many languages: I will attempt to show that Icelandic has numerous methods for referring to women, but also many barriers and idiosyncrasies.