Linguistik-Klassifikation: Computerlinguistik / Computational linguistics
5 search hits
-
Corpora and evaluation tools for multilingual named entity grammar development
(2003)
-
Christian Bering
Witold Droźdźyński
Gregor Erbach
Clara Guasch
Petr Homola
Sabine Lehmann
Hong Li
Hans-Ulrich Krieger
Jakub Piskorski
Ulrich Schäfer
Atsuko Shimada
Melanie Siegel
Feiyu Xu
Dorothee Ziegler-Eisele
- We present an effort for the development of multilingual named entity grammars in a unification-based finite-state formalism (SProUT). Following an extended version of the MUC7 standard, we have developed Named Entity Recognition grammars for German, Chinese, Japanese, French, Spanish, English, and Czech. The grammars recognize person names, organizations, geographical locations, currency, time and date expressions. Subgrammars and gazetteers are shared as much as possible for the grammars of the different languages. Multilingual corpora from the business domain are used for grammar development and evaluation. The annotation format (named entity and other linguistic information) is described. We present an evaluation tool which provides detailed statistics and diagnostics, allows for partial matching of annotations, and supports user-defined mappings between different annotation and grammar output formats.
-
Factoring Predicate Argument and Scope Semantics : underspecified Semantics with LTAG
(2003)
-
Laura Kallmeyer
Aravind K. Joshi
- In this paper we propose a compositional semantics for lexicalized tree-adjoining grammar (LTAG). Tree-local multicomponent derivations allow separation of the semantic contribution of a lexical item into one component contributing to the predicate argument structure and a second component contributing to scope semantics. Based on this idea a syntax-semantics interface is presented where the compositional semantics depends only on the derivation structure. It is shown that the derivation structure (and indirectly the locality of derivations) allows an appropriate amount of underspecification. This is illustrated by investigating underspecified representations for quantifier scope ambiguities and related phenomena such as adjunct scope and island constraints.
-
Flexible composition in LTAG : quantifier scope and inverse linking
(2003)
-
Laura Kallmeyer
Aravind K. Joshi
Maribel Romero
-
Parsing without grammar - using complete trees instead
(2003)
-
Sandra Kübler
- The definition of similarity between sentences is formulated on the levels of words, POS tags, and chunks (Abney 91; Abney 96). The evaluation of this approach shows that while precision and recall based on the PARSEVAL measures (Black et al. 91) do not reach state of the art Parsers yet (F1=87.19 on syntactic constituents, F1=77.78 including functionargument structure), the parser shows a very reliable performance where function-argument structure is concerned (F1=96.52). The lower F-scores are very often due to unattached constituents.
-
Semantic construction in feature-based TAG
(2003)
-
Claire Gardent
Laura Kallmeyer
- We propose a semantic construction method for Feature-Based Tree Adjoining Grammar which is based on the derived tree, compare it with related proposals and briefly discuss some implementation possibilities.