-
Dialogue acts in Verbmobil 2
(1998)
-
Jan Alexandersson
Bianka Buschbeck-Wolf
Tsutomu Fujinami
Michael Kipp
Stefan Koch
Elisabeth Maier
Norbert Reithinger
Birte Schmitz
Melanie Siegel
- This report describes the dialogue phases and the second edition dialogue acts which are used in the VERBMOBIL 2 project [...]. While in the first project phase the scenario was restricted to appointment scheduling dialogues, it has been extended to travel planning in the second phase with appointment scheduling being only a part of the new scenario.
-
Implementing the syntax of Japanese numeral classifiers
(2005)
-
Emily M. Bender
Melanie Siegel
- While the sortal constraints associated with Japanese numeral classifiers are well-studied, less attention has been paid to the details of their syntax. We describe an analysis implemented within a broad-coverage HPSG that handles an intricate set of numeral classifier construction types and compositionally relates each to an appropriate semantic representation, using Minimal Recursion Semantics.
-
Implementing the syntax of Japanese numeral classifiers
(2004)
-
Emily M. Bender
Melanie Siegel
- While the sortal constraints associated with Japanese numeral classifiers are well-studied, less attention has been paid to the details of their syntax. We describe an analysis implemented within a broad-coverage HPSG that handles an intricate set of numeral classifier construction types and compositionally relates each to an appropriate semantic representation, using Minimal Recursion Semantics.
-
Corpora and evaluation tools for multilingual named entity grammar development
(2003)
-
Christian Bering
Witold Droźdźyński
Gregor Erbach
Clara Guasch
Petr Homola
Sabine Lehmann
Hong Li
Hans-Ulrich Krieger
Jakub Piskorski
Ulrich Schäfer
Atsuko Shimada
Melanie Siegel
Feiyu Xu
Dorothee Ziegler-Eisele
- We present an effort for the development of multilingual named entity grammars in a unification-based finite-state formalism (SProUT). Following an extended version of the MUC7 standard, we have developed Named Entity Recognition grammars for German, Chinese, Japanese, French, Spanish, English, and Czech. The grammars recognize person names, organizations, geographical locations, currency, time and date expressions. Subgrammars and gazetteers are shared as much as possible for the grammars of the different languages. Multilingual corpora from the business domain are used for grammar development and evaluation. The annotation format (named entity and other linguistic information) is described. We present an evaluation tool which provides detailed statistics and diagnostics, allows for partial matching of annotations, and supports user-defined mappings between different annotation and grammar output formats.
-
Open-Source Machine Translation with DELPH-IN
(2005)
-
Francis Bond
Ann Copestake
Dan Flickinger
Stephan Oepen
Melanie Siegel
- The Deep Linguistic Processing with HPSG Initiative (DELPH-IN) provides the infrastructure needed to produce open-source semantic transfer-based machine translation systems. We have made available a prototype Japanese-English machine translation system built from existing resources including parsers, generators, bidirectional grammars and a transfer engine.
-
Ontology-based Information Extraction with SOBA
(2006)
-
Paul Buitelaar
Philipp Cimiano
Stefania Racioppa
Melanie Siegel
- In this paper we describe SOBA, a sub-component of the SmartWeb multi-modal dialog system. SOBA is a component for ontology-based information extraction from soccer web pages for automatic population of a knowledge base that can be used for domain-specific question answering. SOBA realizes a tight connection between the ontology, knowledge base and the information extraction component. The originality of SOBA is in the fact that it extracts information from heterogeneous sources such as tabular structures, text and image captions in a semantically integrated way. In particular, it stores extracted information in a knowledge base, and in turn uses the knowledge base to interpret and link newly extracted information with respect to already existing entities.
-
Generating and visualizing a soccer knowledge base
(2006)
-
Paul Buitelaar
Thomas Eigner
Greg Gulrajani
Alexander Schutz
Melanie Siegel
Nicolas Weber
Philipp Cimiano
Günter Ladwig
Matthias Mantel
Honggang Zhu
- This demo abstract describes the SmartWeb Ontology-based Information Extraction System (SOBIE). A key feature of SOBIE is that all information is extracted and stored with respect to the SmartWeb ontology. In this way, other components of the system, which use the same ontology, can access this information in a straightforward way. We will show how information extracted by SOBIE is visualized within its original context, thus enhancing the browsing experience of the end user.
-
The DeepThought Core Architecture Framework
(2004)
-
Ulrich Callmeier
Andreas Eisele
Ulrich Schäfer
Melanie Siegel
- The research performed in the DeepThought project aims at demonstrating the potential of deep linguistic processing if combined with shallow methods for robustness. Classical information retrieval is extended by high precision concept indexing and relation detection. On the basis of this approach, the feasibility of three ambitious applications will be demonstrated, namely: precise information extraction for business intelligence; email response management for customer relationship management; creativity support for document production and collective brainstorming. Common to these applications, and the basis for their development, is the XML-based, RMRS-enabled core architecture framework that will be described in detail in this paper. The framework is not limited to the applications envisaged in the DeepThought project, but can also be employed e.g. to generate and make use of XML standoff annotation of documents and linguistic corpora, and in general for a wide range of NLP-based applications and research purposes.
-
An integrated architecture for shallow and deep processing
(2002)
-
Berthold Crysmann
Anette Frank
Bernd Kiefer
Stefan Müller
Günter Neumann
Jakub Piskorski
Ulrich Schäfer
Melanie Siegel
Hans Uszkoreit
Feiyu Xu
Markus Becker
Hans-Ulrich Krieger
- We present an architecture for the integration of shallow and deep NLP components which is aimed at flexible combination of different language technologies for a range of practical current and future applications. In particular, we describe the integration of a high-level HPSG parsing system with different high-performance shallow components, ranging from named entity recognition to chunk parsing and shallow clause recognition. The NLP components enrich a representation of natural language text with layers of new XML meta-information using a single shared data structure, called the text chart. We describe details of the integration methods, and show how information extraction and language checking applications for real-world German text benefit from a deep grammatical analysis.
-
An HPSG-to-CFG Approximation of Japanese
(2000)
-
Bernd Kiefer
Hans-Ulrich Krieger
Melanie Siegel
- We present a simple approximation method for turning a Head-Driven Phrase Structure Grammar into a context-free grammar. The approximation method can be seen as the construction of the least fixpoint of a certain monotonic function. We discuss an experiment with a large HPSG for Japanese.