Linguistik-Klassifikation: Computerlinguistik / Computational linguistics
Generating and Visualizing a Soccer Knowledge Base
- This demo abstract describes the SmartWeb Ontology-based Information Extraction System (SOBIE). A key feature of SOBIE is that all information is extracted and stored with respect to the SmartWeb ontology. In this way, other components of the systems, which use the same ontology, can access this information in a straightforward way. We will show how information extracted by SOBIE is visualized within its original context, thus enhancing the browsing experience of the end user.
Ontology-based Information Extraction with SOBA
- In this paper we describe SOBA, a sub-component of the SmartWeb multi-modal dialog system. SOBA is a component for ontologybased information extraction from soccer web pages for automatic population of a knowledge base that can be used for domainspecific question answering. SOBA realizes a tight connection between the ontology, knowledge base and the information extraction component. The originality of SOBA is in the fact that it extracts information from heterogeneous sources such as tabular structures, text and image captions in a semantically integrated way. In particular, it stores extracted information in a knowledge base, and in turn uses the knowledge base to interpret and link newly extracted information with respect to already existing entities.
JACY - A Grammar for Annotating Syntax, Semantics and Pragmatics of Written and Spoken Japanese for NLP Application Purposes
- In this text, we describe the development of a broad coverage grammar for Japanese that has
been built for and used in different application contexts. The grammar is based on work done
in the Verbmobil project (Siegel 2000) on machine translation of spoken dialogues in the
domain of travel planning. The second application for JACY was the automatic email
response task. Grammar development was described in Oepen et al. (2002a). Third, it was
applied to the task of understanding material on mobile phones available on the internet, while
embedded in the project DeepThought (Callmeier et al. 2004, Uszkoreit et al. 2004).
Currently, it is being used for treebanking and ontology extraction from dictionary definition
sentences by the Japanese company NTT (Bond et al. 2004).