No other country is as strongly influenced in its political, social and cultural structures by both Western and Eastern mentalities as Lebanon, and hardly any other country has such a pivotal function. In this mediating role it can be compared with a literary work that has earned its place in world literature, as hardly any other has, through the interplay of Orient and Occident. I am thinking of the collection "A Thousand and One Nights", or, by its original title, "Alf Laila wa-Laila".
In this paper we describe SOBA, a sub-component of the SmartWeb multi-modal dialog system. SOBA is a component for ontology-based information extraction from soccer web pages for automatic population of a knowledge base that can be used for domain-specific question answering. SOBA realizes a tight connection between the ontology, the knowledge base and the information extraction component. The originality of SOBA lies in the fact that it extracts information from heterogeneous sources such as tabular structures, text and image captions in a semantically integrated way. In particular, it stores extracted information in a knowledge base, and in turn uses the knowledge base to interpret and link newly extracted information with respect to already existing entities.
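The loop described in this abstract (store what was extracted, then use the stored entities to link later mentions) can be illustrated with a minimal Python sketch. This is not SOBA's implementation: the `Entity` and `KnowledgeBase` classes, the name-normalization rule, the ontology class name `FootballPlayer`, and all sample values are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Entity:
    """A knowledge-base entry typed by an ontology class (hypothetical schema)."""
    entity_id: int
    onto_class: str
    name: str
    facts: dict = field(default_factory=dict)


class KnowledgeBase:
    """Toy knowledge base that links new mentions to already stored entities."""

    def __init__(self):
        self._entities = []
        self._index = {}  # (ontology class, normalized name) -> Entity

    @staticmethod
    def _normalize(name):
        # Naive matching rule (assumption): lowercase and collapse whitespace.
        return " ".join(name.lower().split())

    def link_or_create(self, onto_class, name):
        """Return the existing entity for this mention, or create a new one."""
        key = (onto_class, self._normalize(name))
        entity = self._index.get(key)
        if entity is None:
            entity = Entity(len(self._entities) + 1, onto_class, name)
            self._entities.append(entity)
            self._index[key] = entity
        return entity

    def add_fact(self, onto_class, name, prop, value, source):
        """Attach a property value and record the kind of source it came from."""
        entity = self.link_or_create(onto_class, name)
        entity.facts.setdefault(prop, []).append((value, source))
        return entity

    def __len__(self):
        return len(self._entities)


# Made-up mentions from three kinds of sources: a table, running text, a caption.
kb = KnowledgeBase()
kb.add_fact("FootballPlayer", "Example Player", "team", "Example Team", "table")
kb.add_fact("FootballPlayer", "example  player", "goals", 1, "text")
kb.add_fact("FootballPlayer", "Other Player", "team", "Example Team", "caption")
```

In this sketch the second mention resolves to the entity created from the table row, so facts from the table and from the text end up on one entry. A real system would need much more robust matching than string normalization.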
This demo abstract describes the SmartWeb Ontology-based Information Extraction System (SOBIE). A key feature of SOBIE is that all information is extracted and stored with respect to the SmartWeb ontology. In this way, other components of the system that use the same ontology can access this information in a straightforward way. We will show how information extracted by SOBIE is visualized within its original context, thus enhancing the browsing experience of the end user.