Refine
Year of publication
Document Type
- Article (181)
- Part of Periodical (69)
- Preprint (62)
- Book (37)
- Part of a Book (28)
- Conference Proceeding (26)
- Working Paper (15)
- Report (8)
- Doctoral Thesis (4)
- Other (3)
Language
- English (437) (remove)
Has Fulltext
- yes (437)
Is part of the Bibliography
- no (437) (remove)
Keywords
- Computerlinguistik (28)
- Deutsch (20)
- Syntax (16)
- Japanisch (15)
- new species (11)
- Grammatik (10)
- Multicomponent Tree Adjoining Grammar (9)
- Optimalitätstheorie (9)
- Maschinelle Übersetzung (8)
- Syntaktische Analyse (8)
Institute
- Extern (437) (remove)
We present an architecture for the integration of shallow and deep NLP components which is aimed at flexible combination of different language technologies for a range of practical current and future applications. In particular, we describe the integration of a high-level HPSG parsing system with different high-performance shallow components, ranging from named entity recognition to chunk parsing and shallow clause recognition. The NLP components enrich a representation of natural language text with layers of new XML meta-information using a single shared data structure, called the text chart. We describe details of the integration methods, and show how information extraction and language checking applications for realworld German text benefit from a deep grammatical analysis.
Hybrid robust deep and shallow semantic processing for creativity support in document production
(2004)
The research performed in the DeepThought project (http://www.project-deepthought.net) aims at demonstrating the potential of deep linguistic processing if added to existing shallow methods that ensure robustness. Classical information retrieval is extended by high precision concept indexing and relation detection. We use this approach to demonstrate the feasibility of three ambitious applications, one of which is a tool for creativity support in document production and collective brainstorming. This application is described in detail in this paper. Common to all three applications, and the basis for their development is a platform for integrated linguistic processing. This platform is based on a generic software architecture that combines multiple NLP components and on robust minimal recursive semantics (RMRS) as a uniform representation language.
In this paper we show an approach to the customization of GermaNet to the German HPSG grammar lexicon developed in the Verbmobil project. GermaNet has a broad coverage of the German base vocabulary and fine-grained semantic classification; while the HPSG grammar lexicon is comparatively small und has a coarse-grained semantic classification. In our approach, we have developed a mapping algorithm to relate the synsets in GermaNet with the semantic sorts in HPSG. The evaluation result shows that this approach is useful for the lexical extension of our deep grammar development to cope with real-world text understanding.
Particles fullfill several distinct central roles in the Japanese language. They can mark arguments as well as adjuncts, can be functional or have semantic functions. There is, however, no straightforward matching from particles to functions, as, e.g., 'ga' can mark the subject, the object or the adjunct of a sentence. Particles can cooccur. Verbal arguments that could be identified by particles can be eliminated in the Japanese sentence. And finally, in spoken language particles are often omitted. A proper treatment of particles is thus necessary to make an analysis of Japanese sentences possible. Our treatment is based on an empirical investigation of 800 dialogues. We set up a type hierarchy of particles motivated by their subcategorizational and modificational behaviour. This type hierarchy is part of the Japanese syntax in VERBMOBIL.
Preferences and defaults for definiteness and number in japanese to german machine translation
(1996)
A significant problem when translating Japanese dialogues into German is the missing information on number and definiteness in the Japanese analysis output. The integration of the search for such information into the transfer process provides an efficient solution. General transfer includes conditions to make it possible to consider external knowledge. Thereby, grammatical and lexical knowledge of the source language, knowledge of lexical restrictions on the target language, domain knowledge and discourse knowledge are accessible.
A comprehensive investigation of Japanese particle was missing up to now. General implications were set up without the fact that a comprehensive analysis was carried out. [...] We offer a lexicalist treatment of the problem. Instead of assuming different phrase structure rules we state a type hierarchy of Japanese particles. This makes a uniform treatment of phrase structure as well as a differentiation of subcategorization patterns possible.
We present a solution for the representation of Japanese honorifical information in the HPSG framework. Basically, there are three dimensions of honorification. We show that a treatment is necessary that involves both the syntactic and the contextual level of information. The japanese grammar is part of a machine translation system.
The research performed in the DeepThought project aims at demonstrating the potential of deep linguistic processing if combined with shallow methods for robustness. Classical information retrieval is extended by high precision concept indexing and relation detection. On the basis of this approach, the feasibility of three ambitious applications will be demonstrated, namely: precise information extraction for business intelligence; email response management for customer relationship management; creativity support for document production and collective brainstorming. Common to these applications, and the basis for their development is the XML-based, RMRS-enabled core architecture framework that will be described in detail in this paper. The framework is not limited to the applications envisaged in the DeepThought project, but can also be employed e.g. to generate and make use of XML standoff annotation of documents and linguistic corpora, and in general for a wide range of NLP-based applications and research purposes.
In this paper we describe SOBA, a sub-component of the SmartWeb multi-modal dialog system. SOBA is a component for ontologybased information extraction from soccer web pages for automatic population of a knowledge base that can be used for domainspecific question answering. SOBA realizes a tight connection between the ontology, knowledge base and the information extraction component. The originality of SOBA is in the fact that it extracts information from heterogeneous sources such as tabular structures, text and image captions in a semantically integrated way. In particular, it stores extracted information in a knowledge base, and in turn uses the knowledge base to interpret and link newly extracted information with respect to already existing entities.
This demo abstract describes the SmartWeb Ontology-based Information Extraction System (SOBIE). A key feature of SOBIE is that all information is extracted and stored with respect to the SmartWeb ontology. In this way, other components of the systems, which use the same ontology, can access this information in a straightforward way. We will show how information extracted by SOBIE is visualized within its original context, thus enhancing the browsing experience of the end user.
Dialogue acts in Verbmobil 2
(1998)
This report describes the dialogue phases and the second edition dialogue acts which are used in the VERBMOBIL 2 project [...]. While in the first project phase the scenario was restricted to appointment scheduling dialogues, it has been extended to travel planning in the second phase with appointment scheduling being only a part of the new scenario.
The Deep Linguistic Processing with HPSG Initiative (DELH-IN) provides the infrastructure needed to produce open-source semantic transfer-based machine translation systems. We have made available a prototype Japanese-English machine translation system built from existing resources include parsers, generators, bidirectional grammars and a transfer engine.
While the sortal constraints associated with Japanese numeral classifiers are well-studied, less attention has been paid to the details of their syntax. We describe an analysis implemented within a broad-coverage HPSG that handles an intricate set of numeral classifier construction types and compositionally relates each to an appropriate semantic representation, using Minimal Recursion Semantics.
While the sortal constraints associated with Japanese numeral classifiers are wellstudied, less attention has been paid to the details of their syntax. We describe an analysis implemented within a broadcoverage HPSG that handles an intricate set of numeral classifier construction types and compositionally relates each to an appropriate semantic representation, using Minimal Recursion Semantics.
We present an effort for the development of multilingual named entity grammars in a unification-based finite-state formalism (SProUT). Following an extended version of the MUC7 standard, we have developed Named Entity Recognition grammars for German, Chinese, Japanese, French, Spanish, English, and Czech. The grammars recognize person names, organizations, geographical locations, currency, time and date expressions. Subgrammars and gazetteers are shared as much as possible for the grammars of the different languages. Multilingual corpora from the business domain are used for grammar development and evaluation. The annotation format (named entity and other linguistic information) is described. We present an evaluation tool which provides detailed statistics and diagnostics, allows for partial matching of annotations, and supports user-defined mappings between different annotation and grammar output formats.
Cephaloleia consanguinea Baly, Cephaloleia fulvolimbata Baly, Cephaloleia ruficollis Baly, Chalepus amabilis Baly, Chalepus brevicornis (Baly), Chalepus pici Descarpentries and Villiers, Microrhopala erebus (Newman), Octhispa bimaculata Uhmann, Octotoma championi Baly, Pseudispa tuberculata Staines, Sceloenopla erudita (Baly), Stenispa guatemalensis Uhmann, Sumitrosis gestroi (Weise), and Sumitrosis terminatus (Baly) (Coleoptera: Chrysomelidae: Cassidinae) are new country records of hispine chrysomelids for Belize, based on collections cited herein. These collections also document new host records for Calyptocephala gerstaeckeri Boheman (Chamaedorea tepejilote Liebm., Arecaceae), Cephaloleia consanguinea (Heliconia bourgaeana Petersen, H. collinsiana Griggs, H. latispatha Benth., H. wagneriana Petersen; Heliconiaceae), and Cephaloleia perplexa Baly (Heliconia bourgaeana, H. latispatha; Heliconiaceae).
In this contribution I shall be interested, among other things, in finding a place for the European phenomenon of the ›Speculum humanae salvationis‹ within German literary history, which will inescapably involve revisiting the unfashionable discussion of date and origins. I also intend to ask about the place of this text in the ‘didactic’ literature of the Middle Ages. Is a religious text structured according to sacred history didactic? Much didactic poetry is in the vernacular: What does it mean that the ›Speculum‹ was composed in Latin? And what place should be accorded to its vernacular reception? The ›Speculum‹ is inscribed within a set of oppositions that would appear to be recurrent in the didactic literature of the later Middle Ages: Latin and vernacular, verse and prose, words and pictures, religious and profane, moral
teaching and devotion, clerical and lay. In view of its exceptionally broad transmission in the German lands, both in Latin and in vernacular reworkings, is it possible to describe this text so that it takes a place within a larger picture? In some respects it may stand at a threshold in the history of European didacticism.
Ein Vordenker, der in der internationalen Diskussion um « cultural translation » so gut wie nie diskutiert wird, ist Antonio Gramsci. Der Philosoph aus Sardinien, von Kindes Tagen an in Zweisprachigkeit (Sardisch-Italienisch) geübt, hat ein feines Sensorium für kulturelle Differenzen ausgebildet. In seinen Gefängnisjahren übersetzt er – als intellektuelles Training – aus dem Russischen und dem Deutschen ins Italienische, und in den Gefängnisheften setzt er sich wiederholt mit dem Begriff der traducibilità (Übersetzbarkeit) auseinander: Übersetzbarkeit von Sprachen, aber auch von Kulturen. Der Artikel geht den Linien nach, die von Gramscis Überlegungen zu der aktuellen Diskussion gezogen werden können, und diskutiert am Ende vergleichend die Positionen Homi K. Bhabhas und Gayatri Spivaks.
Pope Benedict XVI’s Regensburg lecture has been exposed by some learned voices of 'the Muslim world' as alluding, by the means of one particular quotation, to age-old stereotypes about Islam being an essentially violent creed in which moderation through reason has no legitimate place, and of representing Muhammadas an evil and inhuman man who preached that Islam should be spread by the sword. While none of these presumably 'Muslim' voices deny that the Pope has the right to express his opinions, even when they are plainly wrong in the face of historic facts that show how Islam and Christianity were spread (or were made to spread) across the world, he is criticised for a host of omissions in terms of intellectual honesty and factual accuracy. These omissions, it is argued here, cast an unfortunate light on the compatibility of scientific and religious rationality much advocated by the Pope in his 12 September 2006 lecture. This flagrant 'performative contradiction' (Habermas) leaves room for speculation about the true aim of the speech. Is Benedict XVI's appeal to theology as a legitimate academic discipline a credible attempt to explicate Roman Catholicism's rightful place in a modern world governed by liberal democracy and ethical-political pluralism, or is it a reflection of a move to restore the age-old, intolerant, anti-scientific, and anti-democratic legacy of the pre-Vatican II Catholic Church?
There are many aspects of Haas' life and experiences in India which deserve greater attention. I would like to refer briefly only to his attempts as a litterateur to come to terms with 'India' as presented in his autobiographical recollection and to some comparative cultural reflections in his essays. Like all reconstructions his autobiographical recollection of India is also a construct in which the site of India as a place of exile is justified by an achieved awareness between conscious individual choice and inevitability. An individual acts out a personal history, the prefiguration of which he only becomes aware of in the form of a subsequent epiphanic realization. Given Haas' literary background, it is not surprising that this is articulated through a literary association.