OPUS 4 | Search

Why is German dependency parsing more reliable than constituent parsing? (2006)

In recent years, research in parsing has extended in several new directions. One of these directions is concerned with parsing languages other than English. Treebanks have become available for many European languages, but also for Arabic, Chinese, or Japanese. However, it was shown that parsing results on these treebanks depend on the types of treebank annotations used. Another direction in parsing research is the development of dependency parsers. Dependency parsing profits from the non-hierarchical nature of dependency relations; thus lexical information can be included in the parsing process in a much more natural way. Machine-learning-based approaches are especially successful. The results achieved by these dependency parsers are very competitive, although comparisons are difficult because of the differences in annotation. For English, the Penn Treebank has been converted to dependencies. For this version, Nivre et al. report an accuracy rate of 86.3%, as compared to an F-score of 92.1 for Charniak's parser. The Penn Chinese Treebank is also available in a constituent and a dependency representation. The best results reported for parsing experiments with this treebank give an F-score of 81.8 for the constituent version and 79.8% accuracy for the dependency version. The general trend in comparisons between constituent and dependency parsers is that the dependency parser performs slightly worse than the constituent parser. The only exception occurs for German, where F-scores for constituent plus grammatical function parses range between 51.4 and 75.3, depending on the treebank, NEGRA or TüBa-D/Z. The dependency parser based on a converted version of TüBa-D/Z, in contrast, reached an accuracy of 83.4%, i.e. 12 percentage points better than the best constituent analysis including grammatical functions.

What linguists always wanted to know about German and did not know how to estimate (2006)

Hinrichs, Erhard ; Kübler, Sandra

This paper profiles significant differences in syntactic distribution and differences in word class frequencies for two treebanks of spoken and written German: the TüBa-D/S, a treebank of transliterated spontaneous dialogues, and the TüBa-D/Z treebank of newspaper articles published in the German daily newspaper die tageszeitung (taz). The approach can be used more generally as a means of distinguishing and classifying language corpora of different genres.

Verb derivation in modern Greek inside alternation classes (2006)

Charitōnidēs, Charitōn Ch.

In this paper I present five alternations of the verb system of Modern Greek, which are recurrently mapped on the syntactic frame NPi__NP. The actual claim is that only the participation in alternations and/or the allocation to an alternation variant can reliably determine the relation between a verb derivative and its base. In the second part, the conceptual structures and semantic/situational fields of a large number of “-ízo” derivatives appearing inside alternation classes are presented. The restricted character of the conceptual and situational preferences inside alternation classes suggests the dominant character of the alternations component.

Towards case-based parsing : are chunks reliable indicators for syntax trees? (2006)

Kübler, Sandra

This paper presents an approach to the question of whether it is possible to construct a parser based on ideas from case-based reasoning. Such a parser would employ a partial analysis of the input sentence to select a (nearly) complete syntax tree and then adapt this tree to the input sentence. The experiments performed on German data from the TüBa-D/Z treebank and the KaRoPars partial parser show that a wide range of levels of generality can be reached, depending on which types of information are used to determine the similarity between input sentence and training sentences. The results are such that it is possible to construct a case-based parser. The optimal setting out of those presented here needs to be determined empirically.

The superstable marker as an indicator of categorial weakness? (2006)

Dammel, Antje ; Nübling, Damaris

In this article we examine and "exapt" Wurzel's concept of superstable markers in an innovative manner. We develop an extended view of superstability through a critical discussion of Wurzel's original definition and the status of marker-superstability versus allomorphy in Natural Morphology: As we understand it, superstability is - above and beyond a step towards uniformity - mainly a symptom of the weakening of the category affected (cf. 1., 2. and 4.). This view is exemplified in four short case studies on superstability in different grammatical categories of four Germanic languages: genitive case in Mainland Scandinavian and English (3.1), plural formation in Dutch (3.2), second person singular ending -st in German (3.3), and ablaut generalisation in Luxembourgish (3.4).

The importance of early breeding in Black-tailed Godwits (Limosa limosa) (2006)

Schröder, Julia ; Hooijmeijer, Jos ; Both, Christiaan ; Piersma, Theunis

Human impacts on the landscape have increased the penalties for Black-tailed Godwits laying their eggs too late, especially in the very intensive agricultural landscapes of The Netherlands. Thus, godwits have experienced a dramatic change of their fitness landscape, because the advance in mowing date has made late clutches worthless, destroying either eggs or chicks. To determine the driving forces of the recent population decline, we study the individual variation in timing of breeding with respect to reproductive success in a population unaffected by mowing. Our results show that even in a low-intensity agricultural area it is very important for godwits to breed early in the season.

Schweig und tanze! : Elektra, by Hugo von Hofmannsthal and Richard Strauss (2006)

Sheinberg, Esti

The last scene of Richard Strauss’ Elektra builds tension towards Elektra’s dance of victory and joy, leading to her ecstatic abdication in death. Indeed, the opera’s librettist Hugo von Hofmannsthal describes this scene in one of his sketches as Elektra’s "dissolution of self."

Quantifier scope in German : an MCTAG analysis (2006)

Kallmeyer, Laura ; Romero, Maribel

Relative quantifier scope in German depends, in contrast to English, very much on word order. The scope possibilities of a quantifier are determined by its surface position, its base position and the type of the quantifier. In this paper we propose a multicomponent analysis for German quantifiers computing the scope of the quantifier, in particular its minimal nuclear scope, depending on the syntactic configuration it occurs in.

Predation on meadowbirds in The Netherlands : results of a four-year study (2006)

Teunissen, Wolf ; Schekkerman, Hans ; Willems, Frank

Meadowbird populations in The Netherlands are under great pressure. Recently, predation has been named increasingly often as one of the key factors contributing to the declines. A four-year research project (2001-2005) aimed to collect (as yet mostly nonexistent) data to provide a factual basis for this discussion. A country-wide inventory, based on data for wader nests found by volunteers who mark nests to protect them from grazing/mowing, indicated that above-average predation losses are found predominantly in the half-open landscapes of the northern and eastern Netherlands, but also locally in the low-lying open grasslands which are the key areas for meadowbirds. Nest predation has increased in recent years, but the same is true for agricultural losses, at least in areas where no nest protection takes place. At a local scale, predation losses vary greatly from area to area and from year to year. Temperature loggers in nests showed that diurnal and nocturnal predators contribute equally to total predation losses of up to 50%, but higher predation losses are mainly caused by nocturnal predators. As many as 10 animal species were identified as nest predators on nests under surveillance with video cameras. Chick survival, investigated using radiotelemetry, was very low. About 60-80% were lost to predation, 5-15% to agricultural activities and 10-15% to all kinds of other causes. At least 15 predator species were implicated, with an apparently larger share taken by birds (notably Buzzard (16%) and Grey Heron (7-18%)) than by mammals, with one exception: stoat (16%). Of the most-discussed predator species, Carrion Crows were remarkably rarely involved in both nest and chick predation, while Red Foxes take a large toll of clutches in some areas, but not in others. Of all losses during the reproductive cycle, about 75% and 60% were due to predation in Lapwing and Black-tailed Godwit, respectively.
Predation on chicks by birds had the largest effect on total breeding success, but at the same time elimination of this loss factor (if at all possible) alone would not be sufficient to establish a self-sustaining population. Predation seems to have become a factor of importance in some areas, in combination with already existing other losses. Our findings suggest that solutions to predation problems probably have to be found in locally/regionally targeted, specific action on multiple fronts rather than countrywide measures.
W. Teunissen et al., Osnabrücker Naturwiss. Mitt. 32 (2006)

Ontology-based Information Extraction with SOBA (2006)

Buitelaar, Paul ; Cimiano, Philipp ; Racioppa, Stefania ; Siegel, Melanie

In this paper we describe SOBA, a sub-component of the SmartWeb multi-modal dialog system. SOBA is a component for ontology-based information extraction from soccer web pages for automatic population of a knowledge base that can be used for domain-specific question answering. SOBA realizes a tight connection between the ontology, knowledge base and the information extraction component. The originality of SOBA is in the fact that it extracts information from heterogeneous sources such as tabular structures, text and image captions in a semantically integrated way. In particular, it stores extracted information in a knowledge base, and in turn uses the knowledge base to interpret and link newly extracted information with respect to already existing entities.

