Linguistik
Refine
Year of publication
Document Type
- Article (222)
- Part of a Book (89)
- Review (55)
- Preprint (19)
- Conference Proceeding (16)
- Working Paper (15)
- Book (8)
- Part of Periodical (8)
- Report (4)
- Periodical (1)
Language
- German (290)
- English (86)
- Portuguese (42)
- Turkish (14)
- Croatian (2)
- Multiple languages (2)
- Spanish (1)
Keywords
- Deutsch (437) (remove)
Institute
The status of quantifier raising in German and other languages where scope is fairly rigid is debated. The first part of this paper argues that quantifiers in German can undergo covert extraction out of coordinations, and therefore that quantifier raising is available in German. The second part argues that quantifier raising in German is constrained to never move one DP across another. This result might provide part of an explanation of scope rigidity in German.
Ziel des Teilprojekts ist die thematische Erschließung der Korpora, um sowohl themenspezifische virtuelle Subkorpora zusammenstellen zu können als auch aufgrund der Analyse sachgebietsbezogener Häufigkeitsverteilungen z.B. Lesarten disambiguieren zu können. Ausgangspunkt ist die Erstellung einer Taxonomie von Sachgebietsthemen. Dies erfolgt in einem semiautomatischen Verfahren, welches die Anwendung von Textmining (Dokumentclustering) und die manuelle Zuordnung von Clustern in eine externen Ontologie beinhaltet. Es wird argumentiert, dass die so gewonnene Taxonomie sowohl intuitiver als auch objektiver ist als bestehende, rein manuelle Ansätze. Sie eignet sich zudem gleichermaßen für manuelle als auch für maschinelle Klassifikation. Für letzteres wird der Naive Bayes'sche Textklassifikator motiviert und für ein klassifiziertes Korpus von knapp zwei Milliarden Wörtern evaluiert.
Mit Erstaunen stellen LinguistInnen aus Deutschland, Österreich und der Schweiz immer wieder fest, dass sich in der "kleinen" Schweiz der geschlechtergerechte Sprachgebrauch in Öffentlichkeit und Alltag weit stärker durchgesetzt hat als in den anderen deutschsprachigen Ländern. Diese Einschätzung gilt es hier zu überprüfen und, falls sie zutrifft, zu belegen. Ausserdem werden - als erster Schritt fur weitere Untersuchungen - Thesen formuliert, die Erklärungen liefern, worauf diese Entwicklung zurückgeführt werden kann. Mit diesem Artikel geben wir anband von ausgewählten, konkreten Beispielen einen Einblick in die Situation, wie sie sich zur Zeit in der Schweiz präsentiert. Wir konzentrieren uns - unter sprachsoziologischer Perspektive - auf eine erste Bestandesaufnahme mit dem Blick auf die Diskussion in den Medien, die Institutionalisierung und die Einstellungen, die die spezifische sprachliche Situation in der Deutschschweiz prägen. Einen Rahmen fur unsere Untersuchung bilden die Überlegungen von Schräpel (SCHRÄPEL 1986), die die Auseinandersetzung um nichtsexistische Sprache als ein besonderes Sprachwandelphänomen untersucht. Sprachwandel im Vollzug ist einerseits einfacher zu erfassen als einer, der weiter zurückliegt, andererseits erschwert die Fülle des greifbaren Materials auch den Durchblick und das klare Erkennen von Tendenzen. Aus diesem Grund werten wir unser Datenmaterial nicht quantitativ aus, sondern konzentrieren uns darauf, für verschiedene Aspekte typische Beispiele zu geben und so den Stand der öffentlichen Diskussion und die Breite der vertretenen Meinungen darzustellen. Es wäre verlockend, das hier vorliegende Material auch allgemeinerer Form unter der Thematik "Sprachkritik" oder "Einstellungen" zu analysieren. Dies ist jedoch nicht im Zentrum unserer Fragestellung, weshalb wir bei einigen Beispielen auf entsprechende Untersuchungen (z.B. BLAUBERGS 1980, SCHOENTHAL 1989) verweisen.
This paper provides an analysis of an alternative strategy to A´-movement in both German and Dutch where the extracted constituent is preceded by a preposition and a coreferential pronoun appears in the extraction site. The construction has properties of both binding and movement: Whereas reconstruction effects suggest movement out of the embedded clause, there is strong evidence that the operator constituent is linked to an A-position in the matrix clause; this paradox is resolved by assuming a Control-like approach that involves movement from the embedded clause into a theta-position in the matrix clause with subsequent short A´- movement. The coreferential pronoun is interpreted as a resumptive heading a Big-DP which hosts the antecedent in its specifier.
In this paper I argue in favor of a Matching Analysis for German relative clauses. The Head Raising Analysis is shown to fail to account for parts of the reconstruction pattern in German, especially cases where only the external head is interpreted and the absence of Principle C effects. I propose a Matching Analysis with Vehicle Change and make consistent assumptions about possible deletion operations in relatives so that the entire pattern can be captured by one analysis which therefore proves superior to previous ones.
Deutsche Rundfunksprache in mehrsprachiger Umwelt : am Beispiel der Verwendung von Phraseologismen
(1995)
The ACL 2008 Workshop on Parsing German features a shared task on parsing German. The goal of the shared task was to find reasons for the radically different behavior of parsers on the different treebanks and between constituent and dependency representations. In this paper, we describe the task and the data sets. In addition, we provide an overview of the test results and a first analysis.
Chunk parsing has focused on the recognition of partial constituent structures at the level of individual chunks. Little attention has been paid to the question of how such partial analyses can be combined into larger structures for complete utterances. Such larger structures are not only desirable for a deeper syntactic analysis. They also constitute a necessary prerequisite for assigning function-argument structure. The present paper offers a similaritybased algorithm for assigning functional labels such as subject, object, head, complement, etc. to complete syntactic structures on the basis of prechunked input. The evaluation of the algorithm has concentrated on measuring the quality of functional labels. It was performed on a German and an English treebank using two different annotation schemes at the level of function argument structure. The results of 89.73% correct functional labels for German and 90.40%for English validate the general approach.
This paper reports on the SYN-RA (SYNtax-based Reference Annotation) project, an on-going project of annotating German newspaper texts with referential relations. The project has developed an inventory of anaphoric and coreference relations for German in the context of a unified, XML-based annotation scheme for combining morphological, syntactic, semantic, and anaphoric information. The paper discusses how this unified annotation scheme relates to other formats currently discussed in the literature, in particular the annotation graph model of Bird and Liberman (2001) and the pie-in-thesky scheme for semantic annotation.
This paper provides an overview of current research on a hybrid and robust parsing architecture for the morphological, syntactic and semantic annotation of German text corpora. The novel contribution of this research lies not in the individual parsing modules, each of which relies on state-of-the-art algorithms and techniques. Rather what is new about the present approach is the combination of these modules into a single architecture. This combination provides a means to significantly optimize the performance of each component, resulting in an increased accuracy of annotation.
Tree-local MCTAG with shared nodes : an analysis of word order variation in German and Korean
(2004)
Tree Adjoining Grammars (TAG) are known not to be powerful enough to deal with scrambling in free word order languages. The TAG-variants proposed so far in order to account for scrambling are not entirely satisfying. Therefore, an alternative extension of TAG is introduced based on the notion of node sharing. Considering data from German and Korean, it is shown that this TAG-extension can adequately analyse scrambling data, also in combination with extraposition and topicalization.
In this paper, we present an open-source parsing environment (Tübingen Linguistic Parsing Architecture, TuLiPA) which uses Range Concatenation Grammar (RCG) as a pivot formalism, thus opening the way to the parsing of several mildly context-sensitive formalisms. This environment currently supports tree-based grammars (namely Tree-Adjoining Grammars (TAG) and Multi-Component Tree-Adjoining Grammars with Tree Tuples (TT-MCTAG)) and allows computation not only of syntactic structures, but also of the corresponding semantic representations. It is used for the development of a tree-based grammar for German.
Existing analyses of German scrambling phenomena within TAG-related formalisms all use non-local variants of TAG. However, there are good reasons to prefer local grammars, in particular with respect to the use of the derivation structure for semantics. Therefore this paper proposes to use local TDGs, a TAG-variant generating tree descriptions that shows a local derivation structure. However the construction of minimal trees for the derived tree descriptions is not subject to any locality constraint. This provides just the amount of non-locality needed for an adequate analysis of scrambling. To illustrate this a local TDG for some German scrambling data is presented.
Relative quantifier scope in German depends, in contrast to English, very much on word order. The scope possibilities of a quantifier are determined by its surface position, its base position and the type of the quantifier. In this paper we propose a multicomponent analysis for German quantifiers computing the scope of the quantifier, in particular its minimal nuclear scope, depending on the syntactic configuration it occurs in.
This paper investigates the relation between TT-MCTAG, a formalism used in computational linguistics, and RCG. RCGs are known to describe exactly the class PTIME; simple RCG even have been shown to be equivalent to linear context-free rewriting systems, i.e., to be mildly context-sensitive. TT-MCTAG has been proposed to model free word order languages. In general, it is NP-complete. In this paper, we will put an additional limitation on the derivations licensed in TT-MCTAG. We show that TT-MCTAG with this additional limitation can be transformed into equivalent simple RCGs. This result is interesting for theoretical reasons (since it shows that TT-MCTAG in this limited form is mildly context-sensitive) and, furthermore, even for practical reasons: We use the proposed transformation from TT-MCTAG to RCG in an actual parser that we have implemented.
TT-MCTAG lets one abstract away from the relative order of co-complements in the final derived tree, which is more appropriate than classic TAG when dealing with flexible word order in German. In this paper, we present the analyses for sentential complements, i.e., wh-extraction, thatcomplementation and bridging, and we work out the crucial differences between these and respective accounts in XTAG (for English) and V-TAG (for German).
Developing linguistic resources, in particular grammars, is known to be a complex task in itself, because of (amongst others) redundancy and consistency issues. Furthermore some languages can reveal themselves hard to describe because of specific characteristics, e.g. the free word order in German. In this context, we present (i) a framework allowing to describe tree-based grammars, and (ii) an actual fragment of a core multicomponent tree-adjoining grammar with tree tuples (TT-MCTAG) for German developed using this framework. This framework combines a metagrammar compiler and a parser based on range concatenation grammar (RCG) to respectively check the consistency and the correction of the grammar. The German grammar being developed within this framework already deals with a wide range of scrambling and extraction phenomena.
This paper examines the development of periphrastic constructions involving auxiliary "have" and "be" with a past participle in the history of English, on the basis of parsed electronic corpora. It is argued that the two constructions represented distinct syntactic and semantic structures: while the one with have developed into a true perfect in the course of Middle English, the one with be remained a stative resultative throughout its history. In this way, it is explained why the be construction was rarely or never used in a number of contexts, including past counterfactuals, iteratives, duratives, certain kinds of infinitives and various other utterance types that cannot be characterized as perfects of result. When the construction with have became a true perfect, it was used in such contexts, regardless of the identity of the main verb, leading to the appearance of have with verbs like come which had previously only taken be. Crucially, however, have was not spreading at the expense of be, as the be perfect had never been used in such contexts, but rather at the expense of the old simple past. At least until the end of the Early Modern English period, the shift in the relative frequency of have and be perfects is to be explained in terms of the expansion of the former into new contexts, while the latter remained stable. A formal analysis is proposed, taking as its starting point a comparison with German which shows that the older English be perfect indeed behaves more like the German stative passive than its haben and sein perfects.