Refine
Year of publication
Document Type
- Article (29)
- Part of a Book (4)
- Preprint (4)
- Conference Proceeding (2)
- Book (1)
- Doctoral Thesis (1)
- Report (1)
Language
- English (20)
- German (18)
- Portuguese (2)
- Turkish (2)
Has Fulltext
- yes (42)
Is part of the Bibliography
- no (42) (remove)
Keywords
- German (42) (remove)
Institute
- Neuere Philologien (6)
- Extern (4)
- Sprachwissenschaften (2)
- Gesellschaftswissenschaften (1)
- Medizin (1)
- Sprach- und Kulturwissenschaften (1)
By focusing human factors by the phraseological nomination, it becomes possible to expose obvious cases as reflections of everyday collective observations, experiences and evaluations considering a certain behavior or action. The subject of this investigation is differently molded phraseological units that permit to be listed under the hypernym ‘THE END OF LIFE BY HUMAN BEINGS’. The execution follows up the role of the linguistic image by the constitution of a slice of reality and gives representative examples of the metaphorization of the concept ‚DECEASE‘ in German, Romanian and Swedish. Productive source domains for the conceptualization of this notion will be considered; this due to the insight that conceptual spheres give keys to thought models, values and ideals anchored in the language.
The article presents a chronicle of the town of Kaaden (Kadaň) dating from the 16th century, currently held in Prague's Monastery of Our Lady of the Snows. It explores several aspects of Humanistic urban history writing, including the presence of the author in the text of the chronicle, the methodology of the author's historiographic work, and his choice and use of language (German, Latin). The study also presents this chronicle as an interesting and important source of information on writing practices in north-west Bohemia from a text-analytical perspective.
Manual development of deep linguistic resources is time-consuming and costly and therefore often described as a bottleneck for traditional rule-based NLP. In my PhD thesis I present a treebank-based method for the automatic acquisition of LFG resources for German. The method automatically creates deep and rich linguistic presentations from labelled data (treebanks) and can be applied to large data sets. My research is based on and substantially extends previous work on automatically acquiring wide-coverage, deep, constraint-based grammatical resources from the English Penn-II treebank (Cahill et al.,2002; Burke et al., 2004; Cahill, 2004). Best results for English show a dependency f-score of 82.73% (Cahill et al., 2008) against the PARC 700 dependency bank, outperforming the best hand-crafted grammar of Kaplan et al. (2004). Preliminary work has been carried out to test the approach on languages other than English, providing proof of concept for the applicability of the method (Cahill et al., 2003; Cahill, 2004; Cahill et al., 2005). While first results have been promising, a number of important research questions have been raised. The original approach presented first in Cahill et al. (2002) is strongly tailored to English and the datastructures provided by the Penn-II treebank (Marcus et al., 1993). English is configurational and rather poor in inflectional forms. German, by contrast, features semi-free word order and a much richer morphology. Furthermore, treebanks for German differ considerably from the Penn-II treebank as regards data structures and encoding schemes underlying the grammar acquisition task. In my thesis I examine the impact of language-specific properties of German as well as linguistically motivated treebank design decisions on PCFG parsing and LFG grammar acquisition. I present experiments investigating the influence of treebank design on PCFG parsing and show which type of representations are useful for the PCFG and LFG grammar acquisition tasks. Furthermore, I present a novel approach to cross-treebank comparison, measuring the effect of controlled error insertion on treebank trees and parser output from different treebanks. I complement the cross-treebank comparison by providing a human evaluation using TePaCoC, a new testsuite for testing parser performance on complex grammatical constructions. Manual evaluation on TePaCoC data provides new insights on the impact of flat vs. hierarchical annotation schemes on data-driven parsing. I present treebank-based LFG acquisition methodologies for two German treebanks. An extensive evaluation along different dimensions complements the investigation and provides valuable insights for the future development of treebanks.
Tree-local MCTAG with shared nodes : an analysis of word order variation in German and Korean
(2004)
Tree Adjoining Grammars (TAG) are known not to be powerful enough to deal with scrambling in free word order languages. The TAG-variants proposed so far in order to account for scrambling are not entirely satisfying. Therefore, an alternative extension of TAG is introduced based on the notion of node sharing. Considering data from German and Korean, it is shown that this TAG-extension can adequately analyse scrambling data, also in combination with extraposition and topicalization.
The ACL 2008 Workshop on Parsing German features a shared task on parsing German. The goal of the shared task was to find reasons for the radically different behavior of parsers on the different treebanks and between constituent and dependency representations. In this paper, we describe the task and the data sets. In addition, we provide an overview of the test results and a first analysis.
This paper presents three acceptability experiments investigating German verb-final clauses in order to explore possible sources of sentence complexity during human parsing. The point of departure was De Vries et al.'s (2011) generalization that sentences with three or more crossed or nested dependencies are too complex for being processed by the human parsing mechanism without difficulties. This generalization is partially based on findings from Bach et al. (1986) concerning the acceptability of complex verb clusters in German and Dutch. The first experiment tests this generalization by comparing two sentence types: (i) sentences with three nested dependencies within a single clause that contains three verbs in a complex verb cluster; (ii) sentences with four nested dependencies distributed across two embedded clauses, one center-embedded within the other, each containing a two-verb cluster. The results show that sentences with four nested dependencies are judged as acceptable as control sentences with only two nested dependencies, whereas sentences with three nested dependencies are judged as only marginally acceptable. This argues against De Vries et al.'s (2011) claim that the human parser can process no more than two nested dependencies. The results are used to refine the Verb-Cluster Complexity Hypothesis of Bader and Schmid (2009a). The second and the third experiment investigate sentences with four nested dependencies in more detail in order to explore alternative sources of sentence complexity: the number of predicted heads to be held in working memory (storage cost in terms of the Dependency Locality Theory [DLT], Gibson, 2000) and the length of the involved dependencies (integration cost in terms of the DLT). Experiment 2 investigates sentences for which storage cost and integration cost make conflicting predictions. The results show that storage cost outweighs integration cost. Experiment 3 shows that increasing integration cost in sentences with two degrees of center embedding leads to decreased acceptability. Taken together, the results argue in favor of a multifactorial account of the limitations on center embedding in natural languages.
The Intensifying Function of Modal Particles and Modal Elements in a Cross-Linguistic Perspective
(2015)
The aim of this paper is to analyze the intensifying function of German modal particles and equivalent modal expressions in Croatian and English. Our hypothesis is that some modal particles in German and their functional equivalents in Croatian and English can express different degrees of intensity and types of intensification. The presented study comprises two parts. First, the use of intensifying modal particles by a group of speakers of L1 Croatian and L2 German/English is investigated. On the basis of the results obtained, and by means of a previously conducted corpus analysis (cf. Kresić and Batinić 2014), an intensification scale with respect to the inventory of German modal particles and corresponding particles in Croatian as well as equivalent English expressions is suggested. Some German and Croatian modal particles and equivalent modal elements in English can be classified on the upper and partially on the lower part of the proposed intensification scale when compared to the norm, i.e. an utterance unmarked by a modal particle.
Existing analyses of German scrambling phenomena within TAG-related formalisms all use non-local variants of TAG. However, there are good reasons to prefer local grammars, in particular with respect to the use of the derivation structure for semantics. Therefore this paper proposes to use local TDGs, a TAG-variant generating tree descriptions that shows a local derivation structure. However the construction of minimal trees for the derived tree descriptions is not subject to any locality constraint. This provides just the amount of non-locality needed for an adequate analysis of scrambling. To illustrate this a local TDG for some German scrambling data is presented.
In this paper I argue in favor of a Matching Analysis for German relative clauses. The Head Raising Analysis is shown to fail to account for parts of the reconstruction pattern in German, especially cases where only the external head is interpreted and the absence of Principle C effects. I propose a Matching Analysis with Vehicle Change and make consistent assumptions about possible deletion operations in relatives so that the entire pattern can be captured by one analysis which therefore proves superior to previous ones.
Im vorliegenden Beitrag werden die Ergebnisse der kontrastiven Forschung von phraseologischen Neologismen im Deutschen und Ukrainischen gezeigt. Das Ziel dieses Beitrags besteht darin, die phraseologischen Neologismen, die in der Corona-Pandemie in Deutschland und in der Ukraine in digitalen Medien verwendet wurden, zu analysieren, die strukturell-semantischen Merkmale ausgewählter phraseologischer Einheiten in beiden Sprachen zu beschreiben, ihre konzeptuelle Analyse durchzuführen, die vorhandenen Neologismenpaare einander gegenüberzustellen sowie Äquivalenztypen zu untersuchen. Die Materialbasis bilden diesbezüglich 96 deutsche und 60 ukrainische phraseologische Neologismen der Jahre 2020–2021.