In this paper I investigate the usage of the adverb and particle 'so' in spontaneous speech (interviews) collected from 21 speakers of the urban multi-ethnolectal youth language Kiezdeutsch. The speakers, from the Berlin neighborhoods of Kreuzberg and Wedding, range in age from 14 to 18. The 1454 tokens of so available in the corpus (about 5 hours of speech) were classified into 10 different categories; some were structurally defined while others were defined along dimensions of meaning. The current results indicate that, for some of these categories, usage patterns differ depending on the speaker's gender and age. Further, it appears that some patterns that have been attributed grammatical meaning may not occur frequently enough to establish a separate meaningful grammatical category. Rather, most instances of this kind of use of so appear to have a hedging function, indicating the speaker's non-commitment to a specific circumstance.
This study examines articulatory and acoustic inter-speaker variability in the production of the German vowels /i/, /u/ and /a/. Our subjects are 3 monozygotic twin pairs (2 female pairs and 1 male pair) and 2 dizygotic female twin pairs. All of them were born and raised in Berlin, still live there, and see their twin brother or sister regularly. We assume that monozygotic twins, who are genetically identical and share the same physiology, should be more similar in their articulation than dizygotic twins, but that the shared time and social environment of twins, regardless of their genetic similarity, also play a crucial role in their acoustic similarity. Articulatory measurements were made with EMA (Electromagnetic Articulography), and the target positions of the produced vowels were analyzed. Additionally, the formants F1-F4 of each vowel were measured and compared within the twin pairs. Our data seem to point to the importance of a shared environment and to a strong influence of learning that outweighs the anatomical identity of the monozygotic twins in the production of vowels. However, additional results suggest (1) an impact of physiology on the production of a vowel following a velar consonant and (2) an interaction of physiology and stress in inter-speaker variability.
This paper examines the applicability of combining data types in a study of German idioms of life with the tools of cognitive metaphor theory. The data sources for conceptual metaphors were mainly metaphors found in the relevant literature. These metaphors are to a great extent introspective in nature. The primary data sources for metaphorical expressions were dictionaries, which likewise represent introspective data. These data have been complemented by corpus data. The paper discusses the problems of introspective and corpus data raised by the study of German idioms of life. Two case studies demonstrate the advantages of combining data and methods.
Parsing coordinations
(2009)
The present paper is concerned with statistical parsing of constituent structures in German. The paper presents four experiments that aim at improving parsing performance on coordinate structures: 1) reranking the n-best parses of a PCFG parser, 2) enriching the input to a PCFG parser with gold scopes for any conjunct, 3) reranking the parser output for all possible scopes for conjuncts that are permissible with regard to clause structure. Experiment 4 reranks a combination of parses from experiments 1 and 3. The experiments presented show that n-best parsing combined with reranking improves results by a large margin. Providing the parser with different scope possibilities and reranking the resulting parses increases the F-score from 69.76 for the baseline to 74.69. While this F-score is similar to that of the first experiment (n-best parsing and reranking), the first experiment results in higher recall (75.48% vs. 73.69%) and the third one in higher precision (75.43% vs. 73.26%). Combining the two methods yields the best result, with an F-score of 76.69.
The paper investigates the origins of the German/Dutch particle doch/toch in the hope of solving a puzzle concerning the particle and of shedding some light on two theoretical issues. The puzzle is the nearly opposite meaning of the stressed and unstressed versions of the particle, which cannot be accounted for in standard theories of the meaning of stress. The first theoretical issue concerns the meaning of stress: whether it is possible to reduce the semantic contribution of a stressed item to the meaning of the item and the meaning of stress. The second issue is whether the complex use of a particle like doch/toch can be seen as an instance of spread or whether it has to be seen as having a core meaning which is differentiated by pragmatics operating in different contexts.
We use the etymology of doch and toch as to+u+h (that + question marker + emphatic marker) to argue for an origin as a question tag checking a hearer opinion. Stress on the tag indicates an opposite opinion (of the common ground or the speaker), and this sets apart two groups of uses spreading in different directions. This solves the puzzle, indicates that the assumption of spread is useful, and offers a subtle correction of the interpretation of stress. While stress always means contrast with a contrasting item, if the particle use is due to spread, it is not guaranteed that the unstressed particle has a corresponding use (or vice versa).
This paper investigates the relation between TT-MCTAG, a formalism used in computational linguistics, and RCG. RCGs are known to describe exactly the class PTIME; simple RCGs have even been shown to be equivalent to linear context-free rewriting systems, i.e., to be mildly context-sensitive. TT-MCTAG has been proposed to model free word order languages. In general, it is NP-complete. In this paper, we put an additional limitation on the derivations licensed in TT-MCTAG. We show that TT-MCTAG with this additional limitation can be transformed into equivalent simple RCGs. This result is interesting for theoretical reasons (since it shows that TT-MCTAG in this limited form is mildly context-sensitive) and also for practical reasons: we use the proposed transformation from TT-MCTAG to RCG in an actual parser that we have implemented.
The ACL 2008 Workshop on Parsing German features a shared task on parsing German. The goal of the shared task is to find reasons for the radically different behavior of parsers on the different treebanks and between constituent and dependency representations. In this paper, we describe the task and the data sets. In addition, we provide an overview of the test results and a first analysis.
Class features as probes
(2008)
In this article, we address (i) the form and (ii) the function of inflection class features in minimalist grammar. The empirical evidence comes from noun inflection systems involving fusional markers in German, Greek, and Russian. As for (i), we argue (based on instances of transparadigmatic syncretism) that class features are not privative; rather, class information must be decomposed into more abstract, binary features. Concerning (ii), we propose that class features qualify as the very device that brings about fusional inflection: they are uninterpretable in syntax and act as probes on stems, with matching inflection markers as goals, and thus trigger morphological Agree operations that merge stem and inflection marker before syntax is reached.
We present the results of an experimental study which targets prosodic correlates of subclausal quotation marks. We found that written sentences containing passages enclosed by quotation marks were read aloud in a manner that significantly differs in prosody from spoken realizations of corresponding disquoted counterparts. However, we also observed that such prosodic marking of subclausal quotation wasn't strong enough to survive subsequent back-translation into written language: there was no correlation between the presence/absence of quotation marks in the original written examples, and the presence/absence of quotation marks in corresponding back-translations from oral renditions. We investigated three different kinds of uses of quotation marks and found no systematic difference between them with respect to prosodic marking.