Linguistics
We investigate methods to improve the recall in coreference resolution by also trying to resolve those definite descriptions where no earlier mention of the referent shares the same lexical head (coreferent bridging). The problem, which is notably harder than identifying coreference relations among mentions which have the same lexical head, has been tackled with several rather different approaches, and we attempt to provide a meaningful classification along with a quantitative comparison. Based on the different merits of the methods, we discuss possibilities to improve them and show how they can be effectively combined.
Using a qualitative analysis of disagreements from a referentially annotated newspaper corpus, we show that, in coreference annotation, vague referents are prone to greater disagreement. We show how potentially problematic cases can be dealt with in a way that is practical even for larger-scale annotation, considering a real-world example from newspaper text.
Tagging kausaler Relationen [Tagging causal relations]
(2005)
This diploma thesis deals with causal relations between events and with explanatory relations between events in which causal relations play an important role. Temporal relations have recently attracted more attention, both because they are easier to formalise and because of their clearly visible role in grammar (tense and aspect, temporal conjunctions). Here it is argued that causal relations, and the explanations they make possible, play a more important role in the coherence structure of a text. In contrast to "deep" approaches, which build on a detailed semantic representation of the text and are therefore, in my view, unsuitable for unrestricted text, the thesis investigates how this goal can be achieved without relying on an elaborately constructed knowledge base.
In this paper, we argue that difficulties in the definition of coreference itself contribute to lower inter-annotator agreement in certain cases. Data from a large referentially annotated corpus corroborate this point: a quantitative investigation assesses which effects or problems are likely to be the most prominent. Several examples where such problems occur are discussed in more detail. We then propose a generalisation of Poesio, Reyle and Stevenson's Justified Sloppiness Hypothesis to provide a unified model for these cases of disagreement, and argue that a deeper understanding of the phenomena involved allows us to tackle problematic cases in a more principled fashion than would be possible using only pre-theoretic intuitions.
Distributional approximations to lexical semantics are very useful not only in helping the creation of lexical semantic resources (Kilgarriff et al., 2004; Snow et al., 2006), but also when directly applied in tasks that can benefit from large-coverage semantic knowledge, such as coreference resolution (Poesio et al., 1998; Gasperin and Vieira, 2004; Versley, 2007), word sense disambiguation (McCarthy et al., 2004) or semantic role labeling (Gordon and Swanson, 2007). We present a model that is built from Web-based corpora using both shallow patterns for grammatical and semantic relations and a window-based approach, using singular value decomposition to decorrelate the feature space, which is otherwise too heavily influenced by the skewed topic distribution of Web corpora.
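The decorrelation step mentioned in this abstract can be illustrated with a minimal sketch. It assumes NumPy, a log damping of the raw counts, and a tiny hand-made word-by-feature matrix; the function names `reduce_with_svd` and `cosine`, the weighting, and the toy counts are all hypothetical and are not taken from the paper.

```python
import numpy as np

def reduce_with_svd(counts, k):
    """Project the rows of a word-by-feature count matrix onto the top-k singular directions."""
    # Dampen raw counts so that very frequent features do not dominate
    # (an assumed weighting; the paper's own weighting is not given here).
    weighted = np.log1p(np.asarray(counts, dtype=float))
    # Thin SVD: weighted == U @ diag(S) @ Vt
    U, S, Vt = np.linalg.svd(weighted, full_matrices=False)
    # Keep the k strongest dimensions; the resulting columns are mutually orthogonal.
    return U[:, :k] * S[:k]

def cosine(a, b):
    """Cosine similarity between two vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Toy data: rows are words, columns are context features (hypothetical counts).
words = ["car", "truck", "apple", "pear"]
counts = [
    [8, 6, 0, 1],
    [7, 9, 1, 0],
    [0, 1, 9, 7],
    [1, 0, 6, 8],
]
vectors = reduce_with_svd(counts, k=2)

# Words with similar contexts end up closer together in the reduced space.
print(cosine(vectors[0], vectors[1]))  # car vs. truck
print(cosine(vectors[0], vectors[2]))  # car vs. apple
```

In this sketch the reduced dimensions are uncorrelated by construction, since the columns of `U` are orthonormal. How many dimensions to keep is a tuning choice that the abstract does not specify.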
We adopt the approach of Markert and Nissim (2005), who use the World Wide Web to resolve cases of coreferent bridging, apply it to German, and discuss its strengths and weaknesses. As the general approach of using surface patterns to get information on ontological relations between lexical items has only been tried on English, it is also interesting to see whether it works as well for German as it does for English, and what differences between these languages need to be accounted for. We also present a novel approach for combining several patterns that yields an ensemble that outperforms the best-performing single patterns in terms of both precision and recall.
In the past, a divide could be seen between "deep" parsers on the one hand, which construct a semantic representation out of their input but usually have significant coverage problems, and more robust parsers on the other hand, which are usually based on a (statistical) model derived from a treebank and have larger coverage but leave the problem of semantic interpretation to the user. More recently, approaches have emerged that combine the robustness of data-driven (statistical) models with more detailed linguistic interpretation, such that the output could be used for deeper semantic analysis. Cahill et al. (2002) use a PCFG-based parsing model in combination with a set of principles and heuristics to derive functional (f-)structures of Lexical-Functional Grammar (LFG). They show that the derived functional structures are of better quality than those generated by a parser based on a state-of-the-art hand-crafted LFG grammar. Advocates of Dependency Grammar usually point out that dependencies already are a semantically meaningful representation (cf. Menzel, 2003). However, parsers based on dependency grammar normally create underspecified representations with respect to certain phenomena such as coordination, apposition and control structures. In these areas they are too "shallow" to be directly used for semantic interpretation. In this paper, we adopt an approach similar to that of Cahill et al. (2002), using a dependency-based analysis to derive functional structure, and demonstrate the feasibility of this approach using German data. A major focus of our discussion is on the treatment of coordination and other potentially underspecified structures of the dependency data input. F-structure is one of the two core levels of syntactic representation in LFG (Bresnan, 2001).
Independently of surface order, it encodes abstract syntactic functions that constitute predicate-argument structure and other dependency relations such as subject, predicate, adjunct, but also further semantic information such as the semantic type of an adjunct (e.g. directional). Normally f-structure is captured as a recursive attribute-value matrix, which is isomorphic to a directed graph representation. Figure 5 depicts an example target f-structure. As mentioned earlier, these deeper-level dependency relations can be used to construct logical forms as in the approaches of van Genabith and Crouch (1996), who construct underspecified discourse representations (UDRSs), and Spreyer and Frank (2005), who have robust minimal recursion semantics (RMRS) as their target representation. We therefore think that f-structures are a suitable target representation for automatic syntactic analysis in a larger pipeline of mapping text to interpretation. In this paper, we report on the conversion from dependency structures to f-structure. Firstly, we evaluate the f-structure conversion in isolation, starting from hand-corrected dependencies based on the TüBa-D/Z treebank and Versley's (2005) conversion. Secondly, we start from tokenized text to evaluate the combined process of automatic parsing (using Foth and Menzel's (2006) parser) and f-structure conversion. As a test set, we randomly selected 100 sentences from TüBa-D/Z which we annotated using a scheme very close to that of the TiGer Dependency Bank (Forst et al., 2004). In the next section, we sketch dependency analysis, the underlying theory of our input representations, and introduce four different representations of coordination. We also describe Weighted Constraint Dependency Grammar (WCDG), the dependency parsing formalism that we use in our experiments. Section 3 characterises the conversion of dependencies to f-structures.
Our evaluation is presented in section 4, and finally, section 5 summarises our results and gives an overview of problems remaining to be solved.
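The abstract above describes f-structure as a recursive attribute-value matrix that corresponds to a directed graph. A minimal sketch of that correspondence follows, assuming the matrix is encoded as nested Python dictionaries and set-valued attributes as lists. The function name `avm_to_edges`, the node labels `f0`, `f1`, ..., and the example sentence are hypothetical illustrations, not the authors' conversion procedure, and structure sharing (reentrancy) is not modelled.

```python
from itertools import count

def avm_to_edges(avm):
    """Flatten a nested attribute-value matrix into labelled graph edges."""
    ids = count()   # source of fresh node ids: f0, f1, ...
    edges = []      # (source node, attribute, target node or atomic value)

    def visit(structure):
        node = f"f{next(ids)}"
        for attr, value in structure.items():
            if isinstance(value, dict):
                # Embedded f-structure: edge to a new node.
                edges.append((node, attr, visit(value)))
            elif isinstance(value, list):
                # Set-valued attribute (e.g. adjuncts): one edge per member.
                for member in value:
                    edges.append((node, attr, visit(member)))
            else:
                # Atomic value: edge to a leaf.
                edges.append((node, attr, value))
        return node

    visit(avm)
    return edges

# Hypothetical f-structure for a simple German clause ("Maria schläft oft").
f_structure = {
    "PRED": "schlafen<SUBJ>",
    "TENSE": "pres",
    "SUBJ": {"PRED": "Maria", "NUM": "sg"},
    "ADJUNCT": [{"PRED": "oft"}],
}
edges = avm_to_edges(f_structure)
for edge in edges:
    print(edge)
```

Each triple is one labelled arc of the graph, so the nested and the flat views carry the same information for tree-shaped structures like this one.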
The unusual development of the PDE [present-day English] s-genitive can be historically motivated if the 's form is taken to be not a mere leftover of Old English (henceforth OE) case marking, but the outcome of the merging of two patterns, the inflectional genitive ending (levelled to -s) and the construction "John his book" (henceforth 'possessive-linked genitive'), during the Middle and Early Modern English phases.
As my corpus analysis will show, the semantic and syntactic constraints governing the occurrence of the 's pattern during the period of its rise (1400–1650) are the same as those governing the occurrence of the possessive-linked genitive.
This hypothesis is further confirmed by cross-language comparison (with the other West Germanic languages, especially Afrikaans).
From a critical perspective, this article addresses images of the German language and its speakers presented by Brazilian university students who are interested in learning the language or are at an initial stage of learning it. The aim is to discuss evidence of stereotyped images and to reflect on possible consequences of these images in and for the process of teaching and learning the language. In doing so, the article points out the importance of treating, as a relevant goal in the teaching of German, the need for a path of "de-estrangement" from the language, by means of a methodological approach geared to supporting learners in developing intercultural competence. These reflections are based on theoretical assumptions such as the notions of "other" and "own", the concepts of intercultural competence and intercultural teaching, and results obtained in research carried out in the field of foreign language (German) teaching and learning.
Review of Lorenz Hofer, Sprachwandel im städtischen Dialektrepertoire. Eine variationslinguistische Untersuchung am Beispiel des Baseldeutschen. Tübingen: A. Francke Verlag 1997 (Basler Studien zur deutschen Sprache und Literatur 72, xiv + 306 pp., DM 68.00, ISBN 3-7720-2671-0)
This paper examines 232 family nicknames in Pučišća on the island of Brač. Family nicknames, an additional means of identification that developed as early as the pre-surname period and later became increasingly common because of the large number of bearers of individual surnames, are a distinctive feature of the Croatian islands that has not yet been sufficiently studied. In Pučišća, family nicknames have been recorded since the end of the 16th century. On the basis of their motivation it is possible to partially reconstruct the stock of personal names (the relationship between Croatian folk names and Croatian and more recent Romance adaptations of Christian names), physical appearance (especially bodily defects), character traits (mostly unconventional ones), and the origin and everyday life of the inhabitants of Pučišća. The stock of family nicknames is considerably more open to foreign-language systems (chiefly Romance) and reflects a kind of thousand-year Croatian-Romance symbiosis on the eastern coast of the Adriatic Sea.
This paper seeks to survey the numerous and varied reflexes of the saint's name Ivan in the Croatian anthroponymic stock, with particular emphasis on the area of southern Dalmatia (including Boka kotorska) and Lower Herzegovina. The introductory part presents the reflexes of the Hebrew male personal name Jehochánán in various Indo-European and non-Indo-European languages. It then explains the origin of the Croatian saint's name Ivan and its reflexes in the Croatian anthroponymic stock, with special emphasis on similarities to and differences from the anthroponymic stocks of closely related South Slavic languages.
This paper attempts to survey the numerous and varied reflexes of the saint's name Juraj in the Croatian anthroponymic system, with particular emphasis on the area of Zažablje (the area between the small river Mislina, east of Metković, and the western borders of the former Republic of Dubrovnik, today the municipality of Dubrovačko primorje, and the area from Hrasno in the north to Neum in the south) and Popovo (south-western Herzegovina). On the basis of selected literature and the author's fieldwork, the paper also sets out some extralinguistic (chiefly historical and sociolinguistic) facts that have given rise to this situation.
On the basis of field and archival research, the paper examines about 300 toponyms of the village of Orahovi Do in the southern part of Popovo. The first part of the paper reviews demographic conditions. The area was exposed to large population migrations, as a result of which twenty times fewer people live there today than at the end of the 15th century. The second part presents a description of the village and data on the local families. The third part examines the local toponymy, in which anthroponymic toponyms predominate.
On the basis of fieldwork, this paper examines the toponymy of Dubljani, a now almost completely abandoned village in Popovo in eastern Herzegovina. Toponyms of anthroponymic origin are the most common in the local toponymy, and they shed light on the former and present-day property and legal organisation of medieval Hum. The toponym Satùlija ('Sanctus Elias') is a reminder of ancient Romance-Croatian contacts, and the toponym Sačìvišće illustrates the very complex dialect picture of eastern Herzegovina.
This paper examines accentual features in the local dialects and toponymy of Zažablje and Popovo. The first part presents the essential phonological, morphological, syntactic and lexical features of the area and compares it with other Štokavian dialects. The central part examines local accentual peculiarities, for example the accentuation of old and contemporary loanwords, the distinctive role of accent, and the reflexes of Proto-Slavic accent paradigms. The final part also sets out some accentual differences among the local dialects, chiefly with regard to ethnic affiliation.
Primarily on the basis of the declension of monosyllabic o-stem nouns, the paper sets out the basic differences in the establishment of accentual types between Klaić's works Rječnik stranih riječi and Naglasni sustav standardnoga hrvatskog jezika and the Školski rječnik hrvatskoga jezika published by the Institut za hrvatski jezik i jezikoslovlje and Školska knjiga. What these works have in common is the consistent application of Neo-Štokavian accentual rules and the effort to systematise accentuation. The differences concern the choice of accentual types, which in Klaić's works are based on older language handbooks and dialectological material, whereas in the Školski rječnik they are based on the reflexes of Proto-Slavic accent paradigms. The paper shows that Klaić's contribution to systematising accentuation in standard Croatian would have been considerably greater had his accentual handbook been printed when it was written. The paper also lists differences in accentuation among contemporary Croatian dictionaries, which are partly a consequence of the lack of orthoepic handbooks and of selective departures from Neo-Štokavian accentual rules.
The material reported on in this paper is part of a set of experiments testing the role of Information Structure in the L2 processing of words. Pitch and duration of four sets of experimental material in German and English are measured and analyzed. The well-known finding that accent boosts duration and pitch is confirmed. Syntactic and lexical means of marking focus, however, do not give the duration and the pitch of a word an extra boost.
My study addresses the problem of marking "notive" definiteness/indefiniteness (notivische Bestimmtheit/Unbestimmtheit) from the perspective of word order in sentences with an object, i.e. in so-called transitive sentences. Relative clauses and sentences in which the verb is discontinuous were not considered, because word order there depends on other factors. The possibility of realising the expression of notive definiteness/indefiniteness grammatically […] is also taken into account.