OPUS 4 | Linguistik

Antecedent selection techniques for high-recall roreference resolution (2007)

We investigate methods to improve the recall in coreference resolution by also trying to resolve those definite descriptions where no earlier mention of the referent shares the same lexical head (coreferent bridging). The problem, which is notably harder than identifying coreference relations among mentions which have the same lexical head, has been tackled with several rather different approaches, and we attempt to provide a meaningful classification along with a quantitative comparison. Based on the different merits of the methods, we discuss possibilities to improve them and show how they can be effectively combined.

Disagreement dissected : vagueness as a source of ambiguity in nominal (co-)reference (2006)

Versley, Yannick

Using a qualitative analysis of disagreements from a referentially annotated newspaper corpus, we show that, in coreference annotation, vague referents are prone to greater disagreement. We show how potentially problematic cases can be dealt with in a way that is practical even for larger-scale annotation, considering a real-world example from newspaper text.

Tagging kausaler Relationen (2005)

Versley, Yannick

In dieser Diplomarbeit geht es um kausale Beziehungen zwischen Ereignissen und Erklärungsbeziehungen zwischen Ereignissen, bei denen kausale Relationen eine wichtige Rolle spielen. Nachdem zeitliche Relationen einerseits ihrer einfacheren Formalisierbarkeit und andererseits ihrer gut sichtbaren Rolle in der Grammatik (Tempus und Aspekt, zeitliche Konjunktionen) wegen in jüngerer Zeit stärker im Mittelpunkt des Interesses standen, soll hier argumentiert werden, dass kausale Beziehungen und die Erklärungen, die sie ermöglichen, eine wichtigere Rolle im Kohärenzgefüge des Textes spielen. Im Gegensatz zu “tiefen” Verfahren, die auf einer detaillierten semantischen Repr¨asentation des Textes aufsetzen und infolgedessen für unrestringierten Text m. E. nicht geeignet sind, wird hier untersucht, wie man dieses Ziel erreichen kann, ohne sich auf eine aufwändig konstruierte Wissensbasis verlassen zu müssen.

Vagueness and referential ambiguity in a large-scale annotated corpus (2009)

Versley, Yannick

In this paper, we argue that difficulties in the definition of coreference itself contribute to lower inter-annotator agreement in certain cases. Data from a large referentially annotated corpus serves to corroborate this point, using a quantitative investigation to assess which effects or problems are likely to be the most prominent. Several examples where such problems occur are discussed in more detail, and we then propose a generalisation of Poesio, Reyle and Stevenson’s Justified Sloppiness Hypothesis to provide a unified model for these cases of disagreement and argue that a deeper understanding of the phenomena involved allows to tackle problematic cases in a more principled fashion than would be possible using only pre-theoretic intuitions.

Decorrelation and shallow semantic patterns for distributional clustering of nouns and verbs (2009)

Versley, Yannick

Distributional approximations to lexical semantics are very useful not only in helping the creation of lexical semantic resources (Kilgariff et al., 2004; Snow et al., 2006), but also when directly applied in tasks that can benefit from large-coverage semantic knowledge such as coreference resolution (Poesio et al., 1998; Gasperin and Vieira, 2004; Versley, 2007), word sense disambiguation (Mc- Carthy et al., 2004) or semantical role labeling (Gordon and Swanson, 2007). We present a model that is built from Webbased corpora using both shallow patterns for grammatical and semantic relations and a window-based approach, using singular value decomposition to decorrelate the feature space which is otherwise too heavily influenced by the skewed topic distribution of Web corpora.

Using the web to resolve coreferent bridging in German newspaper text (2007)

Versley, Yannick

We adopt Markert and Nissim (2005)’s approach of using the World Wide Web to resolve cases of coreferent bridging for German and discuss the strength and weaknesses of this approach. As the general approach of using surface patterns to get information on ontological relations between lexical items has only been tried on English, it is also interesting to see whether the approach works for German as well as it does for English and what differences between these languages need to be accounted for. We also present a novel approach for combining several patterns that yields an ensemble that outperforms the best-performing single patterns in terms of both precision and recall.

From surface dependencies towards deeper semantic representations [Semantic representations] (2006)

Versley, Yannick ; Zinsmeister, Heike

In the past, a divide could be seen between ’deep’ parsers on the one hand, which construct a semantic representation out of their input, but usually have significant coverage problems, and more robust parsers on the other hand, which are usually based on a (statistical) model derived from a treebank and have larger coverage, but leave the problem of semantic interpretation to the user. More recently, approaches have emerged that combine the robustness of datadriven (statistical) models with more detailed linguistic interpretation such that the output could be used for deeper semantic analysis. Cahill et al. (2002) use a PCFG-based parsing model in combination with a set of principles and heuristics to derive functional (f-)structures of Lexical-Functional Grammar (LFG). They show that the derived functional structures have a better quality than those generated by a parser based on a state-of-the-art hand-crafted LFG grammar. Advocates of Dependency Grammar usually point out that dependencies already are a semantically meaningful representation (cf. Menzel, 2003). However, parsers based on dependency grammar normally create underspecified representations with respect to certain phenomena such as coordination, apposition and control structures. In these areas they are too "shallow" to be directly used for semantic interpretation. In this paper, we adopt a similar approach to Cahill et al. (2002) using a dependency-based analysis to derive functional structure, and demonstrate the feasibility of this approach using German data. A major focus of our discussion is on the treatment of coordination and other potentially underspecified structures of the dependency data input. F-structure is one of the two core levels of syntactic representation in LFG (Bresnan, 2001). Independently of surface order, it encodes abstract syntactic functions that constitute predicate argument structure and other dependency relations such as subject, predicate, adjunct, but also further semantic information such as the semantic type of an adjunct (e.g. directional). Normally f-structure is captured as a recursive attribute value matrix, which is isomorphic to a directed graph representation. Figure 5 depicts an example target f-structure. As mentioned earlier, these deeper-level dependency relations can be used to construct logical forms as in the approaches of van Genabith and Crouch (1996), who construct underspecified discourse representations (UDRSs), and Spreyer and Frank (2005), who have robust minimal recursion semantics (RMRS) as their target representation. We therefore think that f-structures are a suitable target representation for automatic syntactic analysis in a larger pipeline of mapping text to interpretation. In this paper, we report on the conversion from dependency structures to fstructure. Firstly, we evaluate the f-structure conversion in isolation, starting from hand-corrected dependencies based on the TüBa-D/Z treebank and Versley (2005)´s conversion. Secondly, we start from tokenized text to evaluate the combined process of automatic parsing (using Foth and Menzel (2006)´s parser) and f-structure conversion. As a test set, we randomly selected 100 sentences from TüBa-D/Z which we annotated using a scheme very close to that of the TiGer Dependency Bank (Forst et al., 2004). In the next section, we sketch dependency analysis, the underlying theory of our input representations, and introduce four different representations of coordination. We also describe Weighted Constraint Dependency Grammar (WCDG), the dependency parsing formalism that we use in our experiments. Section 3 characterises the conversion of dependencies to f-structures. Our evaluation is presented in section 4, and finally, section 5 summarises our results and gives an overview of problems remaining to be solved.

John his book vs. John's book : possession marking in English (2000)

Vezzosi, Letizia

The unusual development of the PDE [present-day English] s-genitive can be historically motivated, if the 's form is supposed to be not a mere leftover of the Old English (henceforth OE) casemarking, but the outcome of the merging of two patterns: the inflectional genitive ending (levelled to -s) and the construction "John his book" (henceforth 'possessive-linked genitive') during the Middle and the Early Modem English phases. As my corpus analysis will show, the semantic and syntactic constraints ruling the occurrence of the 's pattern in the time interval of the rise of the 's-pattern (1400 - 1650) are the same ones as those ruling the occurrence of the possessive-linked genitive. This hypothesis is further confirmed by cross-language comparison (with the other West Germanic languages, especially Afrikaans).

O desestranhamento em relação ao alemão na aprendizagem do idioma : um processo de aproximação ao "outro" sob a perspectiva da competência intercultural (2011)

Viana, Nelson ; de Faria Rozenfeld, Cibele Cecilio

Sob uma perspectiva crítica, são abordadas neste artigo, imagens sobre a língua alemã e seus falantes, apresentadas por estudantes universitários brasileiros, interessados em aprender esse idioma ou engajados em estágio inicial de sua aprendizagem. O propósito é discutir evidências de imagens estereotipadas, bem como refletir sobre possíveis decorrências dessas imagens no/para o processo de ensino-aprendizagem da língua, buscando, com isso, apontar a importância de se considerar como objetivo relevante no ensino de alemão, a necessidade de um percurso de "desestranhamento" do idioma, por meio de enfoque metodológico orientado para auxiliar os aprendizes no processo de desenvolvimento de competência intercultural. Tais reflexões têm como base pressupostos teóricos como as noções de "outro" e de "próprio", as concepções de competência intercultural e de ensino intercultural, e resultados obtidos em pesquisa desenvolvida no campo de ensino e aprendizagem de língua estrangeira (alemão).

[Rezension zu:] Lorenz Hofer. Sprachwandel im städtischen Dialektrepertoire. Eine variationlinguistische Untersuchung am Beispiel des Baseldeutschen. Tübingen: A. Francke Verlag 1997 (Basler Studien zur deutschen Sprache und Literatur 72, xiv + 306 pág., DM 68,00, ISBN 3-7720-2671-0) (2000)

Viaro, Mário Eduardo

Rezension zu Lorenz Hofer, Sprachwandel im städtischen Dialektrepertoire. Eine variationslinguistische Untersuchung am Beispiel des Baseldeutschen. Tübingen: A. Francke Verlag 1997 (Basler Studien zur deutschen Sprache und Literatur 72, xiv + 306 S., 68,00 DM, ISBN 3-7720-2671-0)

Open Access

Linguistik

Refine

Author

Year of publication

Document Type

Language

Has Fulltext

Is part of the Bibliography

Keywords

Institute

2991 search hits