Linguistik
Refine
Year of publication
Document Type
- Conference Proceeding (34) (remove)
Has Fulltext
- yes (34)
Is part of the Bibliography
- no (34)
Keywords
- Japanisch (34) (remove)
Institute
- Extern (10)
This paper hypothesizes that transfer-based machine translation systems can be improved by encoding information structure in both the source and target grammars, and preserving information structure in the transfer stage. We explore how information structure can be represented within the HPSG/MRS formalism (Pollard and Sag, 1994; Copestake et al., 2005) and how it can help refine multilingual MT. Building upon that framework, we provide a sample translation between English and Japanese and check the feasibility of the proposals in small-scale translation systems built with the HPSG/MRS-based LOGON MT infrastructure (Oepen et al., 2007). Our experiment shows the information structure-based MT system that we propose in this paper reduces the number of translations 75.71% for Japanese and 80.23% for Korean. The dramatic reductions in the number of translations is expected to make a contribution to our HPSG/MRS-based MT in terms of latency as well as accuracy.
In this paper, I first make an observation that there is a certain parallelism in the scope interpretation possibilities of adverbs and quantifiers with respect to different types complex predicates in Japanese, drawing on a comparison of the light verb construction and the causative construction. I will then argue that previous approaches to complex predicates in Japanese in the lexicalist tradition (Matsumoto 1996; Manning et al. 1999) fail to capture this generalization successfully. Finally, building on a novel approach to syntax/semantics interface in HPSG by Cipollone (2001), I develop an analysis of the semantic structure of complex predicates that accounts for the empirical observation straightforwardly.
In this paper, I argue (i) that Japanese has constructions that are almost the exact mirror images of the right-node raising constructions in English, and (ii) that the properties of those constructions, which I refer to as left-node raising constructions, can be captured straightforwardly if and only if the CONTENT values of domain objects, not those of signs, are assumed to be the principal locus of meaning assembly. In the theory proposed, it is claimed that semantic composition (including "quantifier retrieval") takes place not when some signs are syntactically combined to produce a new, larger sign but when some domain objects (which are essentially prosodic constituents) are merged (by the total or partial compaction operation) to produce a new domain object (i.e. a new, larger prosodic constituent).
Particles fullfill several distinct central roles in the Japanese language. They can mark arguments as well as adjuncts, can be functional or have semantic functions. There is, however, no straightforward matching from particles to functions, as, e.g., 'ga' can mark the subject, the object or the adjunct of a sentence. Particles can cooccur. Verbal arguments that could be identified by particles can be eliminated in the Japanese sentence. And finally, in spoken language particles are often omitted. A proper treatment of particles is thus necessary to make an analysis of Japanese sentences possible. Our treatment is based on an empirical investigation of 800 dialogues. We set up a type hierarchy of particles motivated by their subcategorizational and modificational behaviour. This type hierarchy is part of the Japanese syntax in VERBMOBIL.
We examine the fine structure of clausal right-node raising constructions in Japanese, and argue that there are sentences in which a tensed verb is right-node-raised out of coordinated tensed clauses as well as sentences in which a verb stem is right-node-raised out of coordinated tenseless phrases. In the latter case, the tense morpheme has to be assumed to take a tenseless complement clause, and we note that the existence of such a structure contradicts the so-called lexicalist hypothesis, according to which a verb stem and the tense morpheme immediately following it always form a morphosyntactic constituent.
Sprachtechnologie für übersetzungsgerechtes Schreiben am Beispiel Deutsch, Englisch, Japanisch
(2009)
Wir [...] haben uns zur Aufgabe gesetzt, Wege zu finden, wie linguistisch basierte Software den Prozess des Schreibens technischer Dokumentation unterstützen kann. Dabei haben wir einerseits die Schwierigkeiten im Blick, die japanische und deutsche Autoren (und andere Nicht-Muttersprachler des Englischen) beim Schreiben englischer Texte haben. Besonders japanische Autoren haben mit Schwierigkeiten zu kämpfen, weil sie hochkomplexe Ideen in einer Sprache ausdrücken müssen, die von Informationsstandpunkt her sehr unterschiedlich zu ihrer Muttersprache ist. Andererseits untersuchen wir technische Dokumentation, die von Autoren in ihrer Muttersprache geschrieben wird. Obwohl hier die fremdsprachliche Komponente entfällt, ist doch auch erhebliches Verbesserungspotential vorhanden. Das Ziel ist hier, Dokumente verständlich, konsistent und übersetzungsgerecht zu schreiben. Der fundamentale Ansatz in der Entwicklung linguistisch-basierter Software ist, dass gute linguistische Software auf Datenmaterial basiert und sich an den konkreten Zielen der besseren Dokumentation orientiert.
Resultative phrases are generally believed to conform to the Direct Object Restriction: that is, they describe the direct object if verbs are transitive. However, some exceptions have occasionally been reported, and this paper investigates the problem by focusing on resultative phrases that occur with the valency alternation verbs in Japanese and Mandarin Chinese. Verbs that license the locative alternation and locatum-subject alternation describe events that involve two arguments, the location and the locatum, which are perceived to concurrently undergo a change of state. It will be shown that resultative phrases with a valency alternation verb can be predicated of either argument regardless of whether it is expressed as direct object. Furthermore, resultative verbal suffixes in Mandarin, interpreted as description of either the location or the locatum, give rise to the locative alternation while their interpretation remains the same. Thus, it is claimed that in Japanese and Mandarin, the predication relation of resultative phrases is not determined by the grammatical function of arguments as generally believed, but rather by the lexical semantics of the verbs.
Whether the Coordinate Structure Constraint (CSC) (Ross, 1967) is a syntactic constraint has been discussed much in the literature. This paper reconsiders this issue by drawing on evidence from Japanese and Korean. Our examination of the CSC patterns in relative clauses in the two languages reveals that a pragmatically-based approach along the lines of Kehler (2002) predicts the relevant empirical patterns straightforwardly whereas alternative syntactic approaches run into many problems. We take these results to provide strong support for the view that the CSC is a pragmatic principle rather than a syntactic constraint.
Preferences and defaults for definiteness and number in japanese to german machine translation
(1996)
A significant problem when translating Japanese dialogues into German is the missing information on number and definiteness in the Japanese analysis output. The integration of the search for such information into the transfer process provides an efficient solution. General transfer includes conditions to make it possible to consider external knowledge. Thereby, grammatical and lexical knowledge of the source language, knowledge of lexical restrictions on the target language, domain knowledge and discourse knowledge are accessible.
We will observe which stem allomorph the affixes, the so-called 'non-past' affix, the past affix, the imperative affix, the negative affix and the voice affix-like verbs, select between the longer and the shorter in Japanese-Yanagawa dialect on the assumption that verbal lexemes may be associated with more than one stem. Observing the phenomenon more closely, we found that the verbal stem forms entertain default implicative relations in the stem dependency hierarchy. We will propose i) an implemented analysis of the past affix and ii) an implementation of the allomorph selections by the 'non-past' affix in Koga and Ono, 2010 as two examples.