400 Sprache
Refine
Year of publication
- 2005 (92) (remove)
Document Type
- Article (30)
- Part of a Book (28)
- Conference Proceeding (15)
- Preprint (9)
- Book (4)
- Report (3)
- diplomthesis (1)
- Other (1)
- Working Paper (1)
Language
- English (65)
- German (24)
- Danish (1)
- Portuguese (1)
- Turkish (1)
Has Fulltext
- yes (92)
Is part of the Bibliography
- no (92)
Keywords
- Phonetik (7)
- Deutsch (4)
- Englisch (3)
- Indogermanische Sprachen (3)
- Informationsstruktur (3)
- Lexikologie (3)
- Slawische Sprachen (3)
- Übersetzung (3)
- Baltische Sprachen (2)
- Familienname (2)
Institute
- Extern (18)
- Neuere Philologien (2)
- Sprachwissenschaften (1)
Seit der zweiten Hälfte des letzten Jahrhunderts zieht die Übersetzung im Fremdsprachenunterricht (FSU) das Interesse der Fremdsprachendidaktiker auf sich. In den anhaltenden Diskussionen über den Stellenwert der Übersetzung im FSU bestehen aber immer noch verschiedene Meinungen. Die Meinungsverschiedenheiten beruhen vor allem auf diversen miteinander konkurrierenden Lerntheorien und damit auch auf unterschiedlichen methodischen Prinzipien. Im Zusammenhang mit den herrschenden didaktischen Richtungen und mit den unterschiedlichen Lernzielen, die im Fremdsprachenunterricht verfolgt werden können, wird auch die Übersetzung unter mehreren Gesichtspunkten betrachtet und bewertet. Hinsichtlich der Funktion der Übersetzung ist es inzwischen üblich geworden, zwischen zwei Verwendungsweisen zu unterscheiden: Einerseits wird die Übersetzung als ein methodisches Mittel zur Festigung, Erweiterung und Prüfung sprachlicher Fertigkeiten angewendet, andererseits ist sie als eine eigene Fertigkeit selbst ein Übungs- und Unterrichtsziel.
Man kann die Menschheit in zwei Gruppen teilen: in solche, die sich an fremden Orten verbal anschmiegen und die lokale Sprache in sich aufnehmen als wäre es schon immer die eigene gewesen und solche, die ihre Sprache an einem fremden Ort beibehalten und sich kaum merklich oder gar nicht von ihrer neuen Umgebung sprachlich beeinflussen lassen. Daher fällt einem meist auf, ob jemand beispielsweise in Solothurn seinen Berner, Basler oder Thurgauer Dialekt beibehalten hat, oder aber man beobachtet – vielleicht sodann mit einer gewissen Skepsis –, dass ein zugezogener Mensch das Solothurnische langsam annimmt und seinen eigenen Dialekt nach und nach zu verlernen scheint. ...
Ausgangspunkt: Die Kritik am "Zwei-Welten-Modell": Die grundlegende linguistische Unterscheidung zwischen "Sprache" und "Sprechen" ist im Rahmen der neueren Debatten um Sprachmedialität wieder verstärkt thematisiert und kritisiert worden. Lässt sich dieses schulbildende, in der Linguistik geradezu eherne Begriffspaar überhaupt noch sinnvollerweise aufrechterhalten? Oder muss es mindestens umdefiniert, vielleicht sogar gänzlich verworfen werden? Hat sich insbesondere die auf Chomsky zurückgehende Unterscheidung von Sprachkompetenz und -performanz nicht von selbst ad absurdum geführt, nachdem der linguistische Kognitivismus chomskyscher Provenienz Sprache als lebendiges Phänomen, als Medium menschlicher Kommunikation, vollständig aus dem Blick verloren hat? Führt nicht schon die scheinbar harmlose linguistische Differenzierung zwischen einer Sprachregel und ihrer Anwendung zu einer irreführenden und unangemessenen Verdinglichung von Sprache? ...
Die moderne Gesellschaft ist von Veränderungen epistemischer und institutioneller Strukturmerkmale der Wissenschaft geprägt, die ihrerseits einen Wandel in anderen Bereichen der Gesellschaft auslösen. In diesem Zusammenhang – wie auch in der neuzeitlichen Wissenschaftsentwicklung überhaupt – kommt der Sprachlichkeit, dem Kulturphänomen "Wissenschaftssprache", eine eminente Rolle zu, etablierte sich doch in den letzten Jahrzehnten eine „linguistische Teildisziplin der Wissenschaftssprachforschung“ (vgl. KRETZENBACHER 1992: 1; HESS-LÜTTICH 1998). "Wissenschaft" scheint mir jedoch ein (interkultureller) Problembegriff zu sein, beispielsweise auch schon deswegen, da dieses Wort (samt seinen Ableitungen wie Wissenschaftler, wissenschaftlich, Wissenschaftlichkeit) stark kulturbedingt ist (vgl. CLYNE/KREUTZ 2003: 60); so korreliert etwa der deutsche Terminus Wissenschaft nicht mit dem englischen science etc. Das Englische kann zweifellos auf eine konkurrenzlose Karriere als wissenschaftliche Universalsprache zurückblicken: Wissenschaftler – auch deutschsprachige – bedienen sich bei der Veröffentlichung wichtiger Forschungsergebnisse zunehmend der englischen Sprache. Der Anteil der wissenschaftlichen Publikationen auf Englisch beträgt heute weltweit über 90 Prozent, während nur noch wenige Prozent des wissenschaftlichen Publikationsaufkommens deutschsprachig sind. Auch die Zahl der wissenschaftlichen Tagungen (selbst im deutschen Sprach- und Kulturraum), die ausschließlich Englisch als Konferenzsprache zulassen, nimmt stetig zu. Außerdem werden immer mehr Vorlesungen bzw. ganze Studiengänge an sonst deutschsprachigen Universitäten in Englisch angeboten. „Die Spitzenforschung spricht englisch“ – stellte der spätere Präsident der Max-Planck-Gesellschaft, Hubert Markl, bereits vor zwanzig Jahren lapidar fest (Quelle: DUZ, 22/2002, S. 12). Gleichwohl wird immer wieder – oft etwas euphorisch – auf Ostmittel-, Ost- und Südosteuropa verwiesen, die traditionell als ein Refugium des Deutschen u.a. auch als Wissenschaftssprache galten bzw. auf weiten Strecken nach wie vor gelten. So kann exemplarisch die „Physikalische Zeitschrift der Sowjetunion“ erwähnt werden, die von 1932 bis 1937 auf Deutsch erschien. In diesem interessanten und zugleich äußerst komplexen Spannungsfeld soll es sich im vorliegenden Beitrag um das Thema ‘Sprachen in den Wissenschaften’ als Denk- und Darstellungsmedia handeln. Dabei soll zum einen die Problematik der Mehrsprachigkeit der Wissenschaften (mit besonderer Berücksichtigung des Deutschen) im mehrsprachigen, multikulturellen und kultursensiblen Kontaktraum Mittel- und Osteuropa angesprochen werden, zum anderen – weil ja auf unserer Tagung auch andere Teilareale, wie z.B. Rumänien, vertreten sind – soll der besondere Schwerpunkt auf Ungarn liegen. Hauptziel der Erörterungen besteht darin, die Entwicklung der in dieser Region wirksamen Wissenschaftssprachen diachron herauszuarbeiten, den derzeitigen Stand für die Bereiche Sprachen in der akademischen Lehre, Forschungssprachen (d.h. Sprachen der Forschungskommunikation) und Publikationssprachen – auch mit Hilfe empirischer Daten – mehrperspektivisch zu dokumentieren und aktuelle Tendenzen reflektorisch aufzuzeigen.
Wie formt sich die Sprache im Kopf? : Kognitive Linguistik: Sprache, Grammatik und die Wissenswelten
(2005)
In this paper I will discuss the formation of different types of yes/no questions in Serbian (examples in (1)), focusing on the syntactically and semantically puzzling example (1d), which involves the negative auxiliary inversion. Although there is a negative marker on the fronted auxiliary, the construction does not involve sentential negation. This coincides with the fact that the negative quantifying NPIs cannot be licensed. The question formation and sentential negation have similar syntactic effects cross-linguistically. This has led to various attempts to formulate a unifying syntactic account of the phenomena (ever since Klima 1964). One striking fact about the two syntactic contexts is that both license weak NPIs (Negative Polarity Items). It has been suggested (cf. Laka 1990, Culicover 1991) that the derivation of both interrogatives and negatives involves the same type of functional projection PolP (polarity phrase). One such account of the formation of negative interrogatives in Serbo- Croatian is offered by Progovac (2005). She proposes that there are two PolPs optionally cooccurring in the same clause, in which both positive and negative polarity items check their positive or negative features (following Haegeman and Zanuttini (1991) feature-checking account of negative structures, and the insights of Brown(1999) on the negation in Russian). On her account, the negative auxiliary question in (1d), is the case when both polarity phrases are present. The higher has [-pos +neg] features, and the lower one (below TP) is [-pos -neg]. Although her account correctly predicts the ungrammaticality of (2a) in contrast with (1c), it wrongly predicts the (2b) to be grammatical. I will argue that Progovac’s theory regarding the nature of the PolP is wrong. It employs both the binary feature valuation on the polarity head and the hierarchical ordering of the two polarity phrases, which eventually leads to overgeneration. On the account presented here the nature of the question marker (li vs zar) is highly relevant. Notice that (1b) and (1d) express presuppositions regarding the truth value of the propositions. In this way they contrast with (1a) and (1c). In addition, the type (1b) (with the question particle zar) can introduce both the positive and negative presupposition as shown in (3), which, semantically, makes this construction compatible with negative auxiliary questions in English (4a). The polarity items licensed in the relevant structures are also of the same type in both languages. The fronted-negative-auxiliary questions (1d) in Serbian are only possible with the particle li. In this case the presupposition is exclusively positive. The peculiar question/focus marking function of li (in Bulgarian and Russian) is well known. However, it is always assumed that its focus marking role is not relevant for the formation of yes/no questions. This I believe is not correct. The syntactic explanation of the interpretational facts points to the following: A) The possibility of the separate lexical encoding (particle zar) of the ‘rhetorical’ yes/no questions in Serbian allows the embedding of both positive and negated sentences, in which case the (weak) NPIs can remain in local relation with the negated verb. B) Recall that Serbian is an NC language, which requires local/c-command relation between the verbal negative marker and the NPI. With the negative inverted auxiliary questions this condition is not met, and the licensing of an n-word is not possible. C) The impossibility of licensing a weak NPI (i-words in the examples below) is due to the nature of the question marker li. (1) a. Da li je Vera videla ikoga / nekoga / *nikoga? DA Q aux Vera see.part.F.Sg anyone someone noone “Did Vera see anyone/someone/noone?” b. Zar je Vera videla ikoga / nekoga / *nikoga? ZAR aux Vera see.part.F.Sg anyone someone noone “Is it really the fact that Vera saw anyone/someone?” c. Je li Vera videla ikoga / nekoga /*nikoga? aux Q Vera see.part.F.Sg anyone someone noone “Did Vera see anyone/someone/noone?” d. Nije li Vera videla *ikoga / nekoga / *nikoga? neg+aux Q Vera see.part.F.Sg anyone someone noone “Didn’t Vera see someone?”/ “Vera saw someone, didn’t she?” (2) a. *Nije li Vera videla nikoga? neg+aux Q Vera see.part.F.Sg noone b. *Nije li Vera videla ikoga? neg+aux Q Vera see.part.F.Sg anyone (3) a. Zar je Vera videla nekoga / ikoga? ZAR aux Vera see.part.F.Sg someone/anyone b. Zar Vera nije videla nekoga/nikoga? ZAR Vera neg+aux see.part.F.Sg someone/anyone (4) a. Didn’t Vera (NOT) see someone/anyone? b. Vera saw someone, didn’t she?
Theories of cognition that are based on information processing and representation are reactive (Rosen, 1985) or backwards looking, not anticipatory. In a previous article (Thibault, 2005a), I looked at the reasons why humans and bonobos do not need an innate language faculty in order to be minded, languaging beings. The present article takes up some of the questions explored there, but, it asks, on the other hand, what sort of a minded agent has language and what kind of account of language and more broadly meaning do we need to explain minded, languaged agents and the activities they participate in? Following Rosen (1985), I also take up and further develop a point first raised in Thibault (2004a: 187) on language as an anticipatory system, rather than a reactively ‘representational’ one (see also Bickhard, 2005).
Woher kommt das neuerwachte Interesse an Sprachrichtigkeit? Woher kommt die ausgeprägte sprachliche Unsicherheit, die auch bei vielen hochgebildeten Menschen den Wunsch entstehen lässt, von Sprachpflegern über ihr Ureigenstes, nämlich ihre Muttersprache, belehrt zu werden? Obwohl Antworten auf diese Fragen letztlich spekulativ bleiben, wage ich doch die These, dass eine Ursache hierfür die Rechtschreibreform ist, die von einem Großteil der Bevölkerung nach wie vor nicht angenommen wird, die insgesamt weder zur Vereinfachung noch zu einer höheren Einheitlichkeit geführt hat; die aber andererseits ein öffentliches Nachdenken und Diskutieren über Sprachrichtigkeit in Gang setzte. – Jedenfalls ist die Verunsicherung ein Faktum, das von Linguisten nicht ignoriert werden sollte.
Fronting of an infinite VP across a finite main verb - akin to German "VP-topicalization" - can be found also in Czech and Polish. The paper discusses evidence from large corpora for this process and some of its properties, both syntactic and information-structural. Based on this case, criteria for more user-friedly searching and retrieval of corpus data in syntactic research are being developed.
The contribution of von Kempelen’s “Mechanism of Speech” to the ‘phonetic sciences‘ will be analyzed with respect to his theoretical reasoning on speech and speech production on the one hand and on the other in connection with his practical insights during his struggle in constructing a speaking machine. Whereas in his theoretical considerations von Kempelen’s view is focussed on the natural functioning of the speech organs – cf. his membraneous glottis model – in constructing his speaking machine he clearly orientates himself towards the auditory result – cf. the bag pipe model for the sound generator used for the speaking machine instead. Concerning vowel production his theoretical description remains questionable, but his practical insight that vowels and speech sounds in general are only perceived correctly in connection with their surrounding sounds – i.e. the discovery of coarticulation – is clearly a milestone in the development of the phonetic sciences: He therefore dispenses with the Kratzenstein tubes, although they might have been based on more thorough acoustic modelling. Finally, von Kempelen’s model of speech production will be discussed in relation to the discussion of the acoustic nature of vowels afterwards [Willis and Wheatstone as well as von Helmholtz and Hermann in the 19th century and Stumpf, Chiba & Kajiyama as well as Fant and Ungeheuer in the 20th century].
Die deutsche Präposition-Artikel-Enklise bietet wie kaum eine andere Grammatikalisierung Einblicke in den Mikrobereich von Grammatikalisierungsprozessen: Klare, "zielorientierte" Verhältnisse sind hier nicht zu beschreiben, was der Grund für ihre bisher so geringe Beachtung durch die Grammatikalisierungsforschung sein dürfte. Es wurde deutlich, dass bezüglich der hier als zentral bewerteten Morphologisierung des Artikels das gesamte Spektrum von Nichtverschmelzbarkeit bis hin zu (kurz vor Flexiven stehenden) obligatorisch verschmelzenden speziellen Klitika abgedeckt ist. Diachron hat sich zwar insgesamt eine deutliche Rechtsdrift auf der Grammatikalisierungsskala vollzogen; bezüglich des Genitivartikels hat jedoch eine Degrammatikalisierung in Form von sog. retraction (gemäß Hapelmath 2004) stattgefunden, die hier in einer Demorphologisierung (Resyntaktisierung) eines Klitikons besteht. Dabei findet keine "Relexikalisierung" im Sinne einer lexikalischen Anreicherung eines bereits grammatikalisierten Elements statt (siehe hierzu Haspelmath 1999). Mittel- und frühneuhochdeutsche Verschriftungen deuten auf reichere Inventare an Verschmelzungs formen hin, doch sind hierzu diachrone Untersuchungen erforderlich. Ebenso ist der Übergangsbereich zwIschen einfachen und speziellen Klitika in sich abgestuft und weitaus komplexer gestaltet als hier dargestellt. Auch dazu besteht Bedarf an Detailanalysen unter der Fragestellung, welche der unter Abschnitt 2.2 aufgeführten Artikelfunkttonen am ehesten eine Präposition-Artikel-Verschmelzung erfordern. Einiges deutet auf den am stärksten desemantisierten (expletiven) Artikel z.B. vor Eigennamen hin. Um den Einfluss von Schriftlichkeit und Standardisierung auf Grammatikalisierungsprozesse ermitteln zu können, wurden zwei Dialekte in den Blick genommen: das Ruhrdeutsche, das die Erwartung nach deutlich fortgeschritteneren Verhältnissen erfüllt, und das Alemannische, das andere Phänomene ausgebildet hat wie etwa die Proklise des Artikels an das Substantiv, die Nullrealisierung klitischer Artikelformen und den kategorialen Umbau der vier Nominalkategorien am Artikel. Die Einbeziehung weiterer Dialekte und vor allem auch der gesprochenen "Umgangssprache" könnte weiteren Aufschluss über die Ratio dieser Grammatikalisierung liefern. Sollten flektierende Präpositionen Ziel dieses Wandels sein, so hätte dies tiefgreifende Konsequenzen für die Grammatikschreibung.
Dutch has a three-way contrast in labiodental sounds, which causes problems for native speakers of German in their acquisition of Dutch, since German contrasts only two labiodentals. The present study investigates the perception of the Dutch labiodental fricative system by German L2 learners of Dutch and shows that native Germans with no or little knowledge of the Dutch language categorize the Dutch labiodental voiced fricative and approximant as their native voiced fricative. Advanced learners, however, succeed in acquiring a category for the voiced fricative, illustrating that plasticity in the perception of a second language develops with the amount of exposure to the language.
This paper describes the creation and preparation of TUSNELDA, a collection of corpus data built for linguistic research. This collection contains a number of linguistically annotated corpora which differ in various aspects such as language, text sorts / data types, encoded annotation levels, and linguistic theories underlying the annotation. The paper focuses on this variation on the one hand and the way how these heterogeneous data are integrated into one resource on the other hand.
Typology and complexity
(2005)
For the Workshop I was asked to talk about complexity in language from a typological perspective. My way of approaching this topic was to ask myself some questions, and then see where the answers led. The first one was of course, "What sort of system are we looking at complexity in - what kind of system is language?"
This paper profiles significant differences in syntactic distribution and differences in word class frequencies for two treebanks of spoken and written German: the TüBa-D/S, a treebank of transliterated spontaneous dialogs, and the TüBa-D/Z treebank of newspaper articles published in the German daily newspaper ´die tageszeitung´(taz). The approach can be used more generally as a means of distinguishing and classifying language corpora of different genres.
This paper addresses remarks made by Flemming (2003) to the effect that his analysis of the interaction between retroflexion and vowel backness is superior to that of Hamann (2003b). While Hamann maintained that retroflex articulations are always back, Flemming adduces phonological as well as phonetic evidence to prove that retroflex consonants can be non-back and even front (i.e. palatalised). The present paper, however, shows that the phonetic evidence fails under closer scrutiny. A closer consideration of the phonological evidence shows, by making a principled distinction between articulatory and perceptual drives, that a reanalysis of Flemming’s data in terms of unviolated retroflex backness is not only possible but also simpler with respect to the number of language-specific stipulations.
The semantics of ellipsis
(2005)
There are four phenomena that are particularly troublesome for theories of ellipsis: the existence of sloppy readings when the relevant pronouns cannot possibly be bound; an ellipsis being resolved in such a way that an ellipsis site in the antecedent is not understood in the way it was there; an ellipsis site drawing material from two or more separate antecedents; and ellipsis with no linguistic antecedent. These cases are accounted for by means of a new theory that involves copying syntactically incomplete antecedent material and an analysis of silent VPs and NPs that makes them into higher order definite descriptions that can be bound into.
Language planning for the Irish language in the Republic of Ireland has featured prominently in international language policy and planning literature over the years. Researchers in the field may not be up to date, however, with recent developments in the area of Irish language planning and their impact on the language ecology. This monograph describes the language planning situation in the Republic of Ireland in its historical and social contexts as well as delineating language policy and planning for the Irish language implemented over the past number of years, showing developments in education, community, media, religion and local politics.
Articulatory token-to-token variability not only depends on linguistic aspects like the phoneme inventory of a given language but also on speaker specific morphological and motor constraints. As has been noted previously (Perkell (1997), Mooshammer et al. (2004)) , speakers with coronally high "domeshaped" palates exhibit more articulatory variability than speakers with coronally low "flat" palates. One explanation for that is based on perception oriented control by the speaker. The influence of articulatory variation on the cross sectional area and consequently on the acoustics should be greater for flat palates than for domeshaped ones. This should force speakers with flat palates to place their tongue very precisely whereas speakers with domeshaped palates might tolerate a greater variability. A second explanation could be a greater amount of lateral linguo-palatal contact for flat palates holding the tongue in position. In this study both hypotheses were tested.
A survey of 170 Tibeto-Burman languages showed 69 with a distinction between inclusive and exclusive first-person plural pronouns, 18 of which also show inclusive- exclusive in Idual. Only the Kiranti languages and some Chin languages have inclusive-exclusive in the person marking. Of the forms of the pronouns involved in the inclusive-exclusive opposition, usually the exclusive form is less marked and historically prior to the inclusive form, and we find the distinction cannot be reconstructed to Proto-Tibeto-Burman or to mid level groupings. Qnly the Kiranti group has marking of the distinction that can be reconstructed to the proto level, and this is also reflected in the person-marking system.
The epistemic step
(2005)
The present study shows that though retroflex segments can be considered articulatorily marked, there are perceptual reasons why languages introduce this class into their phoneme inventory. This observation is illustrated with the diachronic developments of retroflexes in Norwegian (North- Germanic), Nyawaygi (Australian) and Minto-Nenana (Athapaskan). The developments in these three languages are modelled in a perceptually oriented phonological theory, since traditional articulatorily-based features cannot deal with such processes.
Tagging kausaler Relationen
(2005)
In dieser Diplomarbeit geht es um kausale Beziehungen zwischen Ereignissen und Erklärungsbeziehungen zwischen Ereignissen, bei denen kausale Relationen eine wichtige Rolle spielen. Nachdem zeitliche Relationen einerseits ihrer einfacheren Formalisierbarkeit und andererseits ihrer gut sichtbaren Rolle in der Grammatik (Tempus und Aspekt, zeitliche Konjunktionen) wegen in jüngerer Zeit stärker im Mittelpunkt des Interesses standen, soll hier argumentiert werden, dass kausale Beziehungen und die Erklärungen, die sie ermöglichen, eine wichtigere Rolle im Kohärenzgefüge des Textes spielen. Im Gegensatz zu “tiefen” Verfahren, die auf einer detaillierten semantischen Repr¨asentation des Textes aufsetzen und infolgedessen für unrestringierten Text m. E. nicht geeignet sind, wird hier untersucht, wie man dieses Ziel erreichen kann, ohne sich auf eine aufwändig konstruierte Wissensbasis verlassen zu müssen.
Face-to-face communication is multimodal. In unscripted spoken discourse we can observe the interaction of several “semiotic layers”, modalities of information such as syntax, discourse structure, gesture, and intonation. We explore the role of gesture and intonation in structuring and aligning information in spoken discourse through a study of the co-occurrence of pitch accents and gestural apices. Metaphorical spatialization through gesture also plays a role in conveying the contextual relationships between the speaker, the government and other external forces in a naturally-occurring political speech setting.
Elke Kasimir´s paper (in this volume) argues against employing the notion of Givenness in the explanation of accent assignment. I will claim that the arguments against Givenness put forward by Kasimir are inconclusive because they beg the question of the role of Givenness. It is concluded that, more generally, arguments against Givenness as a diagnostic for information structural partitions should not be accepted offhand, since the notion of Givenness of discourse referents is (a) theoretically simple, (b) readily observable and quantifiable, and (c) bears cognitive significance.
Die Datenbank wird auf den Ergebnissen der Analyse einschlägiger umfangreicher Korpora des gesprochenen Deutsch basieren. Um jedoch große Korpora analysieren zu können, ist es notwendig, automatische Analyseverfahren der Variation zu entwickeln. Mit traditionellen manuellen Methoden kann der Aufbau einer korpusbasierten Datenbank kaum verwirklicht werden. Dem eigentlichen Variationsprojekt wurde daher eine kleine Pilotstudie vorgeschaltet, die die Möglichkeiten der automatischen Analyse prüfen sollte. Dabei wurde der Frage nachgegangen, ob es möglich ist, regionale Varianten des Deutschen mit Verfahren der automatischen Spracherkennung zu untersuchen, d.h., ob es möglich ist, eine verlässliche Transkription der regionalen Varianten automatisch herzustellen. Diese Pilotstudie zur automatischen Transkription stützte sich auf das im IDS bereits vorhandene System SPRAT (Speech Recognition and Alignment Tool), das zum Alignieren (Text-Ton-Synchronisation) verwendet wird. Im Rahmen der Pilotstudie wurde dieses System modifiziert und in einer Reihe von Tests dessen automatische Transkription evaluiert (vgl. Abschnitt 3). Das Ziel des vorliegenden Beitrags ist es, die Ergebnisse dieser Pilotstudie vorzustellen. Zunächst aber soll ein kurzer Exkurs verdeutlichen, um welches System es sich beim IDS-Aligner SPRAT handelt.
This paper develops a framework for TAG (Tree Adjoining Grammar) semantics that brings together ideas from different recent approaches.Then, within this framework, an analysis of scope is proposed that accounts for the different scopal properties of quantifiers, adverbs, raising verbs and attitude verbs. Finally, including situation variables in the semantics, different situation binding possibilities are derived for different types of quantificational elements.
This paper discusses the use of XSLT stylesheets as a filtering mechanism for refining the results of user queries on treebanks. The discussion is within the context of the TIGER treebank, the associated search engine and query language, but the general ideas can apply to any search engine for XML-encoded treebanks. It will be shown that important classes of linguistic phenomena can be accessed by applying relatively simple XSLT templates to the output of a query, effectively simulating the universal quantifier for a subset of the query language.
In order to investigate the empirical properties of focus, it is necessary to diagnose focus (or: “what is focused”) in particular linguistic examples. It is often taken for granted that the application of one single diagnostic tool, the so-called question-answer test, which roughly says that whatever a question asks for is focused in the answer, is a fool-proof test for focus. This paper investigates one example class where such uncritical belief in the question-answer test has led to the assumption of rather complex focus projection rules: in these examples, pitch accent placement has been claimed to depend on certain parts of the focused constituents being given or not. It is demonstrated that such focus projection rules are unnecessarily complex and in turn require the assumption of unnecessarily complicated meaning rules, not to speak of the difficulties to give a precise semantic/pragmatic definition of the allegedly involved givenness property. For the sake of the argument, an alternative analysis is put forward which relies solely on alternative sets following Mats Rooth´s work, and avoids any recourse to givenness. As it turns out, this alternative analysis is not only simpler but also makes in a critical case the better predictions.
Protected Mode
(2005)
Innerhalb der Reihe "GrenzBereiche des Lesens" gehaltener Vortrag. "GrenzBereiche des Lesens" ist eine kulturwissenschaftliche Vortragsreihe, die 2003 und 2004 an der Universität Frankfurt stattfand. Gegenstand von Harald Hillgärtners Untersuchung ist die Frage nach der Lesbarkeit des Computers, vielmehr seiner System- und Programmcodes. Gilt der Computer einerseits als "Textmaschine", die endlose Schreib- und Leseakte prozessiert, so finden jene Programmabläufe doch zumeist jenseits der für alle zugänglichen Benutzeroberflächen statt, die ihrerseits in immer stärkerem Maß mit Icons – Bildern – arbeiten. Und selbst im Falle von frei zugänglichen Software-Codes ist zu fragen, um welche Art Text es sich hier handelt – ob in diesen Fällen gar von Literatur die Rede sein kann. Insofern ist die Frage nach der Lesbarkeit des Computers nicht nur eine Frage nach der Zukunft des Lesens (geht es um Sinn oder um Information?) sondern vielmehr nach dem (Zu-)Stand unserer Schriftkultur selbst.
Plural semantics for natural language understanding : a computational proof-theoretic approach
(2005)
The semantics of natural language plurals poses a number of intricate problems – both from a formal and a computational perspective. In this thesis I investigate problems of representing, disambiguating and reasoning with plurals from a computational perspective. The work defines a computationally suitable representation for important plural constructions, proposes a tractable resolution algorithm for semantic plural ambiguities, and integrates an automatic reasoning component for plurals. My solution combines insights from formal semantics, computational linguistics and automated theorem proving and is based on the following main ideas. Whereas many existing approaches to plural semantics work on a model-theoretic basis using higher-order representation languages I propose a proof-theoretic approach to plural semantics based on a flat firstorder semantic representation language thus showing that a trade-off between expressive power and logical tractability can be found. The problem of automatic disambiguation of plurals is tackled by a deliberate decision to drastically reduce recourse to contextual knowledge for disambiguation but rely instead on structurally available and thus computationally manageable information. A further central aspect of the solution lies in carefully drawing the borderline between real ambiguity and mere indeterminacy in the interpretation of plural noun phrases. As a practical result of my computational proof-theoretic approach to plural semantics I can use my methods to perform automated reasoning with plurals by applying advanced firstorder theorem provers and model-generators available off-the shelf. The results are prototypically implemented within the two logic-oriented natural language understanding applications DRoPs and Attempto. DRoPs provides an automatic plural disambiguation component for uncontrolled natural language whereas Attempto works with a constructive disambiguation strategy for controlled natural language. Both systems provide tools for the automated analysis of technical texts allowing users for example to automatically detect inconsistencies, to perform question answering, to check whether a conjecture follows from a text or to find equivalences and redundancies.
When a statistical parser is trained on one treebank, one usually tests it on another portion of the same treebank, partly due to the fact that a comparable annotation format is needed for testing. But the user of a parser may not be interested in parsing sentences from the same newspaper all over, or even wants syntactic annotations for a slightly different text type. Gildea (2001) for instance found that a parser trained on the WSJ portion of the Penn Treebank performs less well on the Brown corpus (the subset that is available in the PTB bracketing format) than a parser that has been trained only on the Brown corpus, although the latter one has only half as many sentences as the former. Additionally, a parser trained on both the WSJ and Brown corpora performs less well on the Brown corpus than on the WSJ one. This leads us to the following questions that we would like to address in this paper: - Is there a difference in usefulness of techniques that are used to improve parser performance between the same-corpus and the different-corpus case? - Are different types of parsers (rule-based and statistical) equally sensitive to corpus variation? To achieve this, we compared the quality of the parses of a hand-crafted constraint-based parser and a statistical PCFG-based parser that was trained on a treebank of German newspaper text.
This paper investigates the structural properties of morphosyntactically marked focus constructions, focussing on the often neglected non-focal sentence part in African tone languages. Based on new empirical evidence from five Gur and Kwa languages, we claim that these focus expressions have to be analysed as biclausal constructions even though they do not represent clefts containing restrictive relative clauses. First, we relativize the partly overgeneralized assumptions about structural correspondences between the out-of-focus part and relative clauses, and second, we show that our data do in fact support the hypothesis of a clause coordinating pattern as present in clause sequences in narration. It is argued that we deal with a non-accidental, systematic feature and that grammaticalization may conceal such basic narrative structures.
The Deep Linguistic Processing with HPSG Initiative (DELH-IN) provides the infrastructure needed to produce open-source semantic transfer-based machine translation systems. We have made available a prototype Japanese-English machine translation system built from existing resources include parsers, generators, bidirectional grammars and a transfer engine.
[D]ie polnischen Familiennamen [unterlagen] bis ins 19. Jahrhundert hinein nur geringer amtlicher Kontrolle [...]. Diese Situation begünstigte den sukzessiven Aufbau onymischer Allomorphik aus den […] Flexions- und Derivationsmorphemen, die ursprünglich zur Bildung von Herkunftsbezeichnungen, Patronymika und Übernamen angewendet wurden. Die sekundäre Nutzung dieser Flexions- und Wortbildungsmorpheme als onymische Suffixe trieb den […] Dissoziationsprozess der Familiennamen voran. Die wachsende Produktivität dieser onymischen Morphe, die bis heute andauert, sicherte ihnen die Spitzenposition unter den Proprialitätsmarkern im polnischen Familiennamensystem. Heute sind die onymischen Allomorphe -ska, -ski, -icz, -ak das wichtigste Mittel, mit dem die Zugehörigkeit eines Wortes zum Onomastikon gekennzeichnet wird. […] In diesem Beitrag werden die Entstehungswege und die Ausbreitungspfade der drei produktivsten Gruppen der polnischen onymischen Suffixe präsentiert. Es werden auch die außersprachlichen Faktoren berücksichtigt, die die Erhöhung der Produktivität durch sukzessive Erweiterung der Kombinationsmöglichkeiten der einzelnen Suffixe ermöglicht haben. Es wird gezeigt, dass die ursprünglichen Selektionsbeschränkungen der Basen mit den Suffixen (Toponyme + -ska-Suffixe, Appellative und Adjektive + k-haltige Suffixe, Vornamen + -icz-Suffixe) im Zuge ihrer Ausbreitung und Festigung aufgegeben wurden. Die onymischen Allomorphe sind heute frei kombinierbar und können im Falle des Namenwechsels zur Bildung eines neuen Namens herangezogen werden.
Since Donald Davidson’s seminal work “The Logical Form of Action Sentences” (1967) event arguments have become an integral component of virtually every semantic theory. Over the past years Davidson´s proposal has been continuously extended such that nowadays event(uality) arguments are generally associated not only with action verbs but with predicates of all sorts. The reasons for such an extension are seldom explicitly justified. Most problematical in this respect is the case of stative expressions. By taking a closer look at copula sentences the present study assesses the legitimacy of stretching the Davidsonian notion of events and discusses its consequences. A careful application of some standard eventuality diagnostics (perception reports, combination with locative modifiers and manner adverbials) as well as some new diagnostics (behavior of certain degree adverbials) reveals that copular expressions do not behave as expected under a Davidsonian perspective: they fail all eventuality tests, regardless of whether they represent stage-level or individual-level predicates. In this respect, copular expressions pattern with stative verbs like know, hate, and resemble, which in turn differ sharply from state verbs like stand, sit, and sleep. The latter pass all of the eventuality tests and therefore qualify as true “Davidsonian state” expressions. On the basis of these empirical observations and taking up ideas of Kim (1969, 1976) and Asher (1993, 2000), an alternative account of copular expressions (and stative verbs) is provided, according to which the copula introduces a referential argument for a temporally bound property exemplification (= “Kimian state”). Considerations on some logical properties, viz. closure conditions and the latent infinite regress of eventualities, suggest that supplementing Davidsonian eventualities with Kimian states may yield not only a more adequate analysis of copula sentences but also a better understanding of eventualities in general.
Dictionaries contain lexicographic data whose occurrence is restricted to certain geo-graphical areas, subject fields, professions, etc. It is part of the duties of the lexicographer to give an account of such deviations to ensure a successful retrieval of the information on the part of the user. This contribution presents a discussion on labelling issues in the Dictionnaire Français–Mpongwé. Although the main focus is on the presentation of different types of labelling as well as problems in labelling, textual condensation procedures and mediostructural representations (to-gether with some aspects of the user perspective) are also critically evaluated. It is shown that these procedures reveal some inconsistencies which are not accounted for in the outer texts (front matter and back matter texts) of the dictionary. Finally suggestions are made for the improvement of the access structure of this dictionary.
Accusations are a very frequent type of speech act both in everyday life and in formal controversies, and answering accusations is a sophisticated type of linguistic practice well worth analysing from a pragmatic point of view. In my paper I shall first describe some basic properties of accusations and characteristic types of reactions to accusations, i. e. denying the alleged fact, making excuses, and giving justifications. I then go on to describe some fundamental functions of accusations in controversies. Using the basic patterns of accusations and reactions to accusations as an object of comparison, I then analyse some relevant exchanges from historical controversies (l6th to 18th century), among them famous polemical interactions like the Hobbes-Bramhall controversy, but also less well-known debates from the fields of medicine and theology. The present paper is both a contribution to the theory of controversy and to the pragmatic history of controversies. Keywords: historical pragmatics, theory of controversy, ad hominem moves, dynamics of controversy
This paper provides an analysis of an alternative strategy to A´-movement in both German and Dutch where the extracted constituent is preceded by a preposition and a coreferential pronoun appears in the extraction site. The construction has properties of both binding and movement: Whereas reconstruction effects suggest movement out of the embedded clause, there is strong evidence that the operator constituent is linked to an A-position in the matrix clause; this paradox is resolved by assuming a Control-like approach that involves movement from the embedded clause into a theta-position in the matrix clause with subsequent short A´- movement. The coreferential pronoun is interpreted as a resumptive heading a Big-DP which hosts the antecedent in its specifier.
The purpose of this article is to report on the work carried out during the research project "O trabalho de tradutor como fonte para a constituição de base de dados" (The translator´s work as a source for the constitution of a database). Through the restoration, organization and digitalization of the personal glossary and part of the books containing the translations made by the deceased public translator Gustavo Lohnefink, this research project intends to construct a digital database of German – Portuguese technical terms (for the language pair), which could then be used by other translators. In order to achieve this purpose, a specific methodology had to be developed, which could be used as a starting-point for the treatment and recovery of other similarly organized data-collections.
It is gratifying to see that Jay Jasanoff has now (2004) adopted my theory that "the Balto-Slavic acute was a kind of stød or broken tone" (p. 172), which I have been advocating since 1973. Unfortunately, his acceptance of my view is not based on an evaluation of the comparative evidence (for which see Kortlandt 1985a) but on his desire to derive Balto-Slavic “acute” and "circumflex" syllables from the "bimoric" and "trimoric" long vowels which he assumes for Proto-Germanic as the reflexes of the Indo-European "acute" and "circumflex" tones of the neogrammarians. Since the original "circumflex" was limited to Indo-European VHV-sequences, Jasanoff proposes a whole series of additional lengthenings yielding "hyperlong" vowels in Germanic, Baltic and Slavic, which still do not suffice to eliminate the counter-evidence (cf. Kortlandt 2004b: 14). The reason for this failure is his unwillingness to recognize that lengthened grade vowels are circumflex in Balto-Slavic (cf. Kortlandt 1997a).
In this paper, we present the Multiple Annotation approach, which solves two problems: the problem of annotating overlapping structures, and the problem that occurs when documents should be annotated according to different, possibly heterogeneous tag sets. This approach has many advantages: it is based on XML, the modeling of alternative annotations is possible, each level can be viewed separately, and new levels can be added at any time. The files can be regarded as an interrelated unit, with the text serving as the implicit link. Two representations of the information contained in the multiple files (one in Prolog and one in XML) are described. These representations serve as a base for several applications.
Trubetzkoy's recognition of a delimitative function of phonology, serving to signal boundaries between morphological units, is expressed in terms of alignment constraints in Optimality Theory, where the relevant constraints require specific morphological boundaries to coincide with phonological structure (Trubetzkoy 1936, 1939, McCarthy & Prince 1993). The approach pursued in the present article is to investigate the distribution of phonological boundary signals to gain insight into the criteria underlying morphological analysis. The evidence from English and Swedish suggests that necessary and sufficient conditions for word-internal morphological analysis concern the recognizability of head constituents, which include the rightmost members of compounds and head affixes. The claim is that the stability of word-internal boundary effects in historical perspective cannot in general be sufficiently explained in terms of memorization and imitation of phonological word form. Rather, these effects indicate a morphological parsing mechanism based on the recognition of word-internal head constituents. Head affixes can be shown to contrast systematically with modifying affixes with respect to syntactic function, semantic content, and prosodic properties. That is, head affixes, which cannot be omitted, often lack inherent meaning and have relatively unmarked boundaries, which can be obscured entirely under specific phonological conditions. By contrast, modifying affixes, which can be omitted, consistently have inherent meaning and have stronger boundaries, which resist prosodic fusion in all phonological contexts. While these correlations are hardly specific to English and Swedish it remains to be investigated to which extent they hold cross-linguistically. The observation that some of the constituents identified on the basis of prosodic evidence lack inherent meaning raises the issue of compositionality. I will argue that certain systematic aspects of word meaning cannot be captured with reference to the syntagmatic level, but require reference to the paradigmatic level instead. The assumption is then that there are two dimensions of morphological analysis: syntagmatic analysis, which centers on the criteria for decomposing words in terms of labelled constituents, and paradigmatic analysis, which centers on the criteria for establishing relations among (whole) words in the mental lexicon. While meaning is intrinsically connected with paradigmatic analysis (e.g. base relations, oppositeness) it is not essential to syntagmatic analysis.