Linguistik
Refine
Year of publication
Document Type
- Part of a Book (591) (remove)
Language
- English (591) (remove)
Has Fulltext
- yes (591)
Is part of the Bibliography
- no (591) (remove)
Keywords
- Syntax (79)
- Spracherwerb (63)
- Deutsch (56)
- Phonologie (46)
- Semantik (42)
- Englisch (40)
- Sprachtest (33)
- Thema-Rhema-Gliederung (32)
- Intonation <Linguistik> (25)
- Morphologie (24)
Institute
- Extern (11)
- Sprachwissenschaften (1)
Based on a Relevance Theory-informed view of language development, this paper argues that grammatical relations are construction-specific conventionalizations (grammaticalizations) of implicatures which arise out of repeated patterns of reference to particular types of referents. Once conventionalized, these structures function to constrain the hearer's identification of referents in discourse. As they are construction-specific, and hence language-specific, there is no category "subject" across languages; different languages will either show this type of grammaticalization or not, and if they do, may show it or not in different constructions. Any cross-linguistic use of terms such as "subject" (and "S", as in "SOV") should then be avoided.
Language contact has become a major focus of inquiry in historical and typological linguistics in the last twenty years, spurred in a large part by the publication of Thomason & Kaufman (1988), which tried to make sense of a large amount of language contact data. They argued that there was a direct relationship between the degree or intensity of language contact and the amount and type of influence the contact would have on one or more of the languages involved. Essentially, the greater the degree of bilingualism, the greater the degree of contact influence (see also Thomason 2001); if the contact and bilingualism was minimal, then there might just be a few loanwords adapted to the borrowing language's phonology and grammatical system, but if the contact and bilingualism was of a greater degree there would be influence in the grammar and phonology of the affected language. As more linguists came to take language contact more seriously, they came to realize how common language contact phenomena are.
1. The functionalist’s view: linguistic forms are instruments used to convey meaningful elements. This is the basis of European structuralism. 2. The formalist’s view: linguistic forms are abstract structures which can be filled with meaningful elements. This is the basis of generative grammar. 3. The parasitologist’s view: linguistic forms are vehicles for the reproduction of meaningful elements. This is the view which I advocated twenty years ago in the Festschrift for Werner Winter’s 60th birthday (1985). Here I intend to discuss the evolutionary origin and the physiological nature of the linguistic parasite. My theory of language is wholly consistent with Gerald Edelman’s theory of neuronal group selection.
In his magnificent book on the language relations across Bering Strait (1998), Michael Fortescue does not consider Nivkh (Gilyak) to be a Uralo-Siberian language. Elsewhere I have argued that the Indo-European verbal system can be understood in terms of its Indo-Uralic origins (2001). All of these languages belong to Joseph Greenberg’s Eurasiatic macro-family (2000). In the following I intend to reconsider the grammatical evidence for including Nivkh into the Uralo-Siberian language family. The Indo-Uralic evidence is of particular importance because it guarantees a time depth which cannot otherwise be attained.
Aramaic is not among the oldest Semitic languages in a strictly chronological sense, but among those languages which are still spoken today, it has the longest continuous written tradition. The existing written documents span a period of three millennia and thus enable us to study language history in a long-term perspective. It is very important, in this respect, that the latest stage of development of Aramaic, Neo-Aramaic, still exists in a multitude of spoken varieties which can be studied in vivo. We can thus describe the phonetics and phonology of the modern varieties with more precision than is possible for the older language stages, which in turn enables us to draw conclusions on diachronic sound change. Likewise, we can study morphology and syntax not only from recorded texts, but we also have recourse to native speakers in order to clarify doubtful points. Thus the latest stage of Aramaic casts a strong light back into the past. It is therefore most unfortunate that many Aramaicists and Syrologists show so little interest in this living heritage.
It is by now a weIl-known topic in semantics that there are striking similarities between the meanings of nominal and verbal expressions, insofar as the mass:count distinction in the nominal domain is reflected in the atelic:telic distinction in the verbal domain (cf. Leisi 1953, Taylor 1977, Bach 1986, to cite just a few authors). However, these supposed similarities have not be made explicit in formal representations.
This paper presents an account of semantics as a system that integrates conceptual representations into language. I define the semantic system as an interface level of the conceptual system CS that translates conceptual representations into a format that is accessible by language. The analysis I put forward does not treat the make up of this level as idiosyncratic, but subsumes it under a unified notion of linguistic interfaces. This allows us to understand core aspects of the linguistic-conceptual interface as an instance of a general pattern underlying the correlation of linguistic and non-linguistic structures. By doing so, the model aims to provide a broader perspective onto the distinction between and interaction of conceptual and linguistic processes and the correlation of semantic and syntactic structures.
This volume brings together a cross-section of recent research on the grammar and representation of pronouns, centering around the typology of pronominal paradigms, the generation of syntactic and semantic representations for constructions containing pronouns, and the neurological underpinnings for linguistic distinctions that are relevant for the production and interpretation of these constructions. In this introductory chapter we first give an exposition of our topic (section 2). Taking the interpretation of pronouns as a starting point, we discuss the basic parameters of pronominal representations, and draw a general picture of how morphological, semantic, discourse-pragmatic and syntactic aspects come together. In section 3, we sketch the different domains of research that are concerned with these phenomena, and the particular questions they are interested in, and show how the papers in the present volume fit into the picture. Section 4 gives summaries of the individual papers, and a short synopsis of their main points of convergence.
What is abductive inference?
(2002)
Abductive reasoning: constitutes according to Peirce the "first stage" of scientific inquiries (CP 6.469) and of any interpretive processes. "Abduction" is the process of adopting an explanatory hypothesis (CP 5.145) and covers two operations: the selection and the formation of plausible hypotheses. As process of finding premisses, it is the basis of interpretive reconstruction of causes and intentions, as well as of inventive construction of theories.
Speakers of various Southern german dialects may be heard to use two syntactic variants of subordinate clauses which are represented by the following Swabian examples: (1) daß er den net will komme lasse (2) daß er den net komme lasse will Of these two variants of the three-element verbal complex, only the non-dialect counterpart of (2) is accepted as standard modern written German: (3) daß er ihn nicht kommen lassen will In earlier periods of the German language, however, both variants were used by authors of written texts.
This article is a contribution to historical dialogue analysis, a field of research which has gained momentum in recent years (Fritz 1995, 1997, Gloning 1999, and other articles in Jucker/Fritz/Lebsanft 1999). In the present paper, I report some results of ongoing research from a project on the history of controversies from 1600 to 1800, which Marcelo Dascal and I are conducting at the Universities of Tel Aviv, Israel and Gießen, Germany.
The study of what makes utterances difficult or easy to understand is one of the central topics of research in comprehension. It is both theoretically attractive and useful in practice. The more we know about difficulties in understanding the more we know about understanding. And the better we grasp typical problems of understanding in certain types of discourse and for certain recipients the better we can overcome these problems and the better we can advise people whose job it is to overcome such problems. It is therefore not surprising that comprehensibility has been the object of much reflection as far back as the days of classical rhetoric and that it is a center of lively interest in several present-day scientific disciplines, ranging from artificial intelligence and educational psychology to linguistics.
Coreference in dialogue
(1997)
Since the early days of discourse analysis coreference has always been considered a major factor in the formation of texts and dialogues. The repetition of nominal elements and the anaphoric use of pronouns in successive sentences is a fundamental cohesive pattern which ties sentences together and contributes to the coherence of sequences. "La coherence transphrastique trouve dans la pronominalisation un des procedes les plus efficaces" (Stati 1990, 160). The basic structural pattern on which linguists focused their interest in the early 1970s is captured by the following examples: (1) A man entered the house. After closing the door, the man sat down. He was tired. (2) Peter The man entered the house. He was tired. He ...
This paper argues for a scopal explanation of the readings of the adverb wieder (‘again’). It is the syntactic entity that wieder is related to which determines whether the repetitive or the restitutive reading obtains. If it is adjoined to the minimal verbal domain, it relates to a situation-internal state thus producing a restitutive interpretation, if adjoined to a higher verbal projection, it relates to an eventuality resulting in a repetitive interpretation. Proceeding from the assumption that adverbial adjuncts have base positions which reflect their semantic relations to the rest of the sentence, repetitive wieder is shown to belong to the class of eventuality adverbs that minimally c-command the base positions of all arguments, whereas restitutive wieder has many properties in common with process (manner) adjuncts that minimally c-command the verb in clause-final base position.
The semantics of gradation
(1989)
The term 'gradation' is meant to cover a range of phenomena which for the time being I shall call quantitative evaluations regarding dimensions or features. I shall actually be looking into the principles governing the way gradation is expressed in language. The quantitative aspect of the adjectives of dimension occupies a key position which can be systematically explained and this aspect will be the crucial point of the discussion. I shall focus on the various grammatical forms of comparison: comparative, equative, superlative and some related constructions, and indications of measurement and adverbial indications of degree.
Simplicity as a methodological orientation applies to linguistic theory just as to any other field of research: ‘Occam’s razor’ is the label for the basic heuristic maxim according to which an adequate analysis must ultimately be reduced to indispensible specifications. In this sense, conceptual economy has been a strict and stimulating guideline in the development of Generative Grammar from the very beginning. Halle’s (1959) argument discarding the level of taxonomic phonemics in order to unify two otherwise separate phonological processes is an early characteristic example; a more general notion is that of an evaluation metric introduced in Chomsky (1957, 1975), which relates the relative simplicity of alternative linguistic descriptions systematically to the quest for explanatory adequacy of the theory underlying the descriptions to be evaluated. Further proposals along these lines include the theory of markedness developed in Chomsky and Halle (1968), Kean (1975, 1981), and others, the notion of underspecification proposed e.g. in Archangeli (1984), Farkas (1990), the concept of default values and related notions. An important step promoting this general orientation was the idea of Principles and Parameters developed in Chomsky (1981, 1986), which reduced the notion of language particular rule systems to universal principles, subject merely to parametrization with restricted options, largely related to properties of particular lexical items. On this account, the notion of a simplicity metric is to be dispensed with, as competing analyses of relevant data are now supposed to be essentially excluded by the restrictive system of principles.
Generative grammar
(2001)
Generative Grammar is the label of the most influential research program in linguistics and related fields in the second half of the 20. century. Initiated by a short book, Noam Chomsky's Syntactic Structures (1957), it became one of the driving forces among the disciplines jointly called the cognitive sciences. The term generative grammar refers to an explicit, formal characterization of the (largely implicit) knowledge determining the formal aspect of all kinds of language behavior. The program had a strong mentalist orientation right from the beginning, documented e.g. in a fundamental critique of Skinner's Verbal behavior (1957) by Chomsky (1959), arguing that behaviorist stimulus-response-theories could in no way account for the complexities of ordinary language use. The "Generative Enterprise", as the program was called in 1982, went through a number of stages, each of which was accompanied by discussions of specific problems and consequences within the narrower domain of linguistics as well as the wider range of related fields, such as ontogenetic development, psychology of language use, or biological evolution. Four stages of the Generative Enterprise can be marked off for expository purposes.
Reflexive pronouns as central anaphoric elements are subject to general principles determined by Universal Grammar and shared by all languages that use reflexives as part of their grammatical structure. In addition to these general conditions, there are language particular properties, which different languages can exhibit on the basis of different regulations. One variation of this sort is the particular role of Reflexives in German, which can show up as improper Arguments, which are subject to standard syntactic and morphological conditions, but do not represent an argument of the head they belong to. Hence the particular property is the effect of syntactic, morphological and semantic conditions. A simple illustration of the phenomena I will explore in this contribution is based on the following observation.
Thematic Roles (or Theta-Roles) are theoretical constructs that account for a variety of well known empirical facts, which are more or less clearly delimited. In other words, Theta-Roles are not directly observable, but they do have empirical content that is open to empirical observation. The objective of the present paper is to sketch the nature and content of Theta-Roles, distinguishing their universal foundation as part of the language faculty, their language particular realization, which depends on the conditions of individual languages, and idiosyncratic properties, determined by specific information of individual lexical items.
Semantic form as interface
(2007)
The term interface had a remarkable career over the past several decades, motivated largely by its use in computer science. Although the concept of a "surface common to two areas" (Oxford Advanced Learner's Dictionary, 1980) is intuitively clear enough, the range of its application is not very sharp and well defined, a "common surface" is open to a wide range of interpretations.
The main tenet of the present paper is the thesis that nominalization – like other cases of derivational morphology – is an essentially lexical phenomenon with well defined syntactic (and semantic) conditions and consequences. More specifically, it will be argued that the relation between a verb and the noun derived from it is subject to both systematic and idiosyncratic conditions with respect to lexical as well as syntactic aspects.
The present study argues that variation across listeners in the perception of a non-native contrast is due to two factors: the listener-specic weighting of auditory dimensions and the listener-specic construction of new segmental representations. The interaction of both factors is shown to take place in the perception grammar, which can be modelled within an OT framework. These points are illustrated with the acquisition of the Dutch three-member labiodental contrast [V v f] by German learners of Dutch, focussing on four types of learners from the perception study by Hamann and Sennema (2005a).
Syntax-semantics interface
(2001)
The article analyzes the system of focus-sensitive particles and, to a lesser extent, clefts in Vietnamese. EVEN/ALSO/ONLY foci are discussed across syntactic categories, and Vietnamese is found to organize its system of focus-sensitive particles along three dimensions of classification: (i) EVEN vs. ALSO vs. ONLY; (ii) particles c-commanding foci vs. particles c-commanding backgrounds; (iii) adverbial focus-sensitive particles vs. particles c-commanding argument foci only. Towards the end of the paper, free-choice constructions and additional sentence-final particles conveying ONLY and ALSO semantics are briefly discussed. The peculiar Vietnamese system reflects core properties of the analogous empirical domain in Chinese, a known source of borrowings into Vietnamese over the millennia.
Introduction
(2007)
Semantics
(2007)
Morphology
(2007)
Introduction
(2007)
The annotation guidelines introduced in this chapter present an attempt to create a unique infrastructure for the encoding of data from very different languages. The ultimate target of these annotations is to allow for data retrieval for the study of information structure, and since information structure interacts with all levels of grammar, the present guidelines cover all levels of grammar too. After introducing the guidelines, the current chapter also presents an evaluation by means of measurements of the inter-annotator agreement.
Information structure
(2007)
Traces
(2007)
Implicated presuppositions
(2008)
It is widely believed that existential quantifiers can bring about the semantic effects of a scope which is wider than their actual syntactic scope (See Fodor & Sag (1982), Cresti (1995), Kratzer (1995), Reinhart (1995) and Winter (1995), among many others.) On the other hand, it is assumed that the syntactic scope of universal quantifiers can be determined unequivocally by the semantics. This paper shows that this second assumption is wrong; universal quantifiers can also bring about scope illusions, though in a very specific environment. In particular, we argue that in the environment of generic tense, universal quantifiers can show the semantic effects of a scope which is wider than the one that is actually realized at LF. Our argument has four steps. First, we show that in generic contexts, universal quantifiers escape standard “scope-islands” (Section 1). Second, we show how the effects of wide scope in generic contexts can be achieved without syntactic wide scope (Section 2.1). Third, we show that this result is actually forced on us, once we take seriously certain independent issues concerning the interpretation of generic tense (Sections 2.2 - 2.4). Finally, the semantics of generic tense and, in particular, its interaction with focus, will yield some intricate new predictions, which, as we show, are borne out (Sections 3 - 5).
The paper presents an additional argument for a specific account of semantic binding: the flat-binding analysis. The argument is based on observations concerning sloppy interpretations in verb phrase ellipsis when the binder is not the subject of the elided VP. In one such case, it is important that one of the binders belong to the domain of the other. This case can be derived from the flat-binding analysis as is shown in the paper, while it is unclear how to account for it within other analyses of semantic binding.
Trubetzkoy's recognition of a delimitative function of phonology, serving to signal boundaries between morphological units, is expressed in terms of alignment constraints in Optimality Theory, where the relevant constraints require specific morphological boundaries to coincide with phonological structure (Trubetzkoy 1936, 1939, McCarthy & Prince 1993). The approach pursued in the present article is to investigate the distribution of phonological boundary signals to gain insight into the criteria underlying morphological analysis. The evidence from English and Swedish suggests that necessary and sufficient conditions for word-internal morphological analysis concern the recognizability of head constituents, which include the rightmost members of compounds and head affixes. The claim is that the stability of word-internal boundary effects in historical perspective cannot in general be sufficiently explained in terms of memorization and imitation of phonological word form. Rather, these effects indicate a morphological parsing mechanism based on the recognition of word-internal head constituents. Head affixes can be shown to contrast systematically with modifying affixes with respect to syntactic function, semantic content, and prosodic properties. That is, head affixes, which cannot be omitted, often lack inherent meaning and have relatively unmarked boundaries, which can be obscured entirely under specific phonological conditions. By contrast, modifying affixes, which can be omitted, consistently have inherent meaning and have stronger boundaries, which resist prosodic fusion in all phonological contexts. While these correlations are hardly specific to English and Swedish it remains to be investigated to which extent they hold cross-linguistically. The observation that some of the constituents identified on the basis of prosodic evidence lack inherent meaning raises the issue of compositionality. I will argue that certain systematic aspects of word meaning cannot be captured with reference to the syntagmatic level, but require reference to the paradigmatic level instead. The assumption is then that there are two dimensions of morphological analysis: syntagmatic analysis, which centers on the criteria for decomposing words in terms of labelled constituents, and paradigmatic analysis, which centers on the criteria for establishing relations among (whole) words in the mental lexicon. While meaning is intrinsically connected with paradigmatic analysis (e.g. base relations, oppositeness) it is not essential to syntagmatic analysis.
In morphological systems of the agglutinative type we sometimes encounter a nearly perfect one-to-one relation between form and function. Turkish inflectional morphology is, of course, the standard textbook example. Things seem to be quite different in systems of the flexive type. Declension in Contemporary Standard Russian (henceforth Russian, for short) may be cited as a typical example: We find, among other things, cumulative markers, “synonymous” endings (e.g., dative singular noun forms in -i, -e, or -u), and “homonymous” endings (e.g., -i, genitive, dative, and prepositional singular). True, some endings are more of an agglutinative nature, being bound to a specific case-number combination and applying across declensions, e.g., -am (dative plural, all nouns); and some cross the boundaries of word classes, e.g., -o, which serves as the nominative/accusative singular ending of neuter forms of pronouns (and adjectives) and as the nominative/accusative singular ending of (most) neuter nouns as well. Still, many observers have been struck by the impression that what we face here are rather uneconomic or even, so to speak, unnatural structures. But perhaps flexive systems are not as complicated as they seem. What seems to be uneconomic complexity may be, at least partially, an artifact of uneconomic descriptions.
When in 1934, Robert BLEICHSTEINER published the Caucasian language specimina contained in the "travel book" of the 17th century Turkish writer Evliya Çelebi , he was struck by the amount of reliability he found in Evliya’s notations: "(Die Sprachproben) sind, von einzelnen Mißverständnissen abgesehen, und wenn man die falschen Punktierungen und Irrtümer der Kopisten abrechnet, außerordentlich gut, ja zuweilen mit einem gewissen phonetischen Geschick wiedergegeben, was der Auffassungsgabe und dem Eifer Evliyas ein hohes Zeugnis ausstellt. Man muß bedenken, wie schwer das arabische Alphabet, ohne weitere Unterscheidungszeichen, wie sie die islamischen Kaukasusvölker anwenden, die verwickelten, oft über 70 verschiedene Phoneme umfassenden Lautsysteme wiederzugeben imstande ist. Wenn trotzdem die Entzifferung der Sprachproben zum größten Teil geglückt ist, so muß man der ungewöhnlichen Begabung des türkischen Reisenden und Gelehrten schrankenlose Bewunderung zollen" (85). ...
When we pay close attention to the prosody of Wh-questions in Japanese, we discover many novel and interesting empirical puzzles that would require us to devise a much finer syntactic component of grammar. This paper addresses the issues that pose some problems to such an elaborated grammar, and offers solutions, making an appeal to the information structure and sentence processing involved in the interpretation of interrogative and focus constructions.
Fronting of an infinite VP across a finite main verb - akin to German "VP-topicalization" - can be found also in Czech and Polish. The paper discusses evidence from large corpora for this process and some of its properties, both syntactic and information-structural. Based on this case, criteria for more user-friedly searching and retrieval of corpus data in syntactic research are being developed.
This paper describes the creation and preparation of TUSNELDA, a collection of corpus data built for linguistic research. This collection contains a number of linguistically annotated corpora which differ in various aspects such as language, text sorts / data types, encoded annotation levels, and linguistic theories underlying the annotation. The paper focuses on this variation on the one hand and the way how these heterogeneous data are integrated into one resource on the other hand.
The paper explicates the notions of topic, contrastive topic, and focus as used in the analysis of Hungarian. Based on distributional criteria, topic and focus are claimed to represent distinct structural positions in the left periphery of the Hungarian sentence, associated with logical rather than discourse functions. The topic is interpreted as the logical subject of predication. The focus is analyzed as a derived main predicate, specifying the referential content of the set denoted by the backgrounded post-focus section of the sentence. The exhaustivity associated with the focus and the existential presupposition associated with the background are shown to be properties following from their specificational predication relation.
The recognition of the prosodic focus position in German-learning infants from 4 to 14 months
(2007)
The aim of the present study was to elucidate in a study with 4-, 6-, 8-, and 14-month-old German-learning children, when and how they may acquire the regularities which underlie Focus-to-Stress Alignment (FSA) in the target language, that is, how prosody is associated with specific communicative functions. Our findings suggest, that 14-month-olds have already found out that German allows for variable focus positions, after having gone through a development which goes from a predominantly prosodically driven processing of the input to a processing where prosody interacts more and more with the growing lexical and syntactic knowledge of the child.
The paper investigates focus marking devices in the scarcely documented North-Ghanaian Gur language Konkomba. The two particles lé and lá occur under specific focus conditions and are therefore regarded as focus markers in the sparse literature. Comparing the distribution and obligatoriness of both alleged focus markers however, I show that one of the particles, lé, is better analyzed as a connective particle, i.e. as a syntactic rather than as a genuine pragmatic marker, and that comparable syntactic focus marking strategies for sentence-initial constituents are also known from related languages.
This paper deals with the conditions under which singular definites, on the one hand, and universally quantified DPs, on the other hand, receive interpretations according to which the sets denoted by the NP-complements of the respective determiner vary with the situations quantified over by a Q-adverb. I show that in both cases such interpretations depend on the availability of situation predicates that are compatible with the presuppositions associated with the respective determiner, as co-variation in both cases comes about via the binding of a covert situation variable that is contained within the NP-complement of the respective determiner. Secondly, I offer an account for the observation that the availability of a co-varying interpretation is more constrained in the case of universally quantified DPs than in the case of singular definites, as far as word order is concerned. This is shown to follow from the fact that co-varying definites in contrast to universally quantified DPs are inherently focus-marked.
Face-to-face communication is multimodal. In unscripted spoken discourse we can observe the interaction of several “semiotic layers”, modalities of information such as syntax, discourse structure, gesture, and intonation. We explore the role of gesture and intonation in structuring and aligning information in spoken discourse through a study of the co-occurrence of pitch accents and gestural apices. Metaphorical spatialization through gesture also plays a role in conveying the contextual relationships between the speaker, the government and other external forces in a naturally-occurring political speech setting.
Elke Kasimir´s paper (in this volume) argues against employing the notion of Givenness in the explanation of accent assignment. I will claim that the arguments against Givenness put forward by Kasimir are inconclusive because they beg the question of the role of Givenness. It is concluded that, more generally, arguments against Givenness as a diagnostic for information structural partitions should not be accepted offhand, since the notion of Givenness of discourse referents is (a) theoretically simple, (b) readily observable and quantifiable, and (c) bears cognitive significance.
A series of production and perception experiments investigating the prosody and well-formedness of special sentences, called Wide Focus Partial Fronting (WFPF), which consist of only one prosodic phrase and a unique initial accented argument, are reported on here. The results help us to decide between different models of German prosody. The absence of pitch height difference on the accent of the sentence speaks in favor of a relative model of prosody, in which accents are scaled relative to each other, and against models in which pitch accents are scaled in an absolute way. The results also speak for a model in which syntax, but not information structure, influences the prosodic phrasing. Finally, perception experiments show that the prosodic structure of sentences with a marked word order needs to be presented for grammaticality judgments. Presentation of written material only is not enough, and falsifies the results.
The semantics of ellipsis
(2005)
There are four phenomena that are particularly troublesome for theories of ellipsis: the existence of sloppy readings when the relevant pronouns cannot possibly be bound; an ellipsis being resolved in such a way that an ellipsis site in the antecedent is not understood in the way it was there; an ellipsis site drawing material from two or more separate antecedents; and ellipsis with no linguistic antecedent. These cases are accounted for by means of a new theory that involves copying syntactically incomplete antecedent material and an analysis of silent VPs and NPs that makes them into higher order definite descriptions that can be bound into.
This paper discusses the use of XSLT stylesheets as a filtering mechanism for refining the results of user queries on treebanks. The discussion is within the context of the TIGER treebank, the associated search engine and query language, but the general ideas can apply to any search engine for XML-encoded treebanks. It will be shown that important classes of linguistic phenomena can be accessed by applying relatively simple XSLT templates to the output of a query, effectively simulating the universal quantifier for a subset of the query language.
In order to investigate the empirical properties of focus, it is necessary to diagnose focus (or: “what is focused”) in particular linguistic examples. It is often taken for granted that the application of one single diagnostic tool, the so-called question-answer test, which roughly says that whatever a question asks for is focused in the answer, is a fool-proof test for focus. This paper investigates one example class where such uncritical belief in the question-answer test has led to the assumption of rather complex focus projection rules: in these examples, pitch accent placement has been claimed to depend on certain parts of the focused constituents being given or not. It is demonstrated that such focus projection rules are unnecessarily complex and in turn require the assumption of unnecessarily complicated meaning rules, not to speak of the difficulties to give a precise semantic/pragmatic definition of the allegedly involved givenness property. For the sake of the argument, an alternative analysis is put forward which relies solely on alternative sets following Mats Rooth´s work, and avoids any recourse to givenness. As it turns out, this alternative analysis is not only simpler but also makes in a critical case the better predictions.
In this paper we compare the behaviour of adverbs of frequency (de Swart 1993) like usually with the behaviour of adverbs of quantity like for the most part in sentences that contain plural definites. We show that sentences containing the former type of Q-adverb evidence that Quantificational Variability Effects (Berman 1991) come about as an indirect effect of quantification over situations: in order for quantificational variability readings to arise, these sentences have to obey two newly observed constraints that clearly set them apart from sentences containing corresponding quantificational DPs, and that can plausibly be explained under the assumption that quantification over (the atomic parts of) complex situations is involved. Concerning sentences with the latter type of Q-adverb, on the other hand, such evidence is lacking: with respect to the constraints just mentioned, they behave like sentences that contain corresponding quantificational DPs. We take this as evidence that Q-adverbs like for the most part do not quantify over the atomic parts of sum eventualities in the cases under discussion (as claimed by Nakanishi and Romero (2004)), but rather over the atomic parts of the respective sum individuals.
Japanese wh-questions always exhibit focus intonation (FI). Furthermore, the domain of FI exhibits a correspondence to the wh-scope. I propose that this phonology-semantics correspondence is a result of the cyclic computation of FI, which is explained under the notion of Multiple Spell-Out in the recent Minimalist framework. The proposed analysis makes two predictions: (1) embedding of an FI into another is possible; (2) (overt) movement of a wh-phrase to a phase edge position causes a mismatch between FI and wh-scope. Both predictions are tested experimentally, and shown to be borne out.
Prosodic focus in Vietnamese
(2007)
This paper reports on pilot work on the expression of Information Structure in Vietnamese and argues that Focus in Vietnamese is exclusively expressed prosodically: there are no specific focus markers, and the language uses phonology to express intonational emphasis in similar ways to languages like English or German. The exploratory data indicates that (i) focus is prosodically expressed while word order remains constant, (ii) listeners show good recoverability of the intended focus structure, and (iii) that there is a trading relationship between several phonetic parameters (duration, f0, amplitude) involved to signal prosodic (acoustic) emphasis.
Phonology and intonation
(2007)
The encoding standards for phonology and intonation are designed to facilitate consistent annotation of the phonological and intonational aspects of information structure, in languages across a range ofprosodic types. The guidelines are designed with the aim that a nonspecialist in phonology can both implement and interpret the resulting annotation.
This paper investigates the structural properties of morphosyntactically marked focus constructions, focussing on the often neglected non-focal sentence part in African tone languages. Based on new empirical evidence from five Gur and Kwa languages, we claim that these focus expressions have to be analysed as biclausal constructions even though they do not represent clefts containing restrictive relative clauses. First, we relativize the partly overgeneralized assumptions about structural correspondences between the out-of-focus part and relative clauses, and second, we show that our data do in fact support the hypothesis of a clause coordinating pattern as present in clause sequences in narration. It is argued that we deal with a non-accidental, systematic feature and that grammaticalization may conceal such basic narrative structures.
This article presents an analysis of German nicht...sondern... (contrastive not...but...) which departs from the commonly held view that this construction should be explained by appeal to its alleged corrective function. It will be demonstrated that in nicht A sondern B (not A but B), A and B just behave like stand-alone unmarked answers to a common question Q, and that this property of sondern is presuppositional in character. It is shown that from this general observation many interesting properties of nicht...sondern... follow, among them distributional differences between German "sondern" and German "aber" (contrastive but, concessive but), intonational requirements and exhaustivity effect sondern presupposition is furthermore argued to be the result of the conventionalization of conversational implicatures.
Three dimensions can be distinguished in a cross-linguistic account of information structure. First, there is the definition of the focus constituent, the part of the linguistic expression which is subject to some focus meaning. Second and third, there are the focus meanings and the array of structural devices that encode them. In a given language, the expression of focus is facilitated as well as constrained by the grammar within which the focus devices operate. The prevalence of focus ambiguity, the structural inability to make focus distinctions, will thus vary across languages, and within a language, across focus meanings.
In this paper, we present the Multiple Annotation approach, which solves two problems: the problem of annotating overlapping structures, and the problem that occurs when documents should be annotated according to different, possibly heterogeneous tag sets. This approach has many advantages: it is based on XML, the modeling of alternative annotations is possible, each level can be viewed separately, and new levels can be added at any time. The files can be regarded as an interrelated unit, with the text serving as the implicit link. Two representations of the information contained in the multiple files (one in Prolog and one in XML) are described. These representations serve as a base for several applications.
The paper presents an in-depth study of focus marking in Gùrùntùm, a West Ch adic language spoken in Bauchi Province of Northern Nigeria. Focus in Gùrùntùm is marked morphologically by means of a focus marker a, which typically precedes the focus constituent. Even though the morphological focus-marking system of Gùrùntùm allows for a lot of fine-grained distinctions in information structure (IS) in principle, the language is not entirely free of focus ambiguities that arise as the result of conflicting IS- and syntactic requirements that govern the placement of focus markers. We show that morphological focus marking with a applies across different types of focus, such as newinformation, contrastive, selective and corrective focus, and that a does not have a second function as a perfectivity marker, as is assumed in the literature. In contrast, we show at the end of the paper that a can also function as a foregrounding device at the level of discourse structure.
This paper presents the results of a production experiment on the intonation of sentences containing a negative polarity item (NPI) in Tokyo Japanese. The results show that NPI sentences exhibit a focus intonation: the F0-peak of the word to which an NPI is attached is raised, while the pitch contour after the NPI-attached word is compressed until the negation. This intonation pattern is parallel to that of wh-question, in which the F0 of the wh-phrase is raised while the post-wh-contour is compressed until the question particle.
This paper surveys a range of constructions in which prosody affects discourse function and discourse structure.We discuss English tag questions, negative polar questions, and what we call “focus” questions. We postulate that these question types are complex speech acts and outline an analysis in Segmented Discourse Representation Theory (SDRT) to account for the interactions between prosody and discourse.
The aim of this paper is to validate a dataset collected by means of production experiments which are part of the Questionnaire on Information Structure. The experiments generate a range of information structure contexts that have been observed in the literature to induce specific constructions. This paper compares the speech production results from a subset of these experiments with specific claims about the reflexes of information structure in four different languages. The results allow us to evaluate and in most cases validate the efficacy of our elicitation paradigms, to identify potentially fruitful avenues of future research, and to highlight issues involved in interpreting speech production data of this kind.
While the Information Structure (IS) is most naturally interpreted as "structure of information", some may argue that it is structure of something else, and others may object to the use of the word "structure". This paper focuses on the question of whether the informational component can have structural properties such that it can be called "structure". The preliminary conclusion is that, althoughthere are some vague indications of structurehood in it, it is perhaps better understood to be a representation that encodes a finite set of information-based partitions, rather than structure.
In a first step, definitions of the irreducible information structural categories are given, and in a second step, it is shown that there are no invariant phonological or otherwise grammatical correlates of these categories. In other words, the phonology, syntax or morphology are unable to define information structure. It is a common mistake that information structural categories are expressed by invariant grammatical correlates, be they syntactic, morphological or phonological. It is rather the case that grammatical cues help speaker and hearer to sort out which element carries which information structural role, and only in this sense are the grammatical correlates of information structure important. Languages display variation as to the role of grammar in enhancing categories of information structure, and this variation reflects the variation found in the ‘normal’ syntax and phonology of languages.
We argue that there is a crucial difference between determiner and adverbial quantification. Following Herburger [2000] and von Fintel [1994], we assume that determiner quantifiers quantify over individuals and adverbial quantifiers over eventualities. While it is usually assumed that the semantics of sentences with determiner quantifiers and those with adverbial quantifiers basically come out the same, we will show by way of new data that quantification over events is more restricted than quantification over individuals. This is because eventualities in contrast to individuals have to be located in time which is done using contextual information according to a pragmatic resolution strategy. If the contextual information and the tense information given in the respective sentence contradict each other, the sentence is uninterpretable. We conclude that this is the reason why in these cases adverbial quantification, i.e. quantification over eventualities, is impossible whereas quantification over individuals is fine.
Heterogeneity and standardization in data, use, and annotation : a diachronic corpus of German
(2005)
This paper describes the standardization problems that come up in a diachronic corpus: it has to cope with differing standards with regard to diplomaticity, annotation, and header information. Such highly heterogeneous texts must be standardized to allow for comparative research without (too much) loss of information.
Results of a production experiment on the placement of sentence accent in German are reported. The hypothesis that German fulfills some of the most widely accepted rules of accent assignment— predicting focus domain integration—was only partly confirmed. Adjacency between argument and verb induces a single accent on the argument, as recognized in the literature, but interruption of this sequence by a modifier often induces remodeling of the accent pattern with a single accent on the modifier. The verb is rarely stressed. All models based on linear alignment or adjacency between elements belonging to a single accent domain fail to account for this result. A cyclic analysis of prosodic domain formation is proposed in an optimality-theoretic framework that can explain the accent pattern.
Human manual action exhibits a differential use of a non-dominant (typically, left) and a dominant (typically, right) hand. Human communication exhibits a pervasive structuring of utterances into topic and comment. I will point out striking similarities between the coordination of hands in bimanual actions, and the structuring of utterances in topics and comments. I will also show how principles of bimanual coordination influence the expression of topic/comment structure in sign languages and in gestures accompanying spoken language, and suggest that bimanual coordination might have been a preadaptation of the development of information structure in human communication.
We argue that the standard focus theories reach their limits when confronted with the focus systems of the Chadic languages. The backbone of the standard focus theories consists of two assumptions, both called into question by the languages under consideration. Firstly, it is standardly assumed that focus is generally marked by stress. The Chadic languages, however, exhibit a variety of different devices for focus marking. Secondly, it is assumed that focus is always marked. In Tangale, at least, focus is not marked consistently on all types of constituents. The paper offers two possible solutions to this dilemma.
Focus expressions in Foodo
(2007)
This paper aims at presenting different ways of expressing focus in Foodo, a Guang language. We can differentiate between marked and unmarked focus strategies. The marked focus expressions are first syntactically characterized: the focused constituent is in sentence-initial position and is second always marked obligatorily by a focus marker, which is [...] for non-subjects and N for subjects. Complementary to these structures, Foodo knows an elliptic form consisting of the focused constituent and a predication marker [...]. It will be shown that the two focus markers can be analyzed as having developed out of the homophone conjunction n[...] and that the constraints on the use of the focus markers can be best explained by this fact.
Focus asymmetries in Bura
(2008)
This article presents the central aspects of the focus system of Bura (Chadic), which exhibits a number of asymmetries: Grammatical focus marking is obligatory only with focused subjects, where focus is marked by the particle án following the subject. Focused subjects remain in situ and the complement of án is a regular VP. With nonsubject foci, án appears in a cleft-structure between the fronted focus constituent and a relative clause. We present a semantically unified analysis of focus marking in Bura that treats the particle as a focusmarking copula in T that takes a property-denoting expression (the background) and an individual-denoting expression (the focus) as arguments. The article also investigates the realization of predicate and polarity focus, which are almost never marked. The upshot of the discussion is that Bura shares many characteristic traits of focus marking with other Chadic languages, but it crucially differs in exhibiting a structural difference in the marking of focus on subjects and non-subject constituents.
Focus and tone
(2006)
Tone is a distinctive feature of the lexemes in tone languages. The information-structural category focus is usually marked by syntactic and morphological means in these languages, but sometimes also by intonation strategies. In intonation languages, focus is marked by pitch movements, which are also perceived as tone. The present article discusses prosodic focus marking in these two language types.
This paper discusses how focus change s prosodic structure in Tokyo Japanese. It is generally believed that focus blocks the intonational process of downstep and causes a pitch reset. This paper presents experimental evidence against this traditional view by looking at the prosodic behavior of Wh words, which receive focus lexically in Japanese as in other languages. It is demonstrated, specifically, that the focused Wh element does not block downstep although it receives a much higher pitch than its preceding element. This suggests that presence of lexical focus does not trigger pitch reset in Japanese.
The present study examines native and nonnative perceptual processing of semantic information conveyed by prosodic prominence. Five groups of German learners of English each listened to one of 5 experimental conditions. Three conditions differed in place of focus accent in the sentence and two conditions were with spliced stimuli. The experiment condition was presented first in the learners’ L1 (German) and then in a similar set in the L2 (English). The effect of the accent condition and of the length and position of the target in the sentence was evaluated in a probe recognition task. In both the L1 and L2 tasks there was no significant effect in any of the five focus conditions. Target position and target word length had an effect in the L1 task. Word length did not affect accuracy rates in the L2 task. For probe recognition in the L2, word length and the position of the target interacted with the focus condition.
We present a system for the linguistic exploration and analysis of lexical cohesion in English texts. Using an electronic thesaurus-like resource, Princeton WordNet, and the Brown Corpus of English, we have implemented a process of annotating text with lexical chains and a graphical user interface for inspection of the annotated text. We describe the system and report on some sample linguistic analyses carried out using the combined thesaurus-corpus resource.
The aim of this paper is to outline the means for encoding information structure in Yucatec Maya. Yucatec Maya is a tone language, displaying a three-fold opposition in the tonal realization of syllables. From the morpho-syntactic point of view, the grammar of Yucatec Maya contains morphological (topic affixes, morphological marking of out-of-focus predicates) and syntactic (designated positions) means to uniquely specify syntactic constructions for their information structure. After a descriptive overview of these phenomena, we present experimental evidence which reveals the impact of the nonavailability of prosodic alternatives on the choice of syntactic constructions in language production.
In this paper we review the current state of research on the issue of discourse structure (DS) / information structure (IS) interface. This field has received a lot of attention from discourse semanticists and pragmatists, and has made substantial progress in recent years. In this paper we summarize the relevant studies. In addition, we look at the issue of DS/ISinteraction at a different level—that of phonetics. It is known that both information structure and discourse structure can be realized prosodically, but the issue of phonetic interaction between the prosodic devices they employ has hardly ever been discussed in this context. We think that a proper consideration of this aspect of DS/IS-interaction would enrich our understanding of the phenomenon, and hence we formulate some related research-programmatic positions.
This paper investigates the nature of the attraction of XPs to clauseinitial position in German (and other languages). It argues that there are two different types of preposing. First, an XP can move when it is attracted by an EPP-like feature of Comp. Comp can, however, also attract elements that bear the formal marker of some semantic or pragmatic (information theoretic) function. This second type of movement is driven by the attraction of a formal property of the moved element. It has often been misanalysed as “operator” movement in the past.
New evidence is provided for a grammatical principle that singles out contrastive focus (Rooth 1996; Truckenbrodt 1995) and distinguishes it from discourse-new “informational” focus. Since the prosody of discourse-given constituents may also be distinguished from discourse-new, a three-way distinction in representation is motivated. It is assumed that an F-feature marks just contrastive focus (Jackendoff 1972, Rooth 1992), and that a G-feature marks discoursegiven constituents (Féry and Samek-Lodovici 2006), while discoursenew is unmarked. A crucial argument for G-marking comes from second occurrence focus (SOF) prosody, which arguably derives from a syntactic representation where SOF is both F-marked and G-marked. This analysis relies on a new G-Marking Condition specifying that a contrastive focus may be G-marked only if the focus semantic value of its scope is discourse-given, i.e. only if the contrast itself is given.
This article takes stock of the basic notions of Information Structure (IS). It first provides a general characterization of IS — following Chafe (1976) — within a communicative model of Common Ground (CG), which distinguishes between CG content and CG management. IS is concerned with those features of language that concern the local CG. Second, this paper defines and discusses the notions of Focus (as indicating alternatives) and its various uses, Givenness (as indicating that a denotation is already present in the CG), and Topic (as specifying what a statement is about). It also proposes a new notion, Delimitation, which comprises contrastive topics and frame setters, and indicates that the current conversational move does not entirely satisfy the local communicative needs. It also points out that rhetorical structuring partly belongs to IS.
The material reported on in this paper is part of a set of experiments in which the role of Information Structure on L2 processing of words is tested. Pitch and duration of 4 sets of experimental material in German and English are measured and analyzed in this paper. The well-known finding that accent boosts duration and pitch is confirmed. Syntactic and lexical means of marking focus, however, do not give the duration and the pitch of a word an extra boost.
In this paper, we discuss the design and implementation of our first version of the database "ANNIS" (ANNotation of Information Structure). For research based on empirical data, ANNIS provides a uniform environment for storing this data together with its linguistic annotations. A central database promotes standardized annotation, which facilitates interpretation and comparison of the data. ANNIS is used through a standard web browser and offers tier-based visualization of data and annotations, as well as search facilities that allow for cross-level and cross-sentential queries. The paper motivates the design of the system, characterizes its user interface, and provides an initial technical evaluation of ANNIS with respect to data size and query processing.
The paper presents a novel approach to explaining word order variation in the early Germanic languages. Initial observations about verb placement as a device marking types of rhetorical relations made on data from Old High German (cf. Hinterhölzl & Petrova 2005) are now reconsidered on a larger scale and compared with evidence from other early Germanic languages. The paper claims that the identification of information-structural domains in a sentence is best achieved by taking into account the interaction between the pragmatic features of discourse referents and properties of discourse organization.
We investigate methods to improve the recall in coreference resolution by also trying to resolve those definite descriptions where no earlier mention of the referent shares the same lexical head (coreferent bridging). The problem, which is notably harder than identifying coreference relations among mentions which have the same lexical head, has been tackled with several rather different approaches, and we attempt to provide a meaningful classification along with a quantitative comparison. Based on the different merits of the methods, we discuss possibilities to improve them and show how they can be effectively combined.
One aspect of the progress being made is that the focus of attention has widened. Adverbials, though still the heart of the matter, now form part of a much larger set of constituent types subsumed under the general syntactic label of adjunct; while modifier has become the semantic counterpart on the same level of generality. So one of the readings of Modifying Adjuncts stands for the focus on this intersection. Moreover, recent years have seen a number of studies which attest an increasing interest in adjunct issues. There is an impressive number of monographs, e.g. Alexiadou (1997), Laenzlinger (1998), Cinque (1999), Pittner (1999), Ernst (2002), which, by presenting in-depth analyses of the syntax of adjuncts, have sharpened the debate on syntactic theorizing. Serious attempts to gain a broader view on adjuncts are witnessed by several collections, see Alexiadou and Svenonius (2000), Austin, Engelberg and Rauh (in progress); of particular importance are the contributions to vol. 12.1 of the Italian Journal of Linguistics (2000), a special issue on adverbs, the Introductions to which by Corver and Delfitto (2000) and Delfitto (2000) may be seen as the best state-of-the-art article on adverbs and adverbial modification currently on the market. To try and test a fresh view on adjuncts was the leitmotif of the Oslo Conference “Approaching the Grammar of Adjuncts” (Sept 22–25, 1999), which provided the initial forum for the papers contained in this volume and initiated a period of discussion and continuing interaction among the contributors, from which the versions published here have greatly profited. The aim of the Oslo conference, and hence the focus of the present volume, was to encourage syntacticians and semanticists to open their minds to a more integrative approach to adjuncts, thereby paying attention to, and attempting to account for, the various interfaces that the grammar of adjuncts crucially embodies. From this perspective, the present volume is to be conceived of as an interim balance of current trends in modifying the views on adjuncts. In introducing the papers, we will refrain from rephrasing the abstracts, but will instead offer a guided tour through the major problem areas they are tackling. Assessed by thematic convergence and mutual reference, the contributions form four groups, which led us to arrange them into subparts of the book. Our commenting on these is intended (i) to provide a first glance at the contents, (ii) to reveal some of the reasons why adjuncts indeed are, and certainly will remain, a challenging issue, and thereby (iii) to show some facets of what we consider novel and promising approaches.
The article offers evidence that there are two variants of adverbial modification that differ with respect to the way in which a modifier is linked to the verbs eventuality argument. So-called event-external modifiers relate to the full eventuality, whereas event-internal modifiers relate to some integral part of it. The choice between external and internal modification is shown to be dependent on the modifiers syntactic base position. Event-external modifiers are base-generated at the VP periphery, whereas event-internal modifiers are base-generated at the V periphery. These observations are accounted for by a refined version of the standard Davidsonian approach to adverbial modification according to which modification is mediated by a free variable. In the case of external modification, the grammar takes responsibility for identifying the free variable with the verbs eventuality argument, whereas in the case of internal modification, a value for the free variable is determined by the conceptual system on the basis of contextually salient world knowledge. For the intriguing problem that certain locative modifiers occasionally seem to have nonlocative (instrumental, positional, or manner) readings, the advocated approach can provide a rather simple solution.
The Conference on Computational Natural Language Learning features a shared task, in which participants train and test their learning systems on the same data sets. In 2007, as in 2006, the shared task has been devoted to dependency parsing, this year with both a multilingual track and a domain adaptation track. In this paper, we define the tasks of the different tracks and describe how the data sets were created from existing treebanks for ten languages. In addition, we characterize the different approaches of the participating systems, report the test results, and provide a first analysis of these results.
This report explores the question of compatibility between annotation projects including translating annotation formalisms to each other or to common forms. Compatibility issues are crucial for systems that use the results of multiple annotation projects. We hope that this report will begin a concerted effort in the field to track the compatibility of annotation schemes for part of speech tagging, time annotation, treebanking, role labeling and other phenomena.
The retreat of BE as perfect auxiliary in the history of English is examined. Corpus data are presented showing that the initial advance of HAVE was most closely connected to a restriction against BE in past counterfactuals. Other factors which have been reported to favor the spread of HAVE are either dependent on the counterfactual effect, or significantly weaker in comparison. It is argued that the effect can be traced to the semantics of the BE perfect, which denoted resultativity rather than anteriority proper. Related data from other older Germanic and Romance languages are presented, and finally implications for existing theories of auxiliary selection stemming from the findings presented are discussed.
It has often been noticed that one syntactic argument position can be realized by elements which seem to realize different thematic roles. This is notably the case with the external argument position of verbs of change of state which licenses volitional agents, instruments or natural forces/causers, showing the generality and abstractness of the external argument relation. (1) a. John broke the window (Agent) b. The hammer broke the window (Instrument) c. The storm broke the window (Causer) In order to capture this generality, Van Valin & Wilkins (1996) and Ramchand (2003) among others have proposed that the thematic role of the external argument position is in fact underspecified. The relevant notion is that of an effector (in Van Valin & Wilkins) or of an abstract causer/initiator (in Ramchand). In this paper we argue against a total underspecification of the external argument relation. While we agree that (1b) does not instantiate an instrument theta role in subject position, we argue that a complete underspecification of the external theta-position is not feasible, but that two types of external theta roles have to be distinguished, Agents and Causers. Our arguments are based on languages where Agents and Causers show morpho-syntactic independence (section 2.1) and the behavior of instrument subjects in English, Dutch, German and Greek (section 2.2 and 3). We show that instrument subjects are either Agent or Causer like. In section (4) we give an analysis how arguments realizing these thematic notions are introduced into syntax.
In many languages, a passive-like meaning may be obtained through a noncanonical passive construction. The get passive (1b) in English, the se faire passive (2b) in French and the kriegen passive (3b) in German represent typical manifestations. This squib focuses on the behavior of the get-passive in English and discusses a number of restrictions associated with it as well as the status of get.