Linguistik
Refine
Year of publication
Document Type
- Part of a Book (591) (remove)
Language
- English (591) (remove)
Has Fulltext
- yes (591)
Is part of the Bibliography
- no (591) (remove)
Keywords
- Syntax (79)
- Spracherwerb (63)
- Deutsch (56)
- Phonologie (46)
- Semantik (42)
- Englisch (40)
- Sprachtest (33)
- Thema-Rhema-Gliederung (32)
- Intonation <Linguistik> (25)
- Morphologie (24)
Institute
- Extern (11)
- Sprachwissenschaften (1)
In this paper I argue that the syntax of Eastern Bantu does not make reference to the notion 'syntactic object'. That is, there is no linguistic category of objects that is the target of syntactic rules in Eastern Bantu languages. Instead I propose that syntactic rules broadly distinguish complements and adjuncts as well as category type of complement or adjunct. I argue that Bantu languages are typologically special in that (a) the verb complement structure can be expanded by the valency increasing applicative suffix; and (b) that the class of adjuncts can be expanded through verb concord licensing. Because of these properties, Bantu languages have a much-expanded notion of 'complement' and 'adjunct'. Namely, complements consist of (a) inherent complements (subcategorised by the lexical verb), and (b) derived complements (licensed by the applicative suffix). Adjuncts consist of (a) non-subcategorised modifying constituents in the usual sense and (b) phrases that are licensed by verb concord (i.e. Topics in Bresnan and Mchombo (1987)). I propose that most the differences in the licensing of objects in Bantu are due to two causes: (a) the unusual split in the composition of complements and adjuncts and (b) a set of typological parameter settings.
Complex focus versus double focus : investigations on multiple focus interpretations in Hungarian
(2006)
The main aim of this paper is to point out several problems with the semantic analysis of Hungarian focus interpretation and 'only'. For current semantic analyses the interpretation of Hungarian identificational/exhaustive focus and 'only' is problematic, since in classical semantic analyses 'only' is identified with an exhaustivity operator. In this paper I will discuss multiple focus constructions and question-answer pairs in Hungarian to show that such a view cannot be applied to Hungarian exhaustive focus. Next to this I will discuss possible interpretations of Hungarian sentences containing multiple prosodic foci: complex focus versus double focus. My claim is that in order to interpret multiple focus (in Hungarian) we have to take into consideration the different intonation patterns, the occurrence of 'only', and the syntactic structure as well.
Focus theories distinguish different types of focus according to the pragmatic conditions or communicative point on the one side and different scopes of focus on the other side. The assertion in term focus constructions (Dik 1989), called by others argument focus constructions or identificational sentences (Lambrecht 1994), has the purpose of establishing a relation between an argument and an open proposition. Kar, a north-eastern Senufo language of Burkina Faso, which has the basic word order S-Aux-O-V-other, has at its disposal different strategies to mark argument focus, among them fronting of the focused item. In many West African languages the displacement of the focused argument involves other devices, such as the use of special verb forms. In Kar fronting of a focused argument requires the use of special pronouns in the out-of-focus part of the sentence, called background subject pronouns. They are used in other backgrounded contexts, too, for example in relative clauses, adverbial clauses and constituent questions. Their inconsistent use is attributed to a particular sociolinguistic situation in which the data has been collected. The use of the same focus strategies for completive and contrastive focus suggests that Kar does not distinguish pragmatic conditions on the level of sentence grammar.
Many analyses of existential sentences have focused attention on determining which of its elements constitutes the logical subject and predicate, and this has proven to be a not uncontroversial topic of research. Some, from both syntactic and semantic points of view, have argued that there is a subject (cf. Williams 1994) others that it is a predicate (cf. Moro 1997). Similarly, some have argued that the associate NP is a logical subject, others that it is apredicate (Higginbotham 1987).
One logical possibility that has not (to my knowledge) been pursued in the linguistics literature is that these statements are not of the form subject-predicate, a possibility that has been taken up in the philosophical literature by P.F. Strawson (1959). He claims that there are such statements and that their form is simpler than that of subject-predicate statements because it does not, and cannot, involve an expression that makes reference to an individual. Not involving reference to an individual, these sentences are therefore are made true by different means than a subject-predicate statement whose truth, in the simplest cases, depends on the denotation of the subject being a member of the denotation of the predicate. Of interest from the point of view of the present discussion is his claim that existential statements are examples of this kind of statement, which he calls a feature-placing statement. The truth of a statement of the form feature-placer requires that something with the set of features denoted by the associate NP exist at the location or coordinates expressed by the placer. In an existential sentence we can take the associate NP as the feature-denoting expression and the coda-XP as the placer.
This paper focuses on different subtypes of constructions involving temporally bounded quantification, e.g. sequences like David visited Rome three times followed by temporal phrases as different as (i) last year, which defines a time interval; (ii) in less that two months, which defines an amount of time; and (iii) per month, which refers to a time unit. As for the first two types of temporal phrases, data will be presented which shows that they have specific linguistic properties in these quantifying contexts, and do not behave exactly as the locating or duration adverbials they are superficially identical with. The third type of phrases will receive special attention. Structures with frequency adverbials like n times per month will be analysed compositionally, separating the quantified component n times from the temporally binding phrase per month (whose role is comparable to that of adverbials (i) and (ii) in the relevant constructions). The data presented is mainly from Portuguese, although the issues at stake – the linguistic properties of temporally bounded quantification – are obviously relevant to parallel constructions in other languages.
This paper looks at sentences with "quantificational indefinites," discussed by Diesing (1992) and others. I propose that these sentences generate sets of alternatives of the form {p, not p and it's possible that p}, which restrict the quantification by an extension of familiar focus principles. For example, in the sentence "I usually read a book about slugs" (on the relevant reading), "usually" quantifies over pairs <x,t> such that x is a book about slugs, t is a time interval, and one alternative is true from the set {I read x at t, I can but do not read x at t}. In addition to accounting for a well-known contrast between creation and non-creation verbs, this also explains a second contrast that Diesing’s analysis cannot account for.
Russian and Spanish each have two variants of the predicational copular sentence. In Russian, the variation concerns the case of the predicate phrase, which can be nominative or instrumental, while in Spanish, the variation involves the choice of the copular verb, either ser or estar. It is shown that the choice of the particular variant of copular sentence in both languages depends on the speaker’s perspective, i.e., on whether or not the predication is linked to a specific topic situation.
There is an elegant account, proposed by Beaver and Condoravdi (2003), that assumes that the temporal connectives before and after are converses (i.e., they are analyzed by means of a unified lexical schema), and that explains away their different logical and veridical behavior appealing to other factors. There is an elegant explanation that connects the licensing of Polarity Items to informational strengthening requirements: Polarity Items are viewed as existentials that lead to a widening of the domain of quantification, and they are predicted to be legitimate only when this widening leads to a stronger statement (roughly, in downward monotone contexts). My plan is to connect these two approaches – by proposing an amendment in the definition Beaver and Condoravdi presented for before and after that is meant to account also for their Polarity Items licensing behavior.
Based on a Relevance Theory-informed view of language development, this paper argues that grammatical relations are construction-specific conventionalizations (grammaticalizations) of implicatures which arise out of repeated patterns of reference to particular types of referents. Once conventionalized, these structures function to constrain the hearer's identification of referents in discourse. As they are construction-specific, and hence language-specific, there is no category "subject" across languages; different languages will either show this type of grammaticalization or not, and if they do, may show it or not in different constructions. Any cross-linguistic use of terms such as "subject" (and "S", as in "SOV") should then be avoided.
Sino-Tibetan languages
(2006)
The Sino-Tibetan (ST) language family includes the Sinitic languages (what for political reasons are known as Chinese ‘dialects’) and the 200 to 300 Tibeto-Burman (TB) languages. Geographically it stretches from Northeast India, Burma, Bangladesh, and northern Thailand in the southeast, throughout the Tibetan plateau to the north, across most of China and up to the Korean border in the northeast, and down to Taiwan and Hainan Island in the southeast. The family has come to be the way it is because of multiple migrations, often into areas where other languages were spoken (LaPolla, 2001).
Li Fang-Kuei (1902-1987)
(2006)
Fang-Kuei Li was one of the foremost scholars of Thai and Sino-Tibetan studies and a major contributor to Amerind studies. Born in China, he was one of the early scholars sent to the United States to study. He had developed an interest in language while learning English, Latin, and German as part of his studies in China, and so he decided to study linguistics in the United States. In 1924, he went to the University of Michigan at Ann Arbor, receiving his B.A. 2 years later, then moved to the University of Chicago, where he received his M.A. and Ph.D., studying with Edward Sapir, Leonard Bloomfield, and Carl Darling Buck.
Wang Li (1900-1986)
(2006)
Mention some of all
(2006)
In the interpretation of natural language one may distinguish three types of dynamics: there are the acts or moves that are made; there are structural relations between subsequent moves; and interlocutors reason about the beliefs and intentions of the participants in a particular language game. Building on some of the formalisms developed to account for the first two types of dynamics, I will generalize and formalize Gricean insights into the third type, and show by means of a case study that such a formalization allows a direct account of an apparent ambiguity: the ‘exhaustive’ versus the ‘mention some’ interpretation of questions and their answers. While the principles which I sketch, like those of Grice, are motivated by assumptions of rationality and cooperativity, they do not presuppose these assumptions to be always warranted.
In this paper, focusing on the relevance-theoretic view of cognition, I discuss the idea that what is communicated through an utterance is not merely an explicature upon which implicature(s) are recovered, but rather a propositional complex that contains both explicit and implicit information. More specifically, I propose that this information is constructed on the fly as the interpreter processes every lexical item in its turn while parsing the utterance in real time, in this way creating a string of ad hoc concepts. While hearing an utterance and incrementally constructing a context, the propositional complex communicated by an utterance is pragmatically narrowed and simultaneously pragmatically broadened in order to incorporate only the set of optimally relevant propositions with respect to a specific point in the interpretation. The narrowing of propositions from the initial context at each stage allows relevant propositions to be carried on to the new level, while their broadening adds to the communicated propositional complex new propositions that are linked to the lexical item that is processed at every step of the interpretation process.
In this paper we compare the behaviour of adverbs of frequency (de Swart 1993) like usually with the behaviour of adverbs of quantity like for the most part in sentences that contain plural definites. We show that sentences containing the former type of Q-adverb evidence that Quantificational Variability Effects (Berman 1991) come about as an indirect effect of quantification over situations: in order for quantificational variability readings to arise, these sentences have to obey two newly observed constraints that clearly set them apart from sentences containing corresponding quantificational DPs, and that can plausibly be explained under the assumption that quantification over (the atomic parts of) complex situations is involved. Concerning sentences with the latter type of Q-adverb, on the other hand, such evidence is lacking: with respect to the constraints just mentioned, they behave like sentences that contain corresponding quantificational DPs. We take this as evidence that Q-adverbs like for the most part do not quantify over the atomic parts of sum eventualities in the cases under discussion (as claimed by Nakanishi and Romero (2004)), but rather over the atomic parts of the respective sum individuals.
The retreat of BE as perfect auxiliary in the history of English is examined. Corpus data are presented showing that the initial advance of HAVE was most closely connected to a restriction against BE in past counterfactuals. Other factors which have been reported to favor the spread of HAVE are either dependent on the counterfactual effect, or significantly weaker in comparison. It is argued that the effect can be traced to the semantics of the BE perfect, which denoted resultativity rather than anteriority proper. Related data from other older Germanic and Romance languages are presented, and finally implications for existing theories of auxiliary selection stemming from the findings presented are discussed.
Modifiability by almost has been used as a test for the quantificational force of a DP without stating the meaning of almost explicitly. The aim of this paper is to give a semantics for almost applying across categories and to evaluate the validity of the almost test as a diagnosis for universal quantifiers. It is argued that almost is similar to other cross-categorial modifiers such as at least or exactly in referring to alternatives ordered on a scale. I propose that almost evaluates alternatives in which the modified expression is replaced by a value close by on the corresponding Horn scale. It is shown that a semantics for almost that refers to scalar alternatives derives the correct truth conditions for almost and explains selectional restrictions. At the same time, taking the semantics of almost seriously invalidates the almost test as a simple diagnosis for the nature of quantifiers.
This paper demonstrates that there are no empirical and theoretical motivations for regarding verbal predicate focus constructions as (diachronically) derived from cleft constructions. Instead, it is argued that predicate fronting for the purpose of focus or topic is comparable to verb (phrase) fronting structures in other languages (e.g., Germanic). The proposed analysis further indicates that related doubling strategies observed in certain languages are the consequences of parallel chains that license the fronted verb (phrase) in the left periphery, and the Agree-tense-aspect features inside the proposition.
The paper investigates the interpretation of the Romanian subjunctive B (subjB) mood when it is embedded under the propositional attitude verb crede (believe). SubjB is analyzed as a single package of three distinct presuppositions: temporal de se, dissociation and propositional de se. I show that subjB is the temporal analogue of null PRO in the individual domain: it allows only for a de se reading. Dissociation enables us to show that subjB always takes scope over a negation embedded in a belief report. Propositional de se derives this empirical generalization. The introduction of centered propositions (generalizing centered worlds), together with propositional de se, dissociation and the belief 'introspection' principles, derives the fact that subjB belief reports (unlike their indicative counterparts) are infelicitous with embedded probabil.
In this paper we will develop a formal conceptual model of how the path in a motion situation interacts with the semantic analysis of so called 'motion shape verbs' like 'wackeln' ('wobble'), a subclass of the so called 'manner of motion verbs'. Central to this model will be the distinction between two concepts of motion: translational motion and nontranslational motion, which has no inherent translational component but puts emphasis on describing specific Motion Shape Patterns. We will define and algorithmically describe a theory of Path Shape Decomposition that aims at algorithmically deriving the translational vs. nontranslational distinction from the shape of the path. To account for object internal motion, we additionally introduce Bounding Box encapsulation, which yields a topological division of inner and outer movement. Finally we demonstrate how the outcome of such a technical decomposition can be used in modelling a Path Superimposition scenario like 'Peter wackelt über die Straße'.
The paper investigates the interaction of focus and adverbial quantification in Hausa, a Chadic tone language spoken in West Africa. The discussion focuses on similarities and differences between intonation and tone languages concerning the way in which adverbial quantifiers (AQs) and focus particles (FPs) associate with focus constituents. It is shown that the association of AQs with focused elements does not differ fundamentally in intonation and tone languages such as Hausa, despite the fact that focus marking in Hausa works quite differently. This may hint at the existence of a universal mechanism behind the interpretation of adverbial quantifiers across languages. From a theoretical perspective, the Hausa data can be taken as evidence in favour of pragmatic approaches to the focus-sensitivity of AQs, such as e.g. Beaver & Clark (2003).
According to standard Binding Theory, pronouns and reflexives are in (nearly) complementary distribution. However, representational NPs (e.g. 'picture of her/herself') allow both. It has been suggested that in English, reflexives in representational NPs (RNPs) have a preference for 'sources of information' and that pronouns prefer 'perceivers of information.' We conducted two experiments investigating the effects of structural and non-structural (source/perceiver) factors on the interpretation of two kinds of RNP structures in a typologically different language, namely Finnish. Our results reveal source/perceiver effects for postnominal but not for prenominal RNPs in Finnish, with a difference in the degree of sensitivity that pronouns and reflexives exhibit to the source/perceiver manipulation, and our results also suggest that morphological differences in Finnish reflexives correspond to interpretation differences. As a whole, these results support a multiple-factor model of reference resolution, which assumes that multiple factors can play a role in reference resolution and that the relative contributions of these factors can be different for different anaphoric forms (Kaiser 2003b, Kaiser & Trueswell in press).
Dealing with alternatives
(2006)
Traditionally, pure additive particles and scalar additive particles are both characterized by an existential presupposition. They differ insofar as the set of alternatives that is built is unordered for the former, and ordered for the latter, which carry the so-called scalar presupposition. As a result, the two characterisations cannot be cumulated, an impossibility that is at odds with the fact that several languages exhibit this combination of readings for a single item. The discussion of Italian neanche '(n)either/(not) even', an item that can both be additive and scalar, allows us to expose the connection between the oppositions non-ordered vs ordered set of alternatives and verified vs accommodated existential presupposition by adding content to the traditional view that the set of alternatives is made up of 'relevant' items in the context. The question of how to characterise this item is set against the backdrop of a more general discussion of the network of additive particles found in Italian.
Functions of English "man"
(2006)
This paper discusses the semantics of the English particle man. It is shown that this particle does different things when used sentence-initially and sentence-finally. The sentenceinitial use is further shown to separate into two distinct intonational types with different semantic content. A formal semantics is proposed for these types.
We propose a compositional analysis for sentences of the kind "You only have to go to the North End to get good cheese", referred to as the Sufficiency Modal Construction in the recent literature. We argue that the SMC is ambiguous depending on the kind of ordering induced by only. So is the exceptive construction – its cross-linguistic counterpart. Only is treated as inducing either a 'comparative possibility' scale or an 'implication-based' partial order on propositions. The properties of the 'comparative possibility' scale explain the absence of the prejacent presupposition that is usually associated with only. By integrating the scalarity into the semantics of the SMC, we explain the polarity facts observed in both variants of the construction. The sufficiency meaning component is argued to be due to a pragmatic inference.
The German causal preposition durch ('by', 'through') poses a challenge to formal-semantic analyses applying strict compositionality. To deal with this challenge, a formalism which builds on recent important developments in Discourse Representation Theory is developed, including a more elaborate analysis of presuppositional phenomena as well as the integration into the theory of unification as a mode of composition. It is argued that that the observed unificational phenomena belong in the realm of pragmatics, providing an argument for presuppositional phenomena at a sentence- and word-internal level.
This paper presents two experimental studies investigating the processing of presupposed content. Both studies employ the German additive particle auch (too). In the first study, participants were given a questionnaire containing bi-clausal, ambiguous sentences with 'auch' in the second clause. The presupposition introduced by auch was only satisfied on one of the two readings of the sentence, and this reading corresponded to a syntactically dispreferred parse of the sentence. The prospect of having the auch-presupposition satisfied made participants choose this syntactically dispreferred reading more frequently than in a control condition. The second study used the self-paced-reading paradigm and compared the reading times on clauses containing auch, which differed in whether the presupposition of auch was satisfied or not. Participants read the clause more slowly when the presupposition was not satisfied. It is argued that the two studies show that presuppositions play an important role in online sentence comprehension and affect the choice of syntactic analysis. Some theoretical implications of these findings for semantic theory and dynamic accounts of presuppositions as well as for theories of semantic processing are discussed.
Multiple modals construction
(2006)
Modal items of different semantic types can only be combined in a specific order. Epistemic items, for instance, cannot be embedded under deontic ones. I'll argue that this fact cannot be explained by the current semantic theories of modality. A solution to this problem will be developed in an update semantics framework. On the semantic side, a distinction will be drawn between circumstantial information about the world and information about duties, whereas I'll use Nuyts' notion of m-performativity to account for certain use of the modal items.
The expressions few and a few are typically considered to be separate quantifiers. I challenge this assumption, showing that with the appropriate definition of few, a few can be derived compositionally as a + few. The core of the analysis is a proposal that few has a denotation as a one-place predicate which incorporates a negation operator. From this, argument interpretations can be derived for expressions such as few students and a few students, differing only in the scope of negation. I show that this approach adequately captures the interpretive differences between few and a few. I further show that other such pairs are blocked by a constraint against the vacuous application of a.
Kripke's "modal argument" uses consideration about scope within modal contexts to show that proper names and definite descriptions must be of two different semantic types. I reexamine the data that is used to motivate Kripke's argument, and suggest that it, in fact, indicates that proper names behave exactly like a certain type of definite description, which I call "particularized" descriptions.
Semantic and pragmatic properties of the Yorùbá focus construction have not been fully examined. This paper investigates presupposition, exhaustivity effects, and felicity conditions in some of its attested forms. Yorùbá focus does not trigger existence presuppositions, it does not have any obligatory exhaustivity effects, and argument focus and predicate focus behave differently with respect to question-answer congruence. These properties are compatible Déchaine’s analysis (2002) of Yorùbá focus as inverse predication, essentially a type of cleft.
This paper revisits the question of whether propositions in situation semantics must be persistent (Kratzer (1989)). It shows that ignoring persistence causes empirical problems to theories which use quantification over minimal situations as a solution for donkey anaphora (Elbourne (2005)), while at the same time modifying these theories to incorporate persistence makes them incompatible with the use of situations for contextual restriction (Kratzer (2004)).
Starting from the basic observation that, across languages, the anticausative variant of an alternating verb systematically involves morphological marking that is shared by passive verbs, the goal of this paper is to provide a uniform and formal account of these arguably two different construction types. The central claim that I put forward is that passives and anticausatives differ only with respect to the event-type features of the verb but both arise through the same operation, namely suppression by special morphology of a feature in v that encodes the ontological event type of the verb. Crucially, I argue for two syntactic primitives, namely act and cause, whereto I trace the passive/anticausative distinction. Passive constructions across languages are made compatible by relegating the differences to simple combinatorial properties of verb and prepositional types and their interactions with other event functors, which are in turn encoded differently morphologically across languages. New arguments are brought forward for a causative analysis of anticausatives. Agentive adverbials are examined, and doubt is cast on the usefulness of by-phrases as a diagnostic for argumenthood.
Khoekhoe syntax exhibits an unusually flexible constituent structure. Any constituent with a lexical head can be preposed into the focal initial slot immediately before the PGN-marker that marks the subject position. Two strategies of focalisation by foregrounding need to be distinguished: inversion and fronting. Inversion amounts to an inversion of subject and predicate in their entirety. Such sentences have two readings, though, according to their underlying constituent structure: "predicative" or "copulative". Fronting amounts to the preposing of a lexical constituent into the focal initial slot, with subsequent dislocation of the lexical specification of the subject from that slot.
The present analysis has wider implications, particularly: The generally accepted view that Khoekhoe has coreferential/equational "copulative" sentences of the type NPsubject = NPcomplement is a fallacy. Such sentences actually are sentences with their predicate fronted into the focal initial slot. They amount to cleft constructions.
The fact that the primary focal position is immediately before the PGNmarker of the subject is further independent evidence for the "desentential hypothesis", according to which subject and object NPs in the underlying matrix sentence consist of only an enclitic PGN-marker, and for the claim that Khoekhoe underlyingly is a SVO language, not a SOV language as generally held. By implication these findings affect the analysis of other Central Khoesaan languages.
Languages cross-linguistically differ with respect to whether they accept or ban True Negative Imperatives (TNIs). In this paper I show that this ban follows from three generally accepted assumptions: (i) the fact that the operator that encodes the illocutionary force of an imperative universally takes scope from C°; (ii) the fact that this operator may not be operated on by a negative operator and (iii) the Head Movement Constraint (an instance of Relativized Minimality). In my paper I argue that languages differ too with respect to both the syntactic status (head/phrasal) and the semantic value (negative/non-negative) of their negative markers. Given these difference across languages and the analysis of TNIs based on the three above mentioned assumptions, two typological generalisations can be predicted: (i) every language with an overt negative marker X° that is semantically negative bans TNIs; and (ii) every language that bans TNIs exhibits an overt negative marker X°. I demonstrate in my paper that both typological predictions are born out.
In this paper I discuss four type of bare nominal, and note that, in some sense, all of them appear to imply stereotypicality. I consider an account in terms of Bidirectional Optimality Theory: unmarked (bare) forms give rise to unmarked (stereotypical) interpretations. However, it turns out that, while the form of bare numerals is unmarked, the interpretation sometimes is not. I suggest that the crucial notion is not unmarkedness, but optimal inference: unmarked forms give rise to interpretations that are best used for drawing inferences. I propose a revision of Bidirectional Optimality Theory to reflect this.
This paper discusses a semantic analysis of three syntactic types of English each, namely, floated each, binominal each, and prenominal each. It is argued that floated each consists of two parts, a quantifier and an inaudible element which functions as its restrictor, which together form a tripartite quantificational structure when they compose with the predicate. Binominal each and an associated NP such as two topics (which is generally called the 'distributive share') are syntactically analyzed as forming a subject-predicate relation within a DP in which the NP undergoes so-called 'predicate inversion'. Semantically, binominal each is analyzed as having the same semantic value as floated each, while prenominal each is shown to have a different logical type from floated and binominal each. As can be seen from analogous constructions in some Romance languages, it does not lexically contain its restrictor.
The present study examines native and nonnative perceptual processing of semantic information conveyed by prosodic prominence. Five groups of German learners of English each listened to one of 5 experimental conditions. Three conditions differed in place of focus accent in the sentence and two conditions were with spliced stimuli. The experiment condition was presented first in the learners’ L1 (German) and then in a similar set in the L2 (English). The effect of the accent condition and of the length and position of the target in the sentence was evaluated in a probe recognition task. In both the L1 and L2 tasks there was no significant effect in any of the five focus conditions. Target position and target word length had an effect in the L1 task. Word length did not affect accuracy rates in the L2 task. For probe recognition in the L2, word length and the position of the target interacted with the focus condition.
In this paper, we present the Multiple Annotation approach, which solves two problems: the problem of annotating overlapping structures, and the problem that occurs when documents should be annotated according to different, possibly heterogeneous tag sets. This approach has many advantages: it is based on XML, the modeling of alternative annotations is possible, each level can be viewed separately, and new levels can be added at any time. The files can be regarded as an interrelated unit, with the text serving as the implicit link. Two representations of the information contained in the multiple files (one in Prolog and one in XML) are described. These representations serve as a base for several applications.
This paper investigates the structural properties of morphosyntactically marked focus constructions, focussing on the often neglected non-focal sentence part in African tone languages. Based on new empirical evidence from five Gur and Kwa languages, we claim that these focus expressions have to be analysed as biclausal constructions even though they do not represent clefts containing restrictive relative clauses. First, we relativize the partly overgeneralized assumptions about structural correspondences between the out-of-focus part and relative clauses, and second, we show that our data do in fact support the hypothesis of a clause coordinating pattern as present in clause sequences in narration. It is argued that we deal with a non-accidental, systematic feature and that grammaticalization may conceal such basic narrative structures.
The semantics of ellipsis
(2005)
There are four phenomena that are particularly troublesome for theories of ellipsis: the existence of sloppy readings when the relevant pronouns cannot possibly be bound; an ellipsis being resolved in such a way that an ellipsis site in the antecedent is not understood in the way it was there; an ellipsis site drawing material from two or more separate antecedents; and ellipsis with no linguistic antecedent. These cases are accounted for by means of a new theory that involves copying syntactically incomplete antecedent material and an analysis of silent VPs and NPs that makes them into higher order definite descriptions that can be bound into.
In order to investigate the empirical properties of focus, it is necessary to diagnose focus (or: “what is focused”) in particular linguistic examples. It is often taken for granted that the application of one single diagnostic tool, the so-called question-answer test, which roughly says that whatever a question asks for is focused in the answer, is a fool-proof test for focus. This paper investigates one example class where such uncritical belief in the question-answer test has led to the assumption of rather complex focus projection rules: in these examples, pitch accent placement has been claimed to depend on certain parts of the focused constituents being given or not. It is demonstrated that such focus projection rules are unnecessarily complex and in turn require the assumption of unnecessarily complicated meaning rules, not to speak of the difficulties to give a precise semantic/pragmatic definition of the allegedly involved givenness property. For the sake of the argument, an alternative analysis is put forward which relies solely on alternative sets following Mats Rooth´s work, and avoids any recourse to givenness. As it turns out, this alternative analysis is not only simpler but also makes in a critical case the better predictions.
This paper discusses the use of XSLT stylesheets as a filtering mechanism for refining the results of user queries on treebanks. The discussion is within the context of the TIGER treebank, the associated search engine and query language, but the general ideas can apply to any search engine for XML-encoded treebanks. It will be shown that important classes of linguistic phenomena can be accessed by applying relatively simple XSLT templates to the output of a query, effectively simulating the universal quantifier for a subset of the query language.
We present a system for the linguistic exploration and analysis of lexical cohesion in English texts. Using an electronic thesaurus-like resource, Princeton WordNet, and the Brown Corpus of English, we have implemented a process of annotating text with lexical chains and a graphical user interface for inspection of the annotated text. We describe the system and report on some sample linguistic analyses carried out using the combined thesaurus-corpus resource.
Face-to-face communication is multimodal. In unscripted spoken discourse we can observe the interaction of several “semiotic layers”, modalities of information such as syntax, discourse structure, gesture, and intonation. We explore the role of gesture and intonation in structuring and aligning information in spoken discourse through a study of the co-occurrence of pitch accents and gestural apices. Metaphorical spatialization through gesture also plays a role in conveying the contextual relationships between the speaker, the government and other external forces in a naturally-occurring political speech setting.
Elke Kasimir´s paper (in this volume) argues against employing the notion of Givenness in the explanation of accent assignment. I will claim that the arguments against Givenness put forward by Kasimir are inconclusive because they beg the question of the role of Givenness. It is concluded that, more generally, arguments against Givenness as a diagnostic for information structural partitions should not be accepted offhand, since the notion of Givenness of discourse referents is (a) theoretically simple, (b) readily observable and quantifiable, and (c) bears cognitive significance.
Fronting of an infinite VP across a finite main verb - akin to German "VP-topicalization" - can be found also in Czech and Polish. The paper discusses evidence from large corpora for this process and some of its properties, both syntactic and information-structural. Based on this case, criteria for more user-friedly searching and retrieval of corpus data in syntactic research are being developed.
This paper describes the creation and preparation of TUSNELDA, a collection of corpus data built for linguistic research. This collection contains a number of linguistically annotated corpora which differ in various aspects such as language, text sorts / data types, encoded annotation levels, and linguistic theories underlying the annotation. The paper focuses on this variation on the one hand and the way how these heterogeneous data are integrated into one resource on the other hand.
In many languages, a passive-like meaning may be obtained through a noncanonical passive construction. The get passive (1b) in English, the se faire passive (2b) in French and the kriegen passive (3b) in German represent typical manifestations. This squib focuses on the behavior of the get-passive in English and discusses a number of restrictions associated with it as well as the status of get.
Trubetzkoy's recognition of a delimitative function of phonology, serving to signal boundaries between morphological units, is expressed in terms of alignment constraints in Optimality Theory, where the relevant constraints require specific morphological boundaries to coincide with phonological structure (Trubetzkoy 1936, 1939, McCarthy & Prince 1993). The approach pursued in the present article is to investigate the distribution of phonological boundary signals to gain insight into the criteria underlying morphological analysis. The evidence from English and Swedish suggests that necessary and sufficient conditions for word-internal morphological analysis concern the recognizability of head constituents, which include the rightmost members of compounds and head affixes. The claim is that the stability of word-internal boundary effects in historical perspective cannot in general be sufficiently explained in terms of memorization and imitation of phonological word form. Rather, these effects indicate a morphological parsing mechanism based on the recognition of word-internal head constituents. Head affixes can be shown to contrast systematically with modifying affixes with respect to syntactic function, semantic content, and prosodic properties. That is, head affixes, which cannot be omitted, often lack inherent meaning and have relatively unmarked boundaries, which can be obscured entirely under specific phonological conditions. By contrast, modifying affixes, which can be omitted, consistently have inherent meaning and have stronger boundaries, which resist prosodic fusion in all phonological contexts. While these correlations are hardly specific to English and Swedish it remains to be investigated to which extent they hold cross-linguistically. The observation that some of the constituents identified on the basis of prosodic evidence lack inherent meaning raises the issue of compositionality. I will argue that certain systematic aspects of word meaning cannot be captured with reference to the syntagmatic level, but require reference to the paradigmatic level instead. The assumption is then that there are two dimensions of morphological analysis: syntagmatic analysis, which centers on the criteria for decomposing words in terms of labelled constituents, and paradigmatic analysis, which centers on the criteria for establishing relations among (whole) words in the mental lexicon. While meaning is intrinsically connected with paradigmatic analysis (e.g. base relations, oppositeness) it is not essential to syntagmatic analysis.
It has been shown that visual cues play a crucial role in the perception of vowels and consonants. Conflicting consonantal stimuli presented in the visual and auditory modalities can even result in the emergence of a third perceptual unit (McGurk effect). From a developmental point of view, several studies report that newborns can associate the image of a face uttering a given vowel to the auditory signal corresponding to this vowel; visual cues are thus used by the newborns. Despite the large number of studies carried out with adult speakers and newborns, very little work has been conducted with preschool-aged children. This contribution is aimed at describing the use of auditory and visual cues by 4 and 5-year-old French Canadian speakers, compared to adult speakers, in the identification of voiced consonants. Audiovisual recordings of a French Canadian speaker uttering the sequences [aba], [ada], [aga], [ava], [ibi], [idi], [igi], [ivi] have been carried out. The acoustic and visual signals have been extracted and analysed so that conflicting and non-conflicting stimuli, between the two modalities, were obtained. The resulting stimuli were presented as a perceptual test to eight 4 and 5-year-old French Canadian speakers and ten adults in three conditions: visual-only, auditory-only, and audiovisual. Results show that, even though the visual cues have a significant effect on the identification of the stimuli for adults and children, children are less sensitive to visual cues in the audiovisual condition. Such results shed light on the role of multimodal perception in the emergence and the refinement of the phonological system in children.
In this paper the issue of the nature of the representations of the speech production task in the speaker's brain is addressed in a production-perception interaction framework. Since speech is produced to be perceived, it is hypothesized that its production is associated for the speaker with the generation of specific physical characteristics that are for the listeners the objects of speech perception. Hence, in the first part of the paper, four reference theories of speech perception are presented, in order to guide and to constrain the search for possible correlates of the speech production task in the physical space: the Acoustic Invariance Theory, the Adaptive Variability Theory, the Motor Theory and the Direct-Realist Theory. Possible interpretations of these theories in terms of representations of the speech production task are proposed and analyzed. In a second part, a few selected experimental studies are presented, which shed some light on this issue. In the conclusion, on the basis of the joint analysis of theoretical and experimental aspects presented in the paper, it is proposed that representations of the speech production task are multimodal, and that a hierarchy exists among the different modalities, the acoustic modality having the highest level of priority. It is also suggested that these representations are not associated with invariant characteristics, but with regions of the acoustic, orosensory and motor control spaces.
A fundamental question in the study of speech is about the invariance of the ultimate percepts, or features. The present paper gives an overview of the noninvariance problem and offers some hints towards a solution. Examination of various data on place and voicing perception suggests the following points. Features correspond to natural boundaries between sounds, which are included in the infant's predispositions for speech perception. Adult percepts arise from couplings and contextual interactions between features. Both couplings and interactions contribute to invariance. But this is at the expense of profound qualitative changes in perceptual boundaries implying that features are neither independently nor invariantly perceived. The question then is to understand the principles which guide feature couplings and interactions during perceptual development. The answer might reside in the fact that: (1) adult boundaries converge to a single point of the perceptual space, suggesting a context-free central reference; (2) this point corresponds to the neutral vocoïd, suggesting the reference is related to production; (3) at this point perceptual boundaries correspond to the natural ones, suggesting the reference is anchored in predispositions for feature perception. In sum, perceptual invariance seems to be grounded on a radial representation of the vocal tract around a singular point at which boundaries are context-fee, natural and coincide with the neutral vocoïd.
This paper presents the results of Open Quotient measurements in EGG signals of young (18 to 30 year old) and elderly (59 to 82 year old) male and female speakers. The paper further presents quantitative results on the relation between the OQ and the perception of a speaker's age. Higgins & Saxman (1991) found a decreased OQEGG with increasing age for females, whereas the OQEGG in sustained vowel material increased for males as the speakers age increased. In Linville (2002), however, the spectral amplitudes in the region of F0 (obtained by LTAS-measurements of read speech material) increased with increasing age independent of gender; this could be interpreted indirectly as an increasing OQ. We measured the OQEGG not only for sustained vowels, but also in vowels taken from isolated words. In order to analyse the relation between breathiness in terms of an increased OQ and the mean perceived age per stimulus a perception test was carried out in which listeners were asked to estimate speaker's age based on sustained /a/-vowel stimuli varying in vocal effort (soft - normal - loud) during production. The results indicated the following: (i) The decreased OQ for elderly females originally found by Higgins & Saxman is not apparent in our data for sustained /a/-vowels. For our female speakers no significant difference between the OQ of young and old speakers was found; for elderly males, however, we also found an increasing OQ with increasing age.(ii) In addition, a statistically significant increased OQEGG occurs for the group of the elderly males for the vowels from the word material. (iii) Our results show a strong positive relation between perceived age and OQ in male voices. Regarding (i) and (ii), at least the male speaker's voice becomes more breathy as age increases. Considering (iii), increased breathiness may contribute to the listener’s perception of increased age.
Studying kinematic behavior in speech production is an indispensable and fruitful methodology in order to describe for instance phonemic contrasts, allophonic variations, prosodic effects in articulatory movements. More intriguingly, it is also interpreted with respect to its underlying control mechanisms. Several interpretations have been borrowed from motor control studies of arm, eye, and limb movements. They do either explain kinematics with respect to a fine tuned control by the Central Nervous System (CNS) or they take into account a combination of influences arising from motor control strategies at the CNS level and from the complex physical properties of the peripheral speech apparatus. We assume that the latter is more realistic and ecological. The aims of this article are: first, to show, via a literature review related to the so called '1/3 power law' in human arm motor control, that this debate is of first importance in human motor control research in general. Second, to study a number of speech specific examples offering a fruitful framework to address this issue. However, it is also suggested that speech motor control differs from general motor control principles in the sense that it uses specific physical properties such as vocal tract limitations, aerodynamics and biomechanics in order to produce the relevant sounds. Third, experimental and modelling results are described supporting the idea that the three properties are crucial in shaping speech kinematics for selected speech phenomena. Hence, caution should be taken when interpreting kinematic results based on experimental data alone.
Syllable cut is said to be a phonologically distinctive feature in some languages where the difference in vowel quantity is accompanied by a difference in vowel quality like in German. There have been several attempts to find the corresponding phonetic correlates for syllable cut, from which the energy measurements of vowels by Spiekermann (2000) proved appropriate for explaining the difference between long, i.e. smoothly, and short, i.e. abruptly cut, vowels: in smoothly cut vowels, a larger number of peaks was counted in the energy contour which were located further back than in abruptly cut segments, and the overall energy was more constant throughout the entire nucleus. On this basis, we intended to compare German as a syllable cut language and Hungarian where the feature was not expected to be relevant. However, the phonetic correlates of syllable cut found in this study do not entirely confirm Spiekermann's results. It seems that the energy features of vowels are more strongly connected to their duration than to their quality.
This study reports on the results of an airflow experiment that measured the duration of airflow and the amount of air from release of a stop to the beginning of a following vowel in stop vowel-sequences of German. The sequences involved coronal, labial and velar voiced and voiceless stops followed by the vocoids /j, i:, ı, ɛ, ʊ, a/. The experiment tested the influence of the three factors voicing of stop, place of stop articulation, and the following vocoid context on the duration and amount of air as possible explanation for assibilation processes. The results show that the voiceless stops are related to a longer duration and more air in the release phase than voiced ones. For the influence of the vocoids, a significant difference could be established between /j/ and all other vocoids for the duration of the release phase. This difference could not be found for the amount of air over this duration. The place of articulation had only restricted influence. Velars resulted in significantly longer duration of the release phase compared to non-velars. A significant difference in amount of air between the places of articulation could not be found.
The present article is a follow-up study of the investigation of labiodentals in German and Dutch by Hamann & Sennema (2005), where we looked at the perception of the Dutch labiodental three-way contrast by German listeners without any knowledge of Dutch and German learners of Dutch. The results of this previous study suggested that the German voiced labiodental fricative /v/ is perceptually closer to the Dutch approximant /ʋ/ than to the corresponding Dutch voiced labiodental fricative /v/. These perceptual indications are attested by the acoustic findings in the present study. German /v/ has a similar harmonicity median and a similar centre of gravity to Dutch /ʋ/, but differs from Dutch /v/ in these parameters. With respect to the acoustic parameter of duration, German /v/ lies closer to the Dutch /v/ than to the Dutch /ʋ/.
(Non)retroflexivity of slavic affricates and its motivation : Evidence from polish and czech <č>
(2005)
The goal of this paper is two-fold. First, it revises the common assumption that the affricate <č> denotes /t͡ʃ/ for all Slavic languages. On the basis of experimental results it is shown that Slavic <č> stands for two sounds: /t͡ʃ/ as e.g. in Czech and /ʈʂ/ as in Polish.
The second goal of the paper is to show that this difference is not accidental but it is motivated by perceptual relations among sibilants. In Polish, /t͡ʃ/ changed to /ʈʂ/ thus lowering its sibilant tonality and creating a better perceptual distance to /tɕ/, whereas in Czech /t͡ʃ/ did not turn to /ʈʂ/, as the former displayed sufficient perceptual distance to the only affricate present in the inventory, namely, the alveolar /t͡s/. Finally, an analysis of Czech and Polish affricate inventories is offered.
While the perilinguistic child is endowed with predispositions for the categorical perception of phonetic features, their adaptation to the native language results from a long evolution from the end of the first year of age up to the adolescence. This evolution entails both a better discrimination between phonological categories, a concomitant reduction of the discrimination between within-category variants, and a higher precision of perceptual boundaries between categories. The first objective of the present study was to assess the relative importance of these modifications by comparing the perceptual performances of a group of 11 children, aged from 8 to 11 years, with those of their mothers. Our second objective was to explore the functional implications of categorical perception by comparing the performances of a group of 8 deaf children, equipped with a cochlear implant, with normal-hearing chronological age controls. The results showed that the categorical boundary was slightly more precise and that categorical perception was consistently larger in adults vs. normal-hearing children. Those among the deaf children who were able to discriminate minimal distinctions between syllables displayed categorical perception performances equivalent to those of normal-hearing controls. In conclusion, the late effect of age on the categorical perception of speech seems to be anchored in a fairly mature phonological system, as evidenced the fairly high precision of categorical boundaries in pre-adolescents. These late developments have functional implications for speech perception in difficult conditions as suggested by the relationship between categorical perception and speech intelligibility in cochlear implant children.
This paper describes the processing of MRI and CT images needed for developing a 3D linear articulatory model of velum. The 3D surface that defines each organ constitutive of the vocal and nasal tracts is extracted from MRI and CT images recorded on a subject uttering a corpus of artificially sustained French vowels and consonants. First, the 2D contours of the organs have been manually extracted from the corresponding images, expanded into 3D contours, and aligned in a common 3D coordinate system. Then, for each organ, a generic mesh has been chosen and fitted by elastic deformation to each of the 46 3D shapes of the corpus. This has finally resulted in a set of organ surfaces sampled with the same number of 3D vertices for each articulation, which is appropriate for Principal Component Analysis or linear decomposition. The analysis of these data has uncovered two main uncorrelated articulatory degrees of freedom for the velum's movement. The associated parameters are used to control the model. We have in particular investigated the question of a possible correlation between jaw / tongue and velum's movement and have not find more correlation than the one found in the corpus.
This paper contributes to the understanding of vocal folds oscillation during phonation. In order to test theoretical models of phonation, a new experimental set-up using a deformable vocal folds replica is presented. The replica is shown to be able to produce self sustained oscillations under controlled experimental conditions. Therefore different parameters, such as those related to elasticity, to acoustical coupling or to the subglottal pressure can be quantitatively studied. In this work we focused on the oscillation fundamental frequency and the upstream pressure in order to start (on-set threshold) either end (off-set threshold) oscillations in presence of a downstream acoustical resonator. As an example, it is shown how this data can be used in order to test the theoretical predictions of a simple one-mass model.
A visual articulatory model and its application to therapy
of speech disorders : a pilot study
(2005)
A visual articulatory model based on static MRI-data of isolated sounds and its application in therapy of speech disorders is described. The model is capable of generating video sequences of articulatory movements or still images of articulatory target positions within the midsagittal plane. On the basis of this model (1) a visual stimulation technique for the therapy of patients suffering from speech disorders and (2) a rating test for visual recognition of speech movements was developed. Results indicate that patients produce recognition rates above level of chance already without any training and that patients are capable of increasing their recognition rate over the time course of therapy significantly.
In order to investigate the articulatory processes of the hasty and mumbled speech of clutterers, the kinematic variability was analysed by means of electromagnetic midsagittal articulography (EMMA). In contrast to stutterers, clutterers improve their intelligibility by concentrating on their speech task. Variability is an important criterion in comparable studies of stuttering and is discussed in terms of the stability of the speech motor system. The aim of the current study was to analyse the spatial and temporal variability in the speech of three clutterers and three control speakers. All speakers were native speakers of German. The speech material consisted of repetitive CV-syllables and foreign words, because clutterers have the most severe problems with long words which have a complex syllable structure. The results showed a higher quotient of variation for clutterers in the foreign word production. For the syllable repetition task, no significant differences between clutterers and controls were found. The extremely large and variable displacements were interpreted as a strategy that helps clutterers to improve the intelligibility of their speech.
This paper summarizes our research efforts in functional modelling of the relationship between the acoustic properties of vowels and perceived vowel quality. Our model is trained on 164 short steady-state stimuli. We measured F1, F2, and additionally F0 since the effect of F0 on perceptual vowel height is evident. 40 phonetically skilled subjects judged vowel quality using the Cardinal Vowel diagram. The main focus is on refining the model and describing its transformation properties between the F1/F2 formant chart and the Cardinal Vowel diagram. An evaluation of the model based on 48 additional vowels showed the generalizability of the model and confirmed that it predicts perceived vowel quality with sufficient accuracy.
We measure face deformations during speech production using a motion capture system, which provides 3D coordinate data of about 60 markers glued on the speaker's face. An arbitrary orthogonal factor analysis followed by a principal component analysis (together called a guided PCA) of the data has showed that the first 6 factors explain about 90% of the variance, for each of our 3 speakers. The 6 derived factors, therefore, allow us to efficiently analyze or to reconstruct with a reasonable accuracy the observed face deformations. Since these factors can be interpreted in articulatory terms, they can reveal underlying articulatory organizations. The comparison of lip gestures in terms of data derived factors suggests that these speakers differently maneuver the lips to achieve contrast between /s/ and /R/. Such inter-speaker variability can occur because the acoustic contrast of these fricatives is shaped not only by the lip tube but also by cavities inside the mouth such as the sublingual cavity. In other words, these tube and cavity can acoustically compensate each other to produce their required acoustic properties.
The contribution of von Kempelen's "Mechanism of Speech" to the 'phonetic sciences' will be analyzed with respect to his theoretical reasoning on speech and speech production on the one hand and on the other in connection with his practical insights during his struggle in constructing a speaking machine. Whereas in his theoretical considerations von Kempelen's view is focussed on the natural functioning of the speech organs – cf. his membraneous glottis model – in constructing his speaking machine he clearly orientates himself towards the auditory result – cf. the bag pipe model for the sound generator used for the speaking machine instead. Concerning vowel production his theoretical description remains questionable, but his practical insight that vowels and speech sounds in general are only perceived correctly in connection with their surrounding sounds – i.e. the discovery of coarticulation – is clearly a milestone in the development of the phonetic sciences: He therefore dispenses with the Kratzenstein tubes, although they might have been based on more thorough acoustic modelling.
Finally, von Kempelen's model of speech production will be discussed in relation to the discussion of the acoustic nature of vowels afterwards [Willis and Wheatstone as well as von Helmholtz and Hermann in the 19th century and Stumpf, Chiba & Kajiyama as well as Fant and Ungeheuer in the 20th century].
In order to understand the functional morphology of the human voice producing system, we are in need of data on the vocal tract anatomy of other mammalian species. The larynges and vocal tracts of four species of Artiodactyla were investigated in combination with acoustic analyses of their respective calls. Different evolutionary specializations of laryngeal characters may lead to similar effects on sound production. In the investigated species, such specializations are: the elongation and mass increase of the vocal folds, the volume increase of the laryngeal vestibulum by an enlarged thyroid cartilage and the formation of laryngeal ventricles. Both the elongation of the vocal folds and the increase of the oscillating masses lower the fundamental frequency. The influence of an increased volume of the laryngeal vestibulum on sound production remains unclear. The anatomical and acoustic results are presented together with considerations about the habitats and the mating systems of the respective species.
Low- dimensional and speaker-independent linear vocal tract parametrizations can be obtained using the 3-mode PARAFAC factor analysis procedure first introduced by Harshman et al. (1977) and discussed in a series of subsequent papers in the Journal of the Acoustical Society of America (Jackson (1988), Nix et al. (1996), Hoole (1999), Zheng et al. (2003)). Nevertheless, some questions of importance have been left unanswered, e.g. none of the papers using this method has provided a consistent interpretation of the terms usually referred to as "speaker weights". This study attempts an exploration of what influences their reliability as a first step towards their consistent interpretation. With this in mind, we undertook a systematic comparison of the classical PARAFAC1 algorithm with a relaxed version, of it, PARAFAC2. This comparison was carried out on two different corpora acquired by the articulograph, which varied in vowel qualities, consonantal contexts, and the paralinguistic features accent and speech rate. The difference between these statistical approaches can grossly be described as follows: In PARAFAC1, observation units pertain to the same set of variables and the observation units are comparable. In PARAFAC2, observations pertain to the same set of variables, but observation units are not comparable. Such a situation can be easily conceived in a situation such as we are describing: The operationalization we took relies on the comparability of fleshpoint data acquired from different speakers, which need not be a good assumption due to influences like sensor placement and morphological conditions.
In particular, the comparison between the two different approaches is carried out by means of so-called "leverages" on different component matrices originating in regression analysis, calculated as v = diag(A(A A)−1A ) and delivering information on how "influential" a particular loading matrix is for the model. This analysis could potentially be carried out component by component, but we confined ourselves to effects on the global factor structure. For vowels, the most influential loadings are those for the tense cognates of non-palatal vowels. For speakers, the most prominent result is the relative absence of effects of the paralinguistic variables. Results generally indicate that there is quite little influence of the model specification (i.e. PARAFAC1 or PARAFAC2) on vowel and subject components. The patterns for the articulators indicate that there are strong differences between speakers with respect to the most influential measurement as revealed by PARAFAC2: In particular, the most influential y-contribution is the tongue-back for some talkers and the tongue-dorsum for other speakers. With respect to the speaker weights, again, the leverage patterns are very similar for both PARAFAC-versions. These patterns converge with the results of the loading plots, where the articulator profiles seem to be most altered by the use of PARAFAC2. These findings, in general, are interpreted as evidence for the reliability of the PARAFAC1 speaker weights.
Four speakers repeated 8 times 15 sentences containing 'pVp' syllables (V being /a/, /i/ and /u/). The 'pVp' syllables were located in final, penultimate and antepenultimate position relatively to the Intonational Phrase (IP) boundary. They were embedded in lexical words of 1-3 syllables and were either word-initial or word-final. Results show that the closer the vowel in word-final position is to the IP boundary, the longer the duration and the higher the fundamental frequency of the vowel; it is also characterised by larger lip opening gestures. The potential reduction or coarticulation of vowels in wordinitial position compared to their counterparts in word-final position is discussed.
A survey of 170 Tibeto-Burman languages showed 69 with a distinction between inclusive and exclusive first-person plural pronouns, 18 of which also show inclusive- exclusive in Idual. Only the Kiranti languages and some Chin languages have inclusive-exclusive in the person marking. Of the forms of the pronouns involved in the inclusive-exclusive opposition, usually the exclusive form is less marked and historically prior to the inclusive form, and we find the distinction cannot be reconstructed to Proto-Tibeto-Burman or to mid level groupings. Qnly the Kiranti group has marking of the distinction that can be reconstructed to the proto level, and this is also reflected in the person-marking system.
Typology and complexity
(2005)
For the Workshop I was asked to talk about complexity in language from a typological perspective. My way of approaching this topic was to ask myself some questions, and then see where the answers led. The first one was of course, "What sort of system are we looking at complexity in - what kind of system is language?"
Chao Yuen Ren (1892–1982)
(2005)
Y. R. Chao is easily the most famous linguist to have come out of China. Born before the end of the last dynasty in China, he received a traditional Confucian education, but was also one of the first Chinese people to be sent to the West for training in modern Western science (under the Boxer Indemnity Fund). The remarkable breadth and scope of his studies included physics, mathematics, linguistics, musical and literary composition, and translation, and he was a pioneer in many of these fields.
Articulatory token-to-token variability not only depends on linguistic aspects like the phoneme inventory of a given language but also on speaker specific morphological and motor constraints. As has been noted previously (Perkell (1997), Mooshammer et al. (2004)), speakers with coronally high "domeshaped" palates exhibit more articulatory variability than speakers with coronally low "flat" palates. One explanation for that is based on perception oriented control by the speaker. The influence of articulatory variation on the cross sectional area and consequently on the acoustics should be greater for flat palates than for domeshaped ones. This should force speakers with flat palates to place their tongue very precisely whereas speakers with domeshaped palates might tolerate a greater variability. A second explanation could be a greater amount of lateral linguo-palatal contact for flat palates holding the tongue in position. In this study both hypotheses were tested.
In order to investigate the influence of the palate shape on the variability of the acoustic output a modelling study was carried out. Parallely, an EPG experiment was conducted in order to investigate the relationship between palate shape, articulatory variability and linguo-palatal contact.
Results from the modelling study suggest that the acoustic variability resulting from a certain amount of articulatory variability is higher for flat palates than for domeshaped ones. Results from the EPG experiment with 20 speakers show that (1.) speakers with a flat palate exhibit a very low articulatory variability whereas speakers with a domeshaped palate vary, (2.) there is less articulatory variability if there is lots of linguo-palatal contact and (3.) there is no relationship between the amount of lateral linguo-palatal contact and palate shape. The results suggest that there is a relationship between token-to-token variability and palate shape, however, it is not that the two parameters correlate, but that speakers with a flat palate always have a low variability because of constraints of the variability range of the acoustic output whereas speakers with a domeshaped palate may choose the degree of variability. Since linguo-palatal contact and variability correlate it is assumed that linguo-palatal contact is a means for reducing the articulatory variability.
The author presents MASSY, the MODULAR AUDIOVISUAL SPEECH SYNTHESIZER. The system combines two approaches of visual speech synthesis. Two control models are implemented: a (data based) di-viseme model and a (rule based) dominance model where both produce control commands in a parameterized articulation space. Analogously two visualization methods are implemented: an image based (video-realistic) face model and a 3D synthetic head. Both face models can be driven by both the data based and the rule based articulation model.
The high-level visual speech synthesis generates a sequence of control commands for the visible articulation. For every virtual articulator (articulation parameter) the 3D synthetic face model defines a set of displacement vectors for the vertices of the 3D objects of the head. The vertices of the 3D synthetic head then are moved by linear combinations of these displacement vectors to visualize articulation movements. For the image based video synthesis a single reference image is deformed to fit the facial properties derived from the control commands. Facial feature points and facial displacements have to be defined for the reference image. The algorithm can also use an image database with appropriately annotated facial properties. An example database was built automatically from video recordings. Both the 3D synthetic face and the image based face generate visual speech that is capable to increase the intelligibility of audible speech.
Other well known image based audiovisual speech synthesis systems like MIKETALK and VIDEO REWRITE concatenate pre-recorded single images or video sequences, respectively. Parametric talking heads like BALDI control a parametric face with a parametric articulation model. The presented system demonstrates the compatibility of parametric and data based visual speech synthesis approaches.
The goal of our current project is to build a system that can learn to imitate a version of a spoken utterance using an articulatory speech synthesiser. The approach is informed and inspired by knowledge of early infant speech development. Thus we expect our system to reproduce and exploit the utility of infant behaviours such as listening, vocal play, babbling and word imitation. We expect our system to develop a relationship between the sound-making capabilities of its vocal tract and the phonetic/phonological structure of imitated utterances. At the heart of our approach is the learning of an inverse model that relates acoustic and motor representations of speech. The acoustic to auditory mappings uses an auditory filter bank and a self-organizing phase of learning. The inverse model from auditory to vocal tract control parameters is estimated using a babbling phase, in which the vocal tract is essentially driven in a random manner, much like the babbling phase of speech acquisition in infants. The complete system can be used to imitate simple utterances through a direct mapping from sound to control parameters. Our initial results show that this procedure works well for sounds generated by its own voice. Further work is needed to build a phonological control level and achieve better performance with real speech.
It is one of the most highly debated issues in loanword phonology whether loanword adaptations are phonologically or phonetically driven. This paper addresses this issue and aims at demonstrating that only the acceptance of both a phonological as well as a phonetic approximation stance can adequately account for the data found in Japanese. This point will be exemplified with the adaptation of German and French mid front rounded vowels in Japanese. It will be argued that the adaptation of German /oe/ and /ø/ as Japanese /e/ is phonologically grounded, whereas the adaptation of French /oe/ and /ø/ as Japanese /u/ is phonetically grounded. This asymmetry in the adaptation process of German and French mid front rounded vowels and further examples of loans in Japanese lead to the only conclusion that both strategies of loanword adaptation occur in languages. It will be shown that not only perception, but also the influence of orthography, of conventions and the knowledge of the source language play a role in the adaptation process.
Heterogeneity and standardization in data, use, and annotation : a diachronic corpus of German
(2005)
This paper describes the standardization problems that come up in a diachronic corpus: it has to cope with differing standards with regard to diplomaticity, annotation, and header information. Such highly heterogeneous texts must be standardized to allow for comparative research without (too much) loss of information.
In this paper, we discuss the design and implementation of our first version of the database "ANNIS" (ANNotation of Information Structure). For research based on empirical data, ANNIS provides a uniform environment for storing this data together with its linguistic annotations. A central database promotes standardized annotation, which facilitates interpretation and comparison of the data. ANNIS is used through a standard web browser and offers tier-based visualization of data and annotations, as well as search facilities that allow for cross-level and cross-sentential queries. The paper motivates the design of the system, characterizes its user interface, and provides an initial technical evaluation of ANNIS with respect to data size and query processing.
In this paper we review the current state of research on the issue of discourse structure (DS) / information structure (IS) interface. This field has received a lot of attention from discourse semanticists and pragmatists, and has made substantial progress in recent years. In this paper we summarize the relevant studies. In addition, we look at the issue of DS/ISinteraction at a different level—that of phonetics. It is known that both information structure and discourse structure can be realized prosodically, but the issue of phonetic interaction between the prosodic devices they employ has hardly ever been discussed in this context. We think that a proper consideration of this aspect of DS/IS-interaction would enrich our understanding of the phenomenon, and hence we formulate some related research-programmatic positions.
Japanese wh-questions always exhibit focus intonation (FI). Furthermore, the domain of FI exhibits a correspondence to the wh-scope. I propose that this phonology-semantics correspondence is a result of the cyclic computation of FI, which is explained under the notion of Multiple Spell-Out in the recent Minimalist framework. The proposed analysis makes two predictions: (1) embedding of an FI into another is possible; (2) (overt) movement of a wh-phrase to a phase edge position causes a mismatch between FI and wh-scope. Both predictions are tested experimentally, and shown to be borne out.
Results of a production experiment on the placement of sentence accent in German are reported. The hypothesis that German fulfills some of the most widely accepted rules of accent assignment— predicting focus domain integration—was only partly confirmed. Adjacency between argument and verb induces a single accent on the argument, as recognized in the literature, but interruption of this sequence by a modifier often induces remodeling of the accent pattern with a single accent on the modifier. The verb is rarely stressed. All models based on linear alignment or adjacency between elements belonging to a single accent domain fail to account for this result. A cyclic analysis of prosodic domain formation is proposed in an optimality-theoretic framework that can explain the accent pattern.
In morphological systems of the agglutinative type we sometimes encounter a nearly perfect one-to-one relation between form and function. Turkish inflectional morphology is, of course, the standard textbook example. Things seem to be quite different in systems of the flexive type. Declension in Contemporary Standard Russian (henceforth Russian, for short) may be cited as a typical example: We find, among other things, cumulative markers, “synonymous” endings (e.g., dative singular noun forms in -i, -e, or -u), and “homonymous” endings (e.g., -i, genitive, dative, and prepositional singular). True, some endings are more of an agglutinative nature, being bound to a specific case-number combination and applying across declensions, e.g., -am (dative plural, all nouns); and some cross the boundaries of word classes, e.g., -o, which serves as the nominative/accusative singular ending of neuter forms of pronouns (and adjectives) and as the nominative/accusative singular ending of (most) neuter nouns as well. Still, many observers have been struck by the impression that what we face here are rather uneconomic or even, so to speak, unnatural structures. But perhaps flexive systems are not as complicated as they seem. What seems to be uneconomic complexity may be, at least partially, an artifact of uneconomic descriptions.
On the syntax and pragmatics interface : Left-peripheral, medial and right-peripheral focus in greek
(2004)
The present paper explores the extent to which narrow syntax is responsible for the computation of discourse functions such as focus/topic. More specifically, it challenges the claim that language approximates ‘perfection’ with respect to economy, conceptual necessity and optimality in design by reconsidering the roles and interactions of the different modules of the grammar, in particular of syntax and phonology and the mapping between the two, in the representation of pragmatic notions. Empirical and theoretical considerations strongly indicate that narrow syntax is ‘blind’ to properties and operations involving the interpretive components — that is, PF and LF. As a result, syntax-phonology interface rules do not ‘see’ everything in the levels they connect. In essence, the architecture of grammar proposed here from the perspective of focus marking necessitates the autonomy of the different levels of grammar, presupposing that NS is minimally structured only when liberated from any non-syntactic/discourse implementations, i.e., movement operations to satisfy both interface needs. As a result, the model articulated here totally dispenses with discourse projections, i.e. FocusP.
Dislocation without movement
(2004)
This paper argues that French Left-Dislocation is a unified phenomenon whether it is resumed by a clitic or a non-clitic element. The syntactic component is shown to play a minimal role in its derivation: all that is required is that the dislocated element be merged by adjunction to a Discourse Projection (generally a finite TP with root properties). No agreement or checking of a topic feature is necessary, hence no syntactic movement of any sort need be postulated. The so-called resumptive element is argued to be a full-fledged pronoun rather than a true syntactic resumptive.
In this paper topic and focus effects at both left and right periphery are argued to be epiphenomena of general properties of tree growth. We incorporate Korean into this account as a prototypical verb-final language, and show how long- and short-distance scrambling form part of this general picture. Multiple long-distance scrambling effects emerge as a consequence of the feeding relationship between different forms of structural under-specification. We also show how the array of effects at the right periphery, in both verb-final and other language-types, can also be explained with the same concepts of tree growth. In particular the Right Roof Constraint, a well-known but little understood constraint, is an immediate consequence of compositionality constraints as articulated in this system.
Chicheŵa, a Bantu language of East Central Africa, displays mixed properties of configurationality such as the existence of VP, on the one hand, and discontinuous constituents (DCs), on the other. In the present work we examine the discourse and syntactic properties of DCs, and show that DCs in Chicheŵa arise naturally from the discourse-configurational nature of the language. We argue that the fronted DCs in Chicheŵa are contrastive topics that appear in a leftdislocated external topic position, with the remnant part of the split NP in the right-dislocated topic position. Once the precise discourse functions of DCs are properly integrated into the syntactic analysis, all the facts and restrictions observed in Chicheŵa DCs can be explained in a straightforward fashion.
The bulk of this paper deals with an analysis of the voice system of Tukang Besi, which, has both a complex verbal agreement system as well as the last fully developed (and obligatory) case marking system among Austronesian languages with an increasingly head-marking trend to the east (case marking of core constituents only becomes functional again in Vanuatu and the Solomons, and is well-developed in Polynesia). For that reason, as well as personal acquaintance with the language, it is a sensible starting point.
We argue that Malagasy (and related W. Austronesian languages!) has a positive setting for a macro-parameter RICH VOICE MORPHOLOGY which builds complex predicates that code the theta role of their argument: S = [[PreN(6) + (X)] + DP]. Manifestations of this parameter are: (1) Case and theta role are assigned in situ in nuclear clauses with no movement or co-indexing to a topic position. (2) Relative Clauses (and other "extraction" structures) satisfy the "Subjects Only" constraint, again with no movement or indexing. (3) UTAH is freely violated, as theta role assignment derives from compositional semantic interpretation. Predicates resemble lexical Ns in assigning case directly to arguments without using Prepositions and in combining directly with Dets to form DPs that include tense and negation (Keenan 1995, 2000). The major Predicate-Argument type is modeled on the Noun+Possessor one, not the Verb+Object one.
The purpose of this research was to trace the developmental steps in the acquisition of aspectual oppositions in Russian and to examine the validity of the 'aspect before tense' hypothesis for L1-speaking children. Imperfective/perfective verbs and their inflections, as well as aspectual pairs, were analysed in the first five months of verb production (and the respective months in the input) in three children. Additionally, the first four months of verb production were investigated in one boy with less data. Verb forms marked for the past and for the present occur simultaneously in all children. These early forms relate to 'here and now' situations: verbs marked for the past denote 'resultative' events that are perceived by the children as occurring during the speech time or immediately before it, while verbs marked for the present typically denote on-going events. Thus, with early tense oppositions (or tense morphology) children mark aspectual contrasts in the moment of speech: evidence in favour of the 'aspect before tense' hypothesis.
A strong preference in using the perfective aspect for the past and the imperfective aspect for the present events has been found in both adults and children. Further, only very few aspectual pairs were documented within the analysed period (from the onset of verb production to the period when children produce rule-driven inflectional forms). The productive use of the finite forms of perfective and imperfective verbs doesn't concord with the ability of the productive use of the contrastive forms of one lemma. Data suggest that children (start to) learn aspectual forms in an item-based manner. The acquisition of aspectual oppositions (aspectual pairs) is lexically dependent and is guided by the contextual 'thesaurus'. Aspectual pairs are learned in a peace-meal way during much longer, than observed for this article, period of time. Generally, aspect is not learned as a rule, also because there are no (uniform) rules of forming of aspectual pairs, but as the 'satellite' of the inherent lexical meaning of verbs of diverse Aktionsarten.
The issues addressed here are relevant for other Slavic languages, exhibiting the morphological category of aspect.
This paper discusses critically a number of developments at the heart of current syntactic theory. These include the postulation of a rich sequence of projections at the left periphery of the sentence; the idea that movement is tied to the need to eliminate uninterpretable features; and the conception put forward by Chomsky and others that advances in the past decade have made it reasonable to raise the question about whether language might be in some sense ‘perfect’. However, I will argue that there is little motivation for a highly-articulated left-periphery, that there is no connection between movement and uninterpretable features, and that there is no support for the idea that language might be perfect.
This article analyses the German discourse particle wohl 'I suppose', 'presumably' as a syntactic and semantic modifier of the sentence types declarative and interrogative. It is shown that wohl does not contribute to the propositional, i.e. descriptive content of an utterance. Nor does it trigger an implicature. The proposed analysis captures the semantic behaviour of wohl by assuming that it moves to SpecForceP at LF, from where it can modify the sentence type operators in Force0 in compositional fashion. Semantically, a modification with wohl results in a weaker commitment to the proposition expressed in declaratives and in a request for a weaker commitment concerning the questioned proposition in interrogatives. Cross-linguistic evidence for a left-peripheral position of wohl (at LF) comes from languages in which the counterpart of wohl occurs in the clausal periphery overtly. Overall, the analysis sheds more light on the semantic properties of the left periphery, in particular of the functional projection ForceP.
[W]hy are not all Malagasy adverbs postverbal with reverse Cinque order? The predicate raising mechanism […] operates around heads, and this leads Rackowski & Travis (2000: 122) to suggest that preverbal adverbs are not heads, but are phrasal, and are located in the Specifier positions themselves. The crucial consequence of this is that the specifier position is blocked, thus effectively preventing further predicate raising. Given that the entire analysis crucially rests on the assumption that certain elements are heads and others are phrases, it would be an advantage if some independent evidence for the X I XP status of the elements could be unearthed. Unfortunately, such evidence is hard to come by in Malagasy. However, other Austronesian languages with similar word order patterns do display rather robust evidence for the head status of certain elements. One such language in the Formosan language Seediq.
This work examines English echo questions (EQs) against the background of Rizzi's (1997) analysis of split CP. It argues that EQs do not behave as the split CP analysis predicts that they should, and that their behavior can instead be straightforwardly explained within the classic CP analysis. Further, what are termed here 'echo negations' of negative inversion constructions are shown not to parallel EQs, a surprising result if negative inversion architecture parallels question architecture, as claimed by split CP proponents. In general, classic CP architecture is more appropriate for analysing this range of phenomena.
This paper takes a close look at the properties of Hungarian relative clauses that occur in the left periphery of the main clause, preceding a (pro)nominal associate. It will be shown that these left-peripheral relative clauses differ in many ways from relative clauses dislocated on the right periphery, as well as from relative clauses embedded under a (pro)nominal head. To capture the precise syntax of these left-peripheral clauses, these will be compared to ordinary left-dislocated items, with which they have some properties in common. Despite the surface similarities between the two, however, there are a few decisive aspects of behaviour, most notably, distributional properties and connectivity effects, which argue against taking left-peripheral relatives as cases of clausal left-dislocates in Hungarian. Instead, one is led to consider these as correlative clauses, on the basis of the properties they share with well-established correlatives in languages like Hindi.
As part of a major project on the syntactic organisation of written discourse in the recent history of the English language, this paper tackles the distribution of sentences comprising left-dislocated constituents in a corpus of texts from late Middle English onwards. Once the phenomenon of left dislocation has been properly defined, this investigation will concentrate on the analysis of the corpus in the following directions: (i) statistical evolution of left dislocation in the recent history of the English language; (ii) the influence of orality and genre on left dislocation; (iii) information conveyed by the left-dislocated material, that is, the discourse-based referentiality potential of the left-dislocated constituents in terms of recoverability, and its association with end-focus; and (iv) grammatical complexity of the left-dislocated material and its association with end-weight.