Refine
Year of publication
Document Type
- Article (21)
- Part of a Book (17)
- Conference Proceeding (13)
- Working Paper (6)
- Book (3)
- Doctoral Thesis (1)
- Preprint (1)
Language
- English (49)
- German (9)
- Portuguese (2)
- French (1)
- Spanish (1)
Has Fulltext
- yes (62)
Is part of the Bibliography
- no (62) (remove)
Keywords
- Phonetik (62) (remove)
Institute
O Rio Grande do Sul (RS) recebeu, em 1824, os primeiros imigrantes alemães vindos da região do Hunsrück, Alemanha (ALTENHOFEN 1996). Posteriormente, por volta de 1873, chegaram os imigrantes alemães da Boêmia, região da atual República Tcheca (HABEL 2017). Com isso, ocorreu um intenso contato linguístico entre essas variedades dialetais. Como ainda não foram identificados estudos sobre o sistema consonantal dessa língua falada pelos descendentes de alemães boêmios, é feita, inicialmente, uma revisão bibliográfica do alemão Standard e da variedade de imigração alemã, Hunsrückisch. Assim, o objetivo desse estudo é fazer uma descrição preliminar de alguns aspectos do sistema fonético-fonológico das consoantes do alemão boêmio em um ponto de pesquisa do RS, tendo como base, em especial, o alemão Standard.
Prosodie ist nach neuesten Untersuchungen das wichtigste Merkmal der deutschen Sprache und umfasst auditiv wahrnehmbare Merkmale wie Akzent, Rhythmus, Stimmfarbe, Melodie, Lautheit, Sprechgeschwindigkeit, Pausen usw. Die Funktionen, die durch die Prosodie im Deutschen erfüllt werden, sind sehr vielschichtig und tragen eindeutig zur besseren Verständlichkeit und zum reibungsloseren Verlauf der Kommunikation bei. In den letzten Jahren hat man zahlreiche Untersuchungen auf dem Gebiet phonetischer Fehlleistungen ausländischer Deutschlerner durchgeführt. Es zeigte sich rasch, dass die Fehler, die den kommunikativen Erfolg von Sprechakten am stärksten beeinträchtigen, in den Bereich der prosodischen Realisierung fallen.
Das Ziel dieses Beitrags ist es, die Notwendigkeit der Aussprachevermittlung in der Deutschlehrerausbildung an brasilianischen Hochschulen zu unterstreichen. Der Beitrag präsentiert zuerst eine Diskussion über die Rolle der Aussprachvermittlung im Fremdsprachunterricht in den Zeiten nach dem Gemeinsamen Europäischen Referenzrahmen. Darüber hinaus wird der Begriff Akzent sowie dessen Einfluss bzw. Relevanz beim Erlernen einer Fremdsprache thematisiert. Anschließend werden einige Herangehensweisen vorgeschlagen, um die Phonetikarbeit an brasilianischen Hochschulen konsequent durchzuführen.
Zum Informationsangebot deutscher Aussprachewörterbücher für tschechische Germanistikstudierende
(2013)
Aussprachewörterbücher stellen für nichtmuttersprachliche Germanistikstudierende nach wie vor eine der wichtigsten Informationsquellen über die gesprochene Form der deutschen Gegenwartssprache dar. Beim Vergleich der gängigen Aussprachewörterbücher des Deutschen kann man jedoch gravierende Unterschiede feststellen, die einen Nichtmuttersprachler leicht zu desorientieren (wenn nicht gerade zu verwirren) vermögen. Im Beitrag werden die zwei gebräuchlichsten Aussprachewörterbücher der deutschen Sprache (DUDEN Das Aussprachewörterbuch und De Gruyter Deutsches Aussprachewörterbuch) im Hinblick auf das darin enthaltene Informationsangebot für tschechische Germanistikstudierende verglichen. In den Vordergrund tritt somit die Darstellung der für tschechische Deutschstudierende problematischen phonetischen Erscheinungen. Des Weiteren wird folgenden Aspekten Aufmerksamkeit geschenkt: konzeptionelle Ausgangspunkte, Umfang und Stichwörterauswahl, theoretische Beschreibung des phonetisch-phonologischen Systems des Deutschen sowie der deutschen Standardlautung, Aktualitätsbezug, Beschreibung von regionalen Varietäten, Berücksichtigung der Phonostilistik usw.
Dieser Aufsatz führt in Grundbegriffe der deutschen Intonation ein und diskutiert ihre Relevanz für den Unterricht des Deutschen als Fremdsprache, vor allem in Brasilien. Für Muttersprachler des Portugiesischen, die Deutsch lernen, ist die Intonation wahrscheinlich eine größere Herausforderung als die Phonetik der Einzellaute. Das System der Töne, Tonbewegungen und Äußerungsakzente sowie ihre Beiträge zur Äußerungsbedeutung werden am Beispiel von Aussage- und Fragesätzen dargestellt. Den Abschluss bilden konkrete Übungsvorschläge zur Intonation im DaF-Unterricht.
Analyse ausgewählter (sprachwissenschaftlicher) Nachschlagewerke und ihrer Benutzerfreundlichkeit
(2013)
This metalexicographic study compares four selected (linguistic) dictionaries. The results will potentially assist teachers in choosing a dictionary for students of German in the early phase of their studies. Selected lemmas are analyzed in a case study from the perspective of linguistic, academic and additional information; these lemmas are taken from the categories of phonetics and phonology, as these disciplines are generally covered first in a German degree programme. The selected dictionaries are also compared in terms of the range of phonetic and phonological concepts covered.
This article presents new experimental data on the phonetics of syllabic /l/ and syllabic /n/ in Southern British English and then proposes a new phonological account of their behaviour. Previous analyses (Chomsky and Halle 1968:354, Gimson 1989, Gussmann 1991 and Wells 1995) have proposed that syllabic /l/ and syllabic /n/ should be analysed in a uniform manner. Data presented here, however, shows that syllabic /l/ and syllabic /n/ behave in very different ways, and in light of this, a unitary analysis is not justified. Instead, a proposal is made that syllabic /l/ and syllabic /n/ have different phonological structures, and that these different phonological structures explain their different phonetic behaviours.
This article is organised as follows: First a general background is given to the phenomenon of syllabic consonants both cross linguistically and specifically in Southern British English. In §3 a set of experiments designed to elicit syllabic consonants are described and in §4 the results of these experiments are presented. §5 contains a discussion on data published by earlier authors concerning syllabic consonants in English. In §6 a theoretical phonological framework is set out, and in §7 the results of the experiments are analysed in the light of this framework. In the concluding section, some outstanding issues are addressed and several areas for further research are suggested.
In this paper the issue of the nature of the representations of the speech production task in the speaker's brain is addressed in a production-perception interaction framework. Since speech is produced to be perceived, it is hypothesized that its production is associated for the speaker with the generation of specific physical characteristics that are for the listeners the objects of speech perception. Hence, in the first part of the paper, four reference theories of speech perception are presented, in order to guide and to constrain the search for possible correlates of the speech production task in the physical space: the Acoustic Invariance Theory, the Adaptive Variability Theory, the Motor Theory and the Direct-Realist Theory. Possible interpretations of these theories in terms of representations of the speech production task are proposed and analyzed. In a second part, a few selected experimental studies are presented, which shed some light on this issue. In the conclusion, on the basis of the joint analysis of theoretical and experimental aspects presented in the paper, it is proposed that representations of the speech production task are multimodal, and that a hierarchy exists among the different modalities, the acoustic modality having the highest level of priority. It is also suggested that these representations are not associated with invariant characteristics, but with regions of the acoustic, orosensory and motor control spaces.
A fundamental question in the study of speech is about the invariance of the ultimate percepts, or features. The present paper gives an overview of the noninvariance problem and offers some hints towards a solution. Examination of various data on place and voicing perception suggests the following points. Features correspond to natural boundaries between sounds, which are included in the infant's predispositions for speech perception. Adult percepts arise from couplings and contextual interactions between features. Both couplings and interactions contribute to invariance. But this is at the expense of profound qualitative changes in perceptual boundaries implying that features are neither independently nor invariantly perceived. The question then is to understand the principles which guide feature couplings and interactions during perceptual development. The answer might reside in the fact that: (1) adult boundaries converge to a single point of the perceptual space, suggesting a context-free central reference; (2) this point corresponds to the neutral vocoïd, suggesting the reference is related to production; (3) at this point perceptual boundaries correspond to the natural ones, suggesting the reference is anchored in predispositions for feature perception. In sum, perceptual invariance seems to be grounded on a radial representation of the vocal tract around a singular point at which boundaries are context-fee, natural and coincide with the neutral vocoïd.
It has been hypothesized that sounds which are less perceptible are more likely to be altered than more salient sounds, the rationale being that the loss of information resulting from a change in a sound which is difficult to perceive is not as great as the loss resulting from a change in a more salient sound. Kohler (1990) suggested that the tendency to reduce articulatory movements is countered by perceptual and social constraints, finding that fricatives are relatively resistant to reduction in colloquial German. Kohler hypothesized that this is due to the perceptual salience of fricatives, a hypothesis which was supported by the results of a perception experiment by Hura, Lindblom, and Diehl (1992). These studies showed that the relative salience of speech sounds is relevant to explaining phonological behavior. An additional factor is the impact of different acoustic environments on the perceptibility of speech sounds. Steriade (1997) found that voicing contrasts are more common in positions where more cues to voicing are available. The P-map, proposed by Steriade (2001a, b), allows the representation of varying salience of segments in different contexts. Many researchers have posited a relationship between speech perception and phonology. The purpose of this paper is to provide experimental evidence for this relationship, drawing on the case of Turkish /h/ deletion.
This article deals with the Tashlhiyt dialect of Berber (henceforth TB) spoken in the southern part of Morocco. In TB, words may consist entirely of consonants without vowels and sometimes of only voiceless obstruents, e.g. tft#tstt "you rolled it (fem)". In this study we have carried out acoustic, video-endoscopic and phonological analyses to answer the following question: is schwa, which may function as syllabic, a segment at the level of phonetic representations in TB? Video-endoscopic films were made of one male native speaker of TB, producing a list of forms consisting entirely of voiceless obstruents. The same list was produced by 7 male native speakers of TB for the acoustic analysis. The phonological analysis is based on the behaviour of vowels with respect to the phonological rule of assibilation. This study shows the absence of schwa vowels in forms consisting of voiceless obstruents.
The current paper explores these two sorts of phonetic explanations of the relationship between syllabic position and the voicing contrast in American English. It has long been observed that the contrast between, for example, /p/ and /b/ is expressed differently, depending on the position of the stop with respect to the vowel. Preceding a vowel within a syllable, the contrast is largely one of aspiration. /p/ is aspirated, while /b/ is voiceless, or in some dialects voiced or even an implosive. Following a vowel within a syllable, both /p/ and /b/ both tend to lack voicing in the closure and the contrast is expressed largely by dynamic differences in the transition between the previous vowel and the stop. Here, vowel and closure duration are negatively correlated such that the /p/ has a shorter vowel and longer closure duration. This difference is often enhanced by the addition of glottalization to /p/. In addition to these differences, there are additional differences connected to higher-level organization involving stress and feet edges. To make the current discussion more tractable, we will restrict ourselves to the two conditions (CV and VC) laid out above.
In this study, cross-dialectal variation in the use of the acoustic cues of VOT and F0 to mark the laryngeal contrast in Korean stops is examined with Chonnam Korean and Seoul Korean. Prior experimental results (Han & Weitzman, 1970; Hardcastle, 1973; Jun, 1993 &1998; Kim, C., 1965) show that pitch values in the vowel onset following the target stop consonants play a supplementary role to VOT in designating the three contrastive laryngeal categories. F0 contours are determined in part by the intonational system of a language, which raises the question of how the intonational system interacts with phonological contrasts. Intonational difference might be linked to dissimilar patterns in using the complementary acoustic cues of VOT and F0. This hypothesis is tested with 6 Korean speakers, three Seoul Korean and three Chonnam Korean speakers. The results show that Chonnam Korean involves more 3-way VOT and a 2-way distinction in F0 distribution in comparison to Seoul Korean that shows more 3-way F0 distribution and a 2-way VOT distinction. The two acoustic cues are complementary in that one cue is rather faithful in marking 3-way contrast, while the other cue marks the contrast less distinctively. It also seems that these variations are not completely arbitrary, but linked to the phonological characteristics in dialects. Chonnam Korean, in which the initial tonal realization in the accentual phrase is expected to be more salient, tends to minimize the F0 perturbation effect from the preceding consonants by taking more overlaps in F0 distribution. And a 3-way distribution of VOT in Chonnam Korean, as compensation, can be also understood as a durational sensitivity. Without these characteristics, Seoul Korean shows relatively more overlapping distribution in VOT and more 3-way separation in F0 distribution.
This paper presents the results of Open Quotient measurements in EGG signals of young (18 to 30 year old) and elderly (59 to 82 year old) male and female speakers. The paper further presents quantitative results on the relation between the OQ and the perception of a speaker's age. Higgins & Saxman (1991) found a decreased OQEGG with increasing age for females, whereas the OQEGG in sustained vowel material increased for males as the speakers age increased. In Linville (2002), however, the spectral amplitudes in the region of F0 (obtained by LTAS-measurements of read speech material) increased with increasing age independent of gender; this could be interpreted indirectly as an increasing OQ. We measured the OQEGG not only for sustained vowels, but also in vowels taken from isolated words. In order to analyse the relation between breathiness in terms of an increased OQ and the mean perceived age per stimulus a perception test was carried out in which listeners were asked to estimate speaker's age based on sustained /a/-vowel stimuli varying in vocal effort (soft - normal - loud) during production. The results indicated the following: (i) The decreased OQ for elderly females originally found by Higgins & Saxman is not apparent in our data for sustained /a/-vowels. For our female speakers no significant difference between the OQ of young and old speakers was found; for elderly males, however, we also found an increasing OQ with increasing age.(ii) In addition, a statistically significant increased OQEGG occurs for the group of the elderly males for the vowels from the word material. (iii) Our results show a strong positive relation between perceived age and OQ in male voices. Regarding (i) and (ii), at least the male speaker's voice becomes more breathy as age increases. Considering (iii), increased breathiness may contribute to the listener’s perception of increased age.
Table of Contents:
T. A. Hall (Indiana University): English syllabification as the interaction of markedness constraints
Antony D. Green: Opacity in Tiberian Hebrew: Morphology, not phonology
Sabine Zerbian (ZAS Berlin): Phonological Phrases in Xhosa (Southern Bantu)
Laura J. Downing (ZAS Berlin): What African Languages Tell Us About Accent Typology
Marzena Zygis (ZAS Berlin): (Un)markedness of trills: the case of Slavic r-palatalisation
Laura J. Downing (ZAS Berlin), Al Mtenje (University of Malawi), Bernd Pompino-Marschall (Humboldt-Universitat Berlin): Prosody and Information Structure in Chichewa
T. A. Hall (Indiana University). Silke Hamann (ZAS Berlin), Marzena Zygis (ZAS Berlin): The phonetics of stop assibilation
Christian Geng (ZAS Berlin), Christine Mooshammer (Universitat Kiel): The Hungarian palatal stop: phonological considerations and phonetic data
In the research field initiated by Lindblom & Liljencrants in 1972, we illustrate the possibility of giving substance to phonology, predicting the structure of phonological systems with nonphonological principles, be they listener-oriented (perceptual contrast and stability) or speaker-oriented (articulatory contrast and economy). We proposed for vowel systems the Dispersion-Focalisation Theory (Schwartz et al., 1997b). With the DFT, we can predict vowel systems using two competing perceptual constraints weighted with two parameters, respectively λ and α. The first one aims at increasing auditory distances between vowel spectra (dispersion), the second one aims at increasing the perceptual salience of each spectrum through formant proximities (focalisation). We also introduced new variants based on research in physics - namely, phase space (λ,α) and polymorphism of a given phase, or superstructures in phonological organisations (Vallée et al., 1999) which allow us to generate 85.6% of 342 UPSID systems from 3- to 7-vowel qualities. No similar theory for consonants seems to exist yet. Therefore we present in detail a typology of consonants, and then suggest ways to explain plosive vs. fricative and voiceless vs. voiced consonants predominances by i) comparing them with language acquisition data at the babbling stage and looking at the capacity to acquire relatively different linguistic systems in relation with the main degrees of freedom of the articulators; ii) showing that the places “preferred” for each manner are at least partly conditioned by the morphological constraints that facilitate or complicate, make possible or impossible the needed articulatory gestures, e.g. the complexity of the articulatory control for voicing and the aerodynamics of fricatives. A rather strict coordination between the glottis and the oral constriction is needed to produce acceptable voiced fricatives (Mawass et al., 2000). We determine that the region where the combinations of Ag (glottal area) and Ac (constriction area) values results in a balance between the voice and noise components is indeed very narrow. We thus demonstrate that some of the main tendencies in the phonological vowel and consonant structures of the world’s languages can be explained partly by sensorimotor constraints, and argue that actually phonology can take part in a theory of Perception-for-Action-Control.
Arguing against Bhat’s (1974) claim that retroflexion cannot be correlated with retraction, the present article illustrates that retroflexes are always retracted, though retraction is not claimed to be a sufficient criterion for retroflexion. The cooccurrence of retraction with retroflexion is shown to make two further implications; first, that non-velarized retroflexes do not exist, and second, that secondary palatalization of retroflexes is phonetically impossible. The process of palatalization is shown to trigger a change in the primary place of articulation to non-retroflex. Phonologically, retraction has to be represented by the feature specification [+back] for all retroflex segments.
Consonants exhibit more variation in their phonetic realization than is typically acknowledged, but that variation is linguistically constrained. Acoustic analysis of both read and spontaneous speech reveals that consonants are not necessarily realized with the manner of articulation they would have in careful citation form. Although the variation is wider than one would imagine, it is limited by the phoneme inventory. The phoneme inventory of the language restricts the range of variation to protect the system of phonemic contrast. That is, consonants may stray phonetically into unfilled areas of the language's sound space. Listeners are seldom consciously aware of the consonant variation, and perceive the consonants phonemically as in their citation forms. A better understanding of surface phonetic consonant variation can help make predictions in theoretical domains and advances in applied domains.
Data on lingual movement, dorsopalatal contact and F2 frequency presented in previous papers of ours (Recasens, 2002; Recasens and Pallarès, 2001; Recasens, Pallarès and Fontdevila, 1997) suggest that the degree of articulatory constraint (DAC) model accounts to a large extent for the extent and direction of tongue dorsum coarticulation in VCV and CC sequences. A goal of this investigation is to verify the predictions of this model with respect to jaw V-to-V effects in VCV sequences using articulatory movement data collected with electromagnetic articulometry (EMA).
Articulatory token-to-token variability not only depends on linguistic aspects like the phoneme inventory of a given language but also on speaker specific morphological and motor constraints. As has been noted previously (Perkell (1997), Mooshammer et al. (2004)), speakers with coronally high "domeshaped" palates exhibit more articulatory variability than speakers with coronally low "flat" palates. One explanation for that is based on perception oriented control by the speaker. The influence of articulatory variation on the cross sectional area and consequently on the acoustics should be greater for flat palates than for domeshaped ones. This should force speakers with flat palates to place their tongue very precisely whereas speakers with domeshaped palates might tolerate a greater variability. A second explanation could be a greater amount of lateral linguo-palatal contact for flat palates holding the tongue in position. In this study both hypotheses were tested.
In order to investigate the influence of the palate shape on the variability of the acoustic output a modelling study was carried out. Parallely, an EPG experiment was conducted in order to investigate the relationship between palate shape, articulatory variability and linguo-palatal contact.
Results from the modelling study suggest that the acoustic variability resulting from a certain amount of articulatory variability is higher for flat palates than for domeshaped ones. Results from the EPG experiment with 20 speakers show that (1.) speakers with a flat palate exhibit a very low articulatory variability whereas speakers with a domeshaped palate vary, (2.) there is less articulatory variability if there is lots of linguo-palatal contact and (3.) there is no relationship between the amount of lateral linguo-palatal contact and palate shape. The results suggest that there is a relationship between token-to-token variability and palate shape, however, it is not that the two parameters correlate, but that speakers with a flat palate always have a low variability because of constraints of the variability range of the acoustic output whereas speakers with a domeshaped palate may choose the degree of variability. Since linguo-palatal contact and variability correlate it is assumed that linguo-palatal contact is a means for reducing the articulatory variability.
Mechanisms of contrasting korean velar stops : A catalogue of acoustic and articulatory parameters
(2003)
The Korean stop system exhibits a three-way distinction in velar stops among /g/, /k'/ and /kh/. If the differentiation is regarded as being based on voicing, such a system is rather unusual because even a two-way distinction between a voiced and a voicless unaspirated velar stop gets easily lost in the languages of the world especially in the case of velar stops. One possibility for maintainig this distinction is that supralaryngeal characteristics like articulators' velocity, duration of surrounding vowels or stop closure duration are involved. The aim of the present study is to set up a catalogue of parameters which are involved in the distinction of Korean velar stops in intervocalic position.
Two Korean speakers have been recorded via Electromagnetic Articulography. The word material consisted of VCV-sequences where V is one of the three vowels /a/, /i/ or /u/ and C one of the Korean velars /g/, /k'/ or /kh/. Articulatory and acoustic signals have been analysed It turned out that the distinction is only partly built on laryngeal parameters and that supralaryngeal characteristics differ for the three stops. Another result is that the voicing contrast is not a matter of one parameter, but there is always a set of parameters involved. Furthermore, speakers seem to have a certain freedom in the choice of these parameters.
Seit mehr als 60 Jahren dominiert in der historisch-phonologischen Umlaut-Landschaft EIN Aufsatz, eine vierseitige Skizze des althochdeutschen Umlauts von W. Freeman Twaddell. Keller (1978: 160) nennt diese Theorie 'one of the finest achievements of American linguists'. Ähnliche Lobsprüche findet man mehrmals in der Literatur und der Artikel bleibt bis heute noch DER Eckpfeiler der Umlaut-Debatte (s. Krygier 1997, Schulte 1998).
In den letzten paar Jahren haben wir mit einigen Kollegen – Anthony Buccini, Garry Davis, David Fertig, Dave Holsinger, Robert Howell, Regina Smith – einen neuen Ansatz entwickelt, die wir "ingenerate Umlaut" nennen. "Ingenerate" heißt hier ungefähr 'vorprogrammiert, inhärent, angeboren' und deutet darauf hin, daß wir die Wurzeln vom Umlaut in der Phonetik – noch genauer: in der Koartikulation – suchen. Auch meinen wir, die allmähliche Entfaltung des Prozesses in den "Ausnahmen" zum Umlaut sehen zu können, mit anderen Worten genau in den umlautlosen Formen, die in der Twaddellschen Tradition als willkürliche Ergebnisse der Analogie gesehen werden müssen.
Ausgehend von den Zielen des Ausspracheunterrichts wird dargestellt, dass das Verstehen und Verstandenwerden in der Fremdsprache Deutsch gut entwickelte Aussprachefertigkeiten voraussetzt. Analysen an Deutschlernenden haben gezeigt, wo die Hauptschwierigkeiten liegen, diese werden im Beitrag aufgelistet. Um den oft schwierigen Lernprozess zu unterstützen und Lernende zu motivieren und zu sensibilisieren, können im Unterricht spielerische Übungen zu verschiedenen phonetischen Themen eingesetzt werden. Im Beitrag werden die Besonderheiten spielerisch angelegter, d.h. mit Spielelementen versehener Übungen erläutert und Beispiele für solche Übungen gegeben.
Many teachers of German as a second language make some statements regarding this language that mix concepts from three distinct fields: Orthography (letters), Phonetics (phones or speech sounds) and Phonology (phonemes). In this paper I attempt to shed some light on these concepts and fields. I also provide examples of such statements and make comments on them.
Das Wogeo ist eine austronesische Sprache, die von etwa 1500 Menschen auf den Inseln Vokeo […] und Koil […] gesprochen wird. Da es sich beim Wogeo um eine bislang größtenteils unbeschriebene Sprache handelt, sind zum Verständnis der Ausführungen im Hauptteil dieser Arbeit sowie zur allgemeinen Orientierung einige einleitende Erklärungen nötig. Diese sind von unterschiedlicher Art: Zunächst wird die natürliche Umgebung der beiden Inseln, auf denen die Sprache gesprochen wird, kurz dargestellt. Dann werden die für das Verständnis der soziolinguistischen Lage der Sprache relevanten Aspekte beleuchtet. Zur Orientierung folgt ein kurzer Abriß der Position des Wogeo innerhalb der austronesischen Sprachfamilie. Schließlich leitet eine Darstellung des bisherigen Forschungsstandes über das Wogeo zum Hauptteil der Arbeit über.
The contribution of von Kempelen’s “Mechanism of Speech” to the ‘phonetic sciences‘ will be analyzed with respect to his theoretical reasoning on speech and speech production on the one hand and on the other in connection with his practical insights during his struggle in constructing a speaking machine. Whereas in his theoretical considerations von Kempelen’s view is focussed on the natural functioning of the speech organs – cf. his membraneous glottis model – in constructing his speaking machine he clearly orientates himself towards the auditory result – cf. the bag pipe model for the sound generator used for the speaking machine instead. Concerning vowel production his theoretical description remains questionable, but his practical insight that vowels and speech sounds in general are only perceived correctly in connection with their surrounding sounds – i.e. the discovery of coarticulation – is clearly a milestone in the development of the phonetic sciences: He therefore dispenses with the Kratzenstein tubes, although they might have been based on more thorough acoustic modelling. Finally, von Kempelen’s model of speech production will be discussed in relation to the discussion of the acoustic nature of vowels afterwards [Willis and Wheatstone as well as von Helmholtz and Hermann in the 19th century and Stumpf, Chiba & Kajiyama as well as Fant and Ungeheuer in the 20th century].
This study investigates supralaryngeal mechanisms of the two way voicing contrast among German velar stops and the three way contrast among Korean velar stops, both in intervocalic position. Articulatory data won via electromagnetic articulography of three Korean speakers and acoustic recordings of three Korean and three German speakers are analysed. It was found that in both languages the voicing contrast is created by more than one mechanism. However, one can say that for Korean velar stops in intervocalic position stop closure duration is the most important parameter. For German it is closure voicing. The results support the phonological description proposed by Kohler (1984).
The study investigates the contribution of tactile and auditory feedback in the adaptation of /s/ towards a palatal prosthesis. Five speakers were recorded via electromagnetic articulography, at first without the prosthesis, then with the prosthesis and auditory feedback masked, and finally with the prosthesis and auditory feedback available. Tongue position, jaw position and acoustic centre of gravity of productions of the sound were measured. The results show that the initial adaptation attempts without auditory feedback are dependent on the prosthesis type and directed towards reaching the original tongue palate contact pattern. Speakers with a prosthesis which retracted the alveolar ridge retracted the tongue. Speakers with a prosthesis which did not change the place of the alveolar ridge did not retract the tongue. All speakers lowered the jaw. In a second adaptation step with auditory feedback available speakers reorganised tongue and jaw movements in order to produce more subtle acoustic characteristics of the sound such as the high amplitude noise which is typical for sibilants.
Articulatory token-to-token variability not only depends on linguistic aspects like the phoneme inventory of a given language but also on speaker specific morphological and motor constraints. As has been noted previously (Perkell (1997), Mooshammer et al. (2004)) , speakers with coronally high "domeshaped" palates exhibit more articulatory variability than speakers with coronally low "flat" palates. One explanation for that is based on perception oriented control by the speaker. The influence of articulatory variation on the cross sectional area and consequently on the acoustics should be greater for flat palates than for domeshaped ones. This should force speakers with flat palates to place their tongue very precisely whereas speakers with domeshaped palates might tolerate a greater variability. A second explanation could be a greater amount of lateral linguo-palatal contact for flat palates holding the tongue in position. In this study both hypotheses were tested.
As has been noted previously, speakers with coronally low "flat" palates exhibit less articulatory variability than speakers with coronally high "domeshaped" palates. This phenomenon is investigated by means of a tongue model and an EPG experiment. The results show that acoustic variability depends on the shape of the vocal tract. The same articulatory variability leads to more acoustic variability if the palate is flat than if it is domeshaped. Furthermore, speakers with domeshaped palates show more articulatory variability than speakers with flat palates. The results are explained by different control strategies by the speakers. Speakers with flat palates reduce their articulatory variability in order to keep their acoustic variability low.
A two-week perturbation EMA-experiment was carried out with palatal prostheses. Articulatory effort for five speakers was assessed by means of peak acceleration and jerk during the tongue tip gestures from /t/ towards /i, e, o, y, u/. After a period of no change speakers showed an increase in these values. Towards the end of the experiment the values decreased. The results are interpreted as three phases of carrying out changes in the internal model. At first, the complete production system is shifted in relation to the palatal change, afterwards speakers explore different production mechanisms which involves more articulatory effort. This second phase can be seen as a training phase where several articulatory strategies are explored. In the third phase speakers start to select an optimal movement strategy to produce the sounds so that the values decrease.
Temporal development of compensation strategies for perturbed palate shape in German /S/-production
(2006)
The palate shape of four speakers was changed by a prosthesis which either lowered the palate or retracted the alveoles. Subjects wore the prosthesis for two weeks and were recorded several times via EMA. Results of articulatory measurements show that speakers use different compensation methods at different stages of the adaptation. They lower the tongue immediately after the insertion of the prosthesis. Other compensation methods as for example lip protrusion are only acquired after longer practising periods. The results are interpreted as supporting the existence of different mappings between motor commands, vocal tract shape and auditory-acoustic target.
Several articulatory strategies are available during the production of /u/, all resulting in a similar acoustic output. /u/ has two main constrictions, at the velum and at the lips. A perturbation of either constriction can be compensated at the other one, e.g wider constriction at the velum by more lip protrusion, wider lip opening by more tongue retraction. This study investigates whether speakers use this relation under perturbation. Six speakers were provided with palatal prostheses which were worn for two weeks. Speakers were instructed to make a serious attempt to produce normal speech. Their speech was recorded via EMA and acoustics several times over the adaptation period. Formant values of /u/-productions were measured. Velar constriction width and lip protrusion were estimated. For four speakers a correlation between constriction width and lip protrusion was found. A negative correlation between lip protrusion and F1 or F2 could sometimes be observed, but no correlation occurred between constriction size and either of the formants. The results show that under perturbation speakers use motor equivalent strategies in order to adapt. The correlation between constriction size and lip protrusion is stronger than in studies investigating unperturbed speech. This could be because under perturbation speakers are inclined to try out several strategies in order to reach the acoustic target and the co-variability might thus be greater.
Two hypotheses have been proposed in order to account for velar softening, i.e., a process through which /k/ changes to an affricate. Whereas one hypothesis states that for the process to apply the velar stop has to be realized as an (alveolo) palatal stop (articulation-based hypothesis), the other claims that velar softening is triggered by acoustic similarity between the input and output segments (acoustic equivalence hypothesis). The present paper investigates the acoustic equivalence hypothesis by comparing several acoustic properties of /k/ in various vowel contexts with those of /ts , ts , tc / for three languages differing in stop burst aspiration, i.e., German, Polish and Catalan. Results suggest that the acoustic equivalence hypothesis could account for velar softening in aspirated velar stops but not in unaspirated velar stops. The results also provide an explanation as to why aspirated velar stops are prone to undergo softening more easily when followed by front vocalic segments than in other contexts and positions
This paper shows that several typologically unrelated languages share the tendency to avoid voiced sibilant affricates. This tendency is explained by appealing to the phonetic properties of the sounds, and in particular to their aerodynamic characteristics. On the basis of experimental evidence it is shown that conflicting air pressure requirements for maintaining voicing and frication are responsible for the avoidance of voiced affricates. In particular, the air pressure released from the stop phase of the affricate is too high to maintain voicing, which in consequence leads to a devoicing of the frication part.
A model is proposed that interprets a variety of connected speech processes as resulting from prosodic modulations at different tiers of functional speech motor control along the hypo-hyper dimension [10]. The general background of the model is given by the trichotomy of A-, B- and C-prosodic phenomena [15] that together constitute the acoustic makeup of any speech utterance (with regard to their respective time domains at the uttarance/phrase level, the syllabic level and the segmental level).
Bilabial stops undergoing Surface Palatalization (SP) were analyzed in an EMMA/EPG study. Articulatorily, the point of maximal palatal contact and the labial opening movement were analyzed. The acoustic analysis pertained to stop related timing and the point of the highest F2-value. Results show (i) that SP yields a higher F2 at vowel onset and a lengthened opening gesture and (ii) that morphemeinduced palatalizations are distinguished from word initial ones and sandhi-palatalizations articulatorily and acoustically by a shorter delay of palatal target position with respect to stop production; (iii) no differences are found between ‘repalatalized’ and plain segments in case of sandhi palatalization.
Reduction in natural speech
(2009)
Natural (conversational) speech, compared to cannonical speech, is earmarked by the tremendous amount of variation that often leads to a massive change in pronunciation. Despite many attempts to explain and theorize the variability in conversational speech, its unique characteristics have not played a significant role in linguistic modeling. One of the reasons for variation in natural speech lies in a tendency of speakers to reduce speech, which may drastically alter the phonetic shape of words. Despite the massive loss of information due to reduction, listeners are often able to understand conversational speech even in the presence of background noise. This dissertation investigates two reduction processes, namely regressive place assimilation across word boundaries, and massive reduction and provides novel data from the analyses of speech corpora combined with experimental results from perception studies to reach a better understanding of how humans handle natural speech. The successes and failures of two models dealing with data from natural speech are presented: The FUL-model (Featurally Underspecified Lexicon, Lahiri & Reetz, 2002), and X-MOD (an episodic model, Johnson, 1997). Based on different assumptions, both models make different predictions for the two types of reduction processes under investigation. This dissertation explores the nature and dynamics of these processes in speech production and discusses its consequences for speech perception. More specifically, data from analyses of running speech are presented investigating the amount of reduction that occurs in naturally spoken German. Concerning production, the corpus analysis of regressive place assimilation reveals that it is not an obligatory process. At the same time, there emerges a clear asymmetry: With only very few exceptions, only [coronal] segments undergo assimilation, [labial] and [dorsal] segments usually do not. Furthermore, there seem to be cases of complete neutralization where the underlying Place of Articulation feature has undergone complete assimilation to the Place of Articulation feature of the upcoming segment. Phonetic analyses further underpin these findings. Concerning deletions and massive reductions, the results clearly indicate that phonological rules in the classical generative tradition are not able to explain the reduction patterns attested in conversational speech. Overall, the analyses of deletion and massive reduction in natural speech did not exhibit clear-cut patterns. For a more in-depth examination of reduction factors, the case of final /t/ deletion is examined by means of a new corpus constructed for this purpose. The analysis of this corpus indicates that although phonological context plays an important role on the deletion of segments (i.e. /t/), this arises in the form of tendencies, not absolute conditions. This is true for other deletion processes, too. Concerning speech perception, a crucial part for both models under investigation (X-MOD and FUL) is how listeners handle reduced speech. Five experiments investigate the way reduced speech is perceived by human listeners. Results from two experiments show that regressive place assimilations can be treated as instances of complete neutralizations by German listeners. Concerning massively reduced words, the outcome of transcription and priming experiments suggest that such words are not acceptable candidates of the intended lexical items for listeners in the absence of their proper phrasal context. Overall, the abstractionist FUL-model is found to be superior in explaining the data. While at first sight, X-MOD deals with the production data more readily, FUL provides a better fit for the perception results. Another important finding concerns the role of phonology and phonetics in general. The results presented in this dissertation make a strong case for models, such as FUL, where phonology and phonetics operate at different levels of the mental lexicon, rather than being integrated into one. The findings suggest that phonetic variation is not part of the representation in the mental lexicon.
Arguing against Bhat’s (1974) claim that retroflexion cannot be correlated with retraction, the present article illustrates that retroflexes are always retracted, though retraction is not claimed to be a sufficient criterion for retroflexion. The cooccurrence of retraction with retroflexion is shown to make two further implications; first, that non-velarized retroflexes do not exist, and second, that secondary palatalization of retroflexes is phonetically impossible. The process of palatalization is shown to trigger a change in the primary place of articulation to non-retroflex. Phonologically, retraction has to be represented by the feature specification [+back] for all retroflex segments.
The present study poses the question on what phonetic and phonological grounds postalveolar fricatives in Polish can be analyzed as retroflex and whether postalveolar fricatives in other Slavic languages are retroflex as well. Velarization and incompatibility with front vowels are introduced as articulatory criteria for retroflexion, based on crosslinguistic data. According to these criteria, Polish and Russian have retroflex fricatives, whereas Bulgarian and Czech do not. In a phonological representation of these Slavic retroflexes, the necessity of perceptual features is shown. Lastly, it is illustrated that palatalization of retroflex fricatives both in Slavic languages and more generally causes a phonetic and phonological change to a non-retroflex sound.
On the basis of perceptual experiments we show that alveolo-palatal fricatives and palatalized post-alveolars are two separate sounds which are distinguished not only by Polish native speakers but also by German ones. This claim is partly attested by centre of gravity measurements of the two sibilants. In this paper we revise the claim made by Halle & Stevens [1] and Maddieson & Ladefoged [2] that the Polish alveolo-palatal fricatives [˛, ¸] are palatalized postalveolars [SJ, ZJ]. On the basis of perceptual experiments we show that alveolo-palatal fricatives and palatalized post-alveolars are two separate sounds which are distinguished not only by Polish native speakers but also by German ones. This claim is partly attested by centre of gravity measurements of the two sibilants.
In the following study we present the results of three acoustic experiments with native speakers of German and Polish which support implications (a) and (b). In our experiments we measured the friction phase after the /t d/ release before the onset of the following high front vocoid for four speakers of German and Polish. We found that the friction phase for /tj/ was significantly longer than that of /ti/, and that the friction phase of /t/ in the assibilation context is significantly longer than that of /d/.