• Treffer 4 von 37
Zurück zur Trefferliste

Learning to control an articulatory synthesizer by imitating real speech

  • The goal of our current project is to build a system that can learn to imitate a version of a spoken utterance using an articulatory speech synthesiser. The approach is informed and inspired by knowledge of early infant speech development. Thus we expect our system to reproduce and exploit the utility of infant behaviours such as listening, vocal play, babbling and word imitation. We expect our system to develop a relationship between the sound-making capabilities of its vocal tract and the phonetic/phonological structure of imitated utterances. At the heart of our approach is the learning of an inverse model that relates acoustic and motor representations of speech. The acoustic to auditory mappings uses an auditory filter bank and a self-organizing phase of learning. The inverse model from auditory to vocal tract control parameters is estimated using a babbling phase, in which the vocal tract is essentially driven in a random manner, much like the babbling phase of speech acquisition in infants. The complete system can be used to imitate simple utterances through a direct mapping from sound to control parameters. Our initial results show that this procedure works well for sounds generated by its own voice. Further work is needed to build a phonological control level and achieve better performance with real speech.

Volltext Dateien herunterladen

Metadaten exportieren

Weitere Dienste

Teilen auf Twitter Suche bei Google Scholar
Metadaten
Verfasserangaben:Ian S. Howard, Mark A. Huckvale
URN:urn:nbn:de:hebis:30:3-309262
URL:http://www.zas.gwz-berlin.de/191.html?&L=1%2527%252band%252bchar%28124%29%25252Buser%25252Bchar%28124%29%25253D0%252band%252b%2527%2527%25253D%2527
ISSN:1435-9588
ISSN:0947-7055
Titel des übergeordneten Werkes (Englisch):Speech production and perception : experimental analyses and models / editors Susanne Fuchs, Pascal Perrier and Bernd Pompino-Marschall, Zentrum für Allgemeine Sprachwissenschaft, Sprachtypologie und Universalienforschung (Berlin): ZAS papers in linguistics ; Vol. 40 (2005)
Verlag:Zentrum für Allgemeine Sprachwissenschaft, Sprachtypologie und Universalienforschung
Verlagsort:Berlin
Dokumentart:Teil eines Buches (Kapitel)
Sprache:Englisch
Datum der Veröffentlichung (online):14.11.2013
Jahr der Erstveröffentlichung:2005
Veröffentlichende Institution:Universitätsbibliothek Johann Christian Senckenberg
Datum der Freischaltung:14.11.2013
GND-Schlagwort:Computerlinguistik; Syntaktische Analyse; Spracherwerb; Computersimulation
Jahrgang:40
Seitenzahl:16
Erste Seite:63
Letzte Seite:78
HeBIS-PPN:381257932
DDC-Klassifikation:4 Sprache / 41 Linguistik / 410 Linguistik
Sammlungen:Linguistik
Linguistik-Klassifikation:Linguistik-Klassifikation: Spracherwerb / Language acquisition
Linguistik-Klassifikation: Computerlinguistik / Computational linguistics
Zeitschriften / Jahresberichte:ZAS papers in linguistics : ZASPiL / ZASPiL 40 = Speech production and perception : Experimental analyses and models
Übergeordnete Einheit:urn:nbn:de:hebis:30:3-306823
Lizenz (Deutsch):License LogoDeutsches Urheberrecht