Learning to control an articulatory synthesizer by imitating real speech

  • The goal of our current project is to build a system that can learn to imitate a version of a spoken utterance using an articulatory speech synthesiser. The approach is informed and inspired by knowledge of early infant speech development. Thus we expect our system to reproduce and exploit the utility of infant behaviours such as listening, vocal play, babbling and word imitation. We expect our system to develop a relationship between the sound-making capabilities of its vocal tract and the phonetic/phonological structure of imitated utterances. At the heart of our approach is the learning of an inverse model that relates acoustic and motor representations of speech. The acoustic to auditory mappings uses an auditory filter bank and a self-organizing phase of learning. The inverse model from auditory to vocal tract control parameters is estimated using a babbling phase, in which the vocal tract is essentially driven in a random manner, much like the babbling phase of speech acquisition in infants. The complete system can be used to imitate simple utterances through a direct mapping from sound to control parameters. Our initial results show that this procedure works well for sounds generated by its own voice. Further work is needed to build a phonological control level and achieve better performance with real speech.

Download full text files

Export metadata

Additional Services

Share in Twitter Search Google Scholar
Metadaten
Author:Ian S. Howard, Mark A. Huckvale
URN:urn:nbn:de:hebis:30:3-309262
URL:http://www.zas.gwz-berlin.de/191.html?&L=1%2527%252band%252bchar%28124%29%25252Buser%25252Bchar%28124%29%25253D0%252band%252b%2527%2527%25253D%2527
ISSN:1435-9588
ISSN:0947-7055
Parent Title (English):Speech production and perception : experimental analyses and models / editors Susanne Fuchs, Pascal Perrier and Bernd Pompino-Marschall, Zentrum für Allgemeine Sprachwissenschaft, Sprachtypologie und Universalienforschung (Berlin): ZAS papers in linguistics ; Vol. 40 (2005)
Publisher:Zentrum für Allgemeine Sprachwissenschaft, Sprachtypologie und Universalienforschung
Place of publication:Berlin
Document Type:Part of a Book
Language:English
Date of Publication (online):2013/11/14
Year of first Publication:2005
Publishing Institution:Universitätsbibliothek Johann Christian Senckenberg
Release Date:2013/11/14
GND Keyword:Computerlinguistik; Syntaktische Analyse; Spracherwerb; Computersimulation
Volume:40
Page Number:16
First Page:63
Last Page:78
HeBIS-PPN:381257932
Dewey Decimal Classification:4 Sprache / 41 Linguistik / 410 Linguistik
Sammlungen:Linguistik
Linguistik-Klassifikation:Linguistik-Klassifikation: Spracherwerb / Language acquisition
Linguistik-Klassifikation: Computerlinguistik / Computational linguistics
Zeitschriften / Jahresberichte:ZAS papers in linguistics : ZASPiL / ZASPiL 40 = Speech production and perception : Experimental analyses and models
:urn:nbn:de:hebis:30:3-306823
Licence (German):License LogoDeutsches Urheberrecht