A simplified model for the simulation and transformation of speech

Brad H. Story; Ingo R. Titze; Darrell Wong

doi:10.1016/S0952-1976(97)00041-9

Back

A simplified model for the simulation and transformation of speech

Journal article

Peer reviewed

A simplified model for the simulation and transformation of speech

Brad H. Story, Ingo R. Titze and Darrell Wong

Engineering applications of artificial intelligence, Vol.10(6), pp.593-601

12/01/1997

DOI: 10.1016/S0952-1976(97)00041-9

View Online

Abstract

This paper explores a model that reduces speech production to the specification of four time-varying parameters; F1 and F2, voice fundamental frequency (F 0), and a relative amplitude of the voice. The trajectory of the first two formants, F1 and F2, is treated as a series of coordinate pairs that are mapped from the F1F2 plane into a two-dimensional plane of coefficients. These coefficients are multipliers of two empirically-based orthogonal basis vectors which, when added to a neutral vowel area function, will produce a new area function with the desired locations of F1 and F2. Thus, area functions and voice parameters extracted at appropriate time intervals can be fed into a speech simulation model to recreate the original speech. A transformation of the speech can also be imposed by manipulating the area function and voice characteristics prior to the recreation of speech by simulation. The model has initially been developed for vowel-like speech utterances, but the effect of consonants on the F1F2 trajectory is also briefly addressed.

Speech production

speech transformation

Details

Title: Subtitle: A simplified model for the simulation and transformation of speech
Creators: Brad H. Story - Denver Center for the Performing Arts
Ingo R. Titze - Denver Center for the Performing Arts
Darrell Wong - Denver Center for the Performing Arts
Resource Type: Journal article
Publication Details: Engineering applications of artificial intelligence, Vol.10(6), pp.593-601
Publisher: Elsevier Ltd
DOI: 10.1016/S0952-1976(97)00041-9
ISSN: 0952-1976
eISSN: 1873-6769
Number of pages: 9
Language: English
Date published: 12/01/1997
Academic Unit: School of Music; Communication Sciences and Disorders
Record Identifier: 9984719750702771

Metrics

1 Record Views