Journal article
A simplified model for the simulation and transformation of speech
Engineering applications of artificial intelligence, Vol.10(6), pp.593-601
12/01/1997
DOI: 10.1016/S0952-1976(97)00041-9
Abstract
This paper explores a model that reduces speech production to the specification of four time-varying parameters: F1 and F2, voice fundamental frequency (F0), and a relative amplitude of the voice. The trajectory of the first two formants, F1 and F2, is treated as a series of coordinate pairs that are mapped from the F1F2 plane into a two-dimensional plane of coefficients. These coefficients are multipliers of two empirically-based orthogonal basis vectors which, when added to a neutral vowel area function, will produce a new area function with the desired locations of F1 and F2. Thus, area functions and voice parameters extracted at appropriate time intervals can be fed into a speech simulation model to recreate the original speech. A transformation of the speech can also be imposed by manipulating the area function and voice characteristics prior to the recreation of speech by simulation. The model has initially been developed for vowel-like speech utterances, but the effect of consonants on the F1F2 trajectory is also briefly addressed.
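The area-function step described in the abstract can be illustrated with a minimal sketch. It assumes the simplest reading of the text: the new area function is the neutral vowel area function plus a weighted sum of the two basis vectors. The function names, the four-section tract, and every numeric value are hypothetical placeholders, not data from the paper, and the empirical mapping from the F1F2 plane to the coefficient plane is not reproduced here.

```python
# Hypothetical illustration only: the vectors below are placeholders,
# not the empirically derived basis vectors from the paper.

def dot(u, v):
    """Inner product, used to check that the two basis vectors are orthogonal."""
    return sum(x * y for x, y in zip(u, v))

def area_function(neutral, basis1, basis2, c1, c2):
    """Return neutral + c1*basis1 + c2*basis2, section by section."""
    if not (len(neutral) == len(basis1) == len(basis2)):
        raise ValueError("all vectors must have the same length")
    return [a + c1 * b1 + c2 * b2
            for a, b1, b2 in zip(neutral, basis1, basis2)]

# Placeholder four-section vocal tract (arbitrary units).
NEUTRAL = [3.0, 3.0, 3.0, 3.0]
BASIS1 = [0.5, 0.5, -0.5, -0.5]
BASIS2 = [0.5, -0.5, -0.5, 0.5]

# A coefficient trajectory: one (c1, c2) pair per time frame. In the paper
# these pairs come from mapping measured (F1, F2) points; here they are made up.
trajectory = [(0.0, 0.0), (1.0, 0.0), (1.0, -2.0)]

# One area function per time frame, ready to pass to a speech simulator.
frames = [area_function(NEUTRAL, BASIS1, BASIS2, c1, c2)
          for c1, c2 in trajectory]
```

With both coefficients at zero the sketch returns the neutral area function unchanged, which matches the abstract's description of the coefficients as offsets from a neutral vowel.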
Details
- Title
- A simplified model for the simulation and transformation of speech
- Creators
- Brad H. Story - Denver Center for the Performing Arts; Ingo R. Titze - Denver Center for the Performing Arts; Darrell Wong - Denver Center for the Performing Arts
- Resource Type
- Journal article
- Publication Details
- Engineering applications of artificial intelligence, Vol.10(6), pp.593-601
- Publisher
- Elsevier Ltd
- DOI
- 10.1016/S0952-1976(97)00041-9
- ISSN
- 0952-1976
- eISSN
- 1873-6769
- Number of pages
- 9
- Language
- English
- Date published
- 12/01/1997
- Academic Unit
- School of Music; Communication Sciences and Disorders
- Record Identifier
- 9984719750702771