Abstract
Vocal tract shape estimation using three noninvasive transducers
The Journal of the Acoustical Society of America, Vol.103(5_Supplement), pp.3056-3056
05/01/1998
DOI: 10.1121/1.422671
Abstract
Sentence-level speech simulation requires a description of vocal tract shape parameters as a function of time. Vowel-like configurations can often be derived from a microphone signal; however, the influence of consonants may be more easily detected with the use of additional signals. The electroglottograph (EGG) and an articulatory transducer [McGarr and Lofqvist, J. Acoust. Soc. Am. 72, 34–42 (1982)] are employed to increase the accuracy of vocal tract shape estimation. The microphone signal is analyzed on a frame-by-frame basis for spectral features, energy, and zero-crossing rate. The EGG provides information regarding voicing onset and offset, and the articulatory signal is used to indicate closure and lip rounding. These data are combined to select an appropriate area function for each frame of the sentence. The resulting temporal map of area functions is interpolated and sent to a wave reflection simulator to reconstruct the speech. [Work supported by NIH Grant No. DCO2532.]
Details
- Title: Subtitle
- Vocal tract shape estimation using three noninvasive transducers
- Creators
- Russel Long - Denver Center for the Performing ArtsBrad Story - Denver Center for the Performing ArtsIngo Titze - Denver Center for the Performing Arts
- Resource Type
- Abstract
- Publication Details
- The Journal of the Acoustical Society of America, Vol.103(5_Supplement), pp.3056-3056
- DOI
- 10.1121/1.422671
- ISSN
- 0001-4966
- eISSN
- 1520-8524
- Language
- English
- Date published
- 05/01/1998
- Academic Unit
- School of Music; Communication Sciences and Disorders
- Record Identifier
- 9984719855502771
Metrics
1 Record Views