Journal article
Cepstral Peak Sensitivity: A Theoretic Analysis and Comparison of Several Implementations
Journal of voice, Vol.29(6), pp.670-681
11/01/2015
DOI: 10.1016/j.jvoice.2014.11.005
PMCID: PMC4630216
PMID: 25944288
Abstract
Objective. The aim of this study was to develop a theoretic analysis of the cepstral peak (CP), to compare several CP software programs, and to propose methods for reducing variability in CP estimation.
Study Design. Descriptive, experimental study.
Methods. The theoretic CP value of a pulse train was derived and compared with estimates computed for pulse train WAV files using available CP software programs: (1) Hillenbrand's CP prominence (CPP) software (Western Michigan University, Kalamazoo, MI), (2) KayPENTAX (Montvale, NJ) Multi-Speech implementation of CPP, and (3) a MATLAB (The Mathworks, Natick, MA, version R2014a) implementation using cepstral interpolation. The CP variation was also investigated for synthetic breathy vowels.
Results. For pulse trains with period T samples, the theoretic CP is 1/2 + epsilon/T, vertical bar epsilon vertical bar < 0.1 for all pulse trains (epsilon = 0 for integer T). For fundamental frequencies between 70 and 230 Hz, the CP mean +/- standard deviation was 0.496 +/- 0.002 using cepstral interpolation and 0.29 +/- 0.03 using Hillenbrand's software, whereas CPP was 35.0 +/- 3.8 dB using Hillenbrand's software and 20.5 +/- 2.7 dB using KayPENTAX's software. The CP and CPP versus signal-to-noise ratio for synthetic breathy vowels were fit to a logistic model for the Hillenbrand (R-2 = 0.92) and KayPENTAX (R-2 = 0.82) estimators as well as an ideal estimator (R-2 = 0.98), which used a period-synchronous analysis.
Conclusions. The findings indicate that several variables unrelated to the signal itself impact CP values, with some factors introducing large variability in CP values that would otherwise be attributed to the signal (eg, voice quality). Variability may be reduced by using a period-synchronous analysis with Hann windows.
Details
- Title: Subtitle
- Cepstral Peak Sensitivity: A Theoretic Analysis and Comparison of Several Implementations
- Creators
- Mark D. Skowronski - Michigan State UniversityRahul Shrivastav - Michigan State UniversityEric J. Hunter - Michigan State University
- Resource Type
- Journal article
- Publication Details
- Journal of voice, Vol.29(6), pp.670-681
- Publisher
- Elsevier
- DOI
- 10.1016/j.jvoice.2014.11.005
- PMID
- 25944288
- PMCID
- PMC4630216
- ISSN
- 0892-1997
- eISSN
- 1873-4588
- Number of pages
- 12
- Grant note
- R01DC012315 / NATIONAL INSTITUTE ON DEAFNESS AND OTHER COMMUNICATION DISORDERS; United States Department of Health & Human Services; National Institutes of Health (NIH) - USA; NIH National Institute on Deafness & Other Communication Disorders (NIDCD) 2R01DC009029 / National Institute for Deafness and other Communication Disorders; United States Department of Health & Human Services; National Institutes of Health (NIH) - USA; NIH National Institute on Deafness & Other Communication Disorders (NIDCD)
- Language
- English
- Date published
- 11/01/2015
- Academic Unit
- Communication Sciences and Disorders
- Record Identifier
- 9984446408002771
Metrics
13 Record Views