Quantifying the Speaking Voice: Generating a Speaker Code as a Means of Speaker Identification Using a Simple Code-Matching Technique

Peter S Popolo; Richard W Sanders; Ingo R Titze

Back

Conference proceeding

Quantifying the Speaking Voice: Generating a Speaker Code as a Means of Speaker Identification Using a Simple Code-Matching Technique

Peter S Popolo, Richard W Sanders and Ingo R Titze

Audio Engineering Society - 123rd Audio Engineering Society Convention 2007, Vol.1, pp.456-468

2007

View Online

Abstract

This paper looks at a methodology of quantifying the speaking voice, by which temporal and spectral features of the voice are extracted and processed to create a numeric code that identifies speakers, so those speakers can be searched in a database much like fingerprints. The parameters studied include: (1) average fundamental frequency (F0) of the speech signal over time, (2) standard deviation of the F0, (3) the slope and (4) sign of the FO contour, (5) the average energy, (6) the standard deviation of the energy, (7) the spectral energy contained from 50 Hz to 1,000 Hz, (8) the spectral energy from 1,000 Hz to 5,000 Hz, (9) the Alpha Ratio, (10) the average speaking rate, and (11) the total duration of the spoken sentence.

Details

Title: Subtitle: Quantifying the Speaking Voice: Generating a Speaker Code as a Means of Speaker Identification Using a Simple Code-Matching Technique
Creators: Peter S Popolo
Richard W Sanders
Ingo R Titze
Resource Type: Conference proceeding
Publication Details: Audio Engineering Society - 123rd Audio Engineering Society Convention 2007, Vol.1, pp.456-468
Publisher: Audio Engineering Society
Language: English
Date published: 2007
Academic Unit: Communication Sciences and Disorders; School of Music
Record Identifier: 9984719566402771

Metrics

1 Record Views