Journal article
Multichannel speech reinforcement based on binaural unmasking
Signal processing, Vol.139, pp.165-172
10/2017
DOI: 10.1016/j.sigpro.2017.04.021
Abstract
•Multichannel speech reinforcement exploiting DoA information is proposed.•An empirical evidence of the binaural unmasking for monaural speech is provided.•Proposed reinforcement restores perceived loudness considering binaural unmasking.•The performance of the algorithm is verified through subjective listening tests.
Speech reinforcement or near-end listening enhancement is a technique that modifies the far-end signal to mitigate the effect of the near-end noise, usually based on the power spectra of the far-end signal and the near-end noise. Psychoacoustic experiments have shown that the location of a noise source with respect to that of a signal source affects the amount of masking. Since conventional speech reinforcement methods obtain spectral gain based only on the power spectra, this psychoacoustic phenomenon called binaural unmasking has not been considered in those approaches. In this paper, we propose a novel speech reinforcement algorithm that modifies the far-end speech signal based on both the power spectrum and the direction-of-arrival (DoA) of the noise. Specifically, we have computed the equivalent frontal noise level from the observed noise level and the estimated DoA, and used it to compute spectral gains as in conventional partial loudness restoration-based speech reinforcement. Experimental results showed that the proposed method outperformed the conventional methods based on partial loudness restoration and speech intelligibility index (SII) optimization in terms of the overall perceived quality through subjective listening tests.
Details
- Title: Subtitle
- Multichannel speech reinforcement based on binaural unmasking
- Creators
- Junhyeong Pak - School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju 61005, KoreaInyong Choi - Department of Communication Sciences and Disorders, University of Iowa, Iowa City, IA 52242, USAYu Gwang Jin - Corporate R&D Center, SK Telecom Co., Ltd., Seoul 04539, KoreaJong Won Shin - School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju 61005, Korea
- Resource Type
- Journal article
- Publication Details
- Signal processing, Vol.139, pp.165-172
- DOI
- 10.1016/j.sigpro.2017.04.021
- ISSN
- 0165-1684
- eISSN
- 1872-7557
- Publisher
- Elsevier B.V
- Language
- English
- Date published
- 10/2017
- Academic Unit
- Communication Sciences and Disorders; Iowa Neuroscience Institute; Otolaryngology
- Record Identifier
- 9984002583802771
Metrics
34 Record Views