Multichannel speech reinforcement based on binaural unmasking

Junhyeong Pak; Inyong Choi; Yu Gwang Jin; Jong Won Shin

doi:10.1016/j.sigpro.2017.04.021

Back

Multichannel speech reinforcement based on binaural unmasking

Journal article

Peer reviewed

Multichannel speech reinforcement based on binaural unmasking

Junhyeong Pak, Inyong Choi, Yu Gwang Jin and Jong Won Shin

Signal processing, Vol.139, pp.165-172

10/2017

DOI: 10.1016/j.sigpro.2017.04.021

View Online

Abstract

•Multichannel speech reinforcement exploiting DoA information is proposed.•An empirical evidence of the binaural unmasking for monaural speech is provided.•Proposed reinforcement restores perceived loudness considering binaural unmasking.•The performance of the algorithm is verified through subjective listening tests. Speech reinforcement or near-end listening enhancement is a technique that modifies the far-end signal to mitigate the effect of the near-end noise, usually based on the power spectra of the far-end signal and the near-end noise. Psychoacoustic experiments have shown that the location of a noise source with respect to that of a signal source affects the amount of masking. Since conventional speech reinforcement methods obtain spectral gain based only on the power spectra, this psychoacoustic phenomenon called binaural unmasking has not been considered in those approaches. In this paper, we propose a novel speech reinforcement algorithm that modifies the far-end speech signal based on both the power spectrum and the direction-of-arrival (DoA) of the noise. Specifically, we have computed the equivalent frontal noise level from the observed noise level and the estimated DoA, and used it to compute spectral gains as in conventional partial loudness restoration-based speech reinforcement. Experimental results showed that the proposed method outperformed the conventional methods based on partial loudness restoration and speech intelligibility index (SII) optimization in terms of the overall perceived quality through subjective listening tests.

Loudness perception

Binaural unmasking

Direction-of-arrival

Speech reinforcement

Partial masking effect

Details

Title: Subtitle: Multichannel speech reinforcement based on binaural unmasking
Creators: Junhyeong Pak - School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju 61005, Korea
Inyong Choi - Department of Communication Sciences and Disorders, University of Iowa, Iowa City, IA 52242, USA
Yu Gwang Jin - Corporate R&D Center, SK Telecom Co., Ltd., Seoul 04539, Korea
Jong Won Shin - School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju 61005, Korea
Resource Type: Journal article
Publication Details: Signal processing, Vol.139, pp.165-172
DOI: 10.1016/j.sigpro.2017.04.021
ISSN: 0165-1684
eISSN: 1872-7557
Publisher: Elsevier B.V
Language: English
Date published: 10/2017
Academic Unit: Communication Sciences and Disorders; Iowa Neuroscience Institute; Otolaryngology
Record Identifier: 9984002583802771

Metrics

34 Record Views

2 Times Cited - Web of Science