A simple model for detection of rare sound events: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES

Weiran Wang; Chieh-chi Kao; Chao Wang

doi:10.21437/Interspeech.2018-2338

Back

Conference proceeding

A simple model for detection of rare sound events: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES

Weiran Wang, Chieh-chi Kao and Chao Wang

19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6, pp.1344-1348

Interspeech

01/01/2018

DOI: 10.21437/Interspeech.2018-2338

View Online

Abstract

We propose a simple recurrent model for detecting rare sound events, when the time boundaries of events are available for training. Our model optimizes the combination of an utterance level loss, which classifies whether an event occurs in an utterance, and a frame-level loss, which classifies whether each frame corresponds to the event when it does occur. The two losses make use of a shared vectorial representation the event, and are connected by an attention mechanism. We demonstrate our model on Task 2 of the DCASE 2017 challenge, and achieve competitive performance.

Computer Science

Engineering

Technology

Computer Science, Artificial Intelligence

Computer Science, Theory & Methods

Engineering, Electrical & Electronic

Science & Technology

Details

Title: Subtitle: A simple model for detection of rare sound events: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES
Creators: Weiran Wang - Amazon Alexa, 101 Main St, Cambridge, MA 02142 USA
Chieh-chi Kao - Amazon Alexa, 101 Main St, Cambridge, MA 02142 USA
Chao Wang - Amazon Alexa, 101 Main St, Cambridge, MA 02142 USA
Resource Type: Conference proceeding
Publication Details: 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6, pp.1344-1348
Publisher: Isca-Int Speech Communication Assoc
Series: Interspeech
DOI: 10.21437/Interspeech.2018-2338
ISSN: 2308-457X
Number of pages: 5
Language: English
Date published: 01/01/2018
Academic Unit: Computer Science
Record Identifier: 9984696571102771

Metrics

1 Record Views

12 Times Cited - Web of Science