Conference proceeding
A simple model for detection of rare sound events: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6, pp.1344-1348
Interspeech
01/01/2018
DOI: 10.21437/Interspeech.2018-2338
Abstract
We propose a simple recurrent model for detecting rare sound events when the time boundaries of events are available for training. Our model optimizes the combination of an utterance-level loss, which classifies whether an event occurs in an utterance, and a frame-level loss, which classifies whether each frame corresponds to the event when it does occur. The two losses make use of a shared vectorial representation of the event and are connected by an attention mechanism. We demonstrate our model on Task 2 of the DCASE 2017 challenge and achieve competitive performance.
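As a rough illustration of the two-loss idea described in the abstract, the Python sketch below combines an utterance-level and a frame-level binary cross-entropy through a shared event vector and attention over frames. It is not the authors' implementation. The dot-product scoring, the softmax attention, the weight `alpha`, and all function names are assumptions made for illustration, and the per-frame vectors stand in for the outputs of a recurrent network.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def bce(p, y, eps=1e-12):
    """Binary cross-entropy for a single probability p and label y."""
    p = min(max(p, eps), 1.0 - eps)
    return -(y * math.log(p) + (1 - y) * math.log(1 - p))


def joint_loss(frames, event_vec, utt_label, frame_labels, alpha=1.0):
    """Hypothetical joint loss: utterance-level + alpha * frame-level.

    frames       : list of per-frame feature vectors (assumed RNN outputs)
    event_vec    : event representation shared by both losses (assumption)
    utt_label    : 1 if the event occurs in the utterance, else 0
    frame_labels : per-frame 0/1 labels, used only when utt_label == 1
    """
    # Score each frame against the shared event vector.
    scores = [dot(h, event_vec) for h in frames]

    # Attention weights: numerically stable softmax over frame scores.
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    attn = [e / total for e in exps]

    # Attention-pooled utterance vector, then utterance-level prediction.
    dim = len(event_vec)
    pooled = [sum(a * h[d] for a, h in zip(attn, frames)) for d in range(dim)]
    utt_loss = bce(sigmoid(dot(pooled, event_vec)), utt_label)

    # Frame-level loss applies only when the event does occur.
    frame_loss = 0.0
    if utt_label == 1:
        per_frame = [bce(sigmoid(s), y) for s, y in zip(scores, frame_labels)]
        frame_loss = sum(per_frame) / len(per_frame)

    return utt_loss + alpha * frame_loss, attn
```

For example, `joint_loss([[0.0, 0.0], [2.0, 0.0], [0.0, 0.0]], [1.0, 0.0], 1, [0, 1, 0])` returns the combined loss together with attention weights that concentrate on the middle frame. For an utterance labeled 0, the frame-level term is skipped, so `alpha` has no effect.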
Details
- Title: Subtitle
- A simple model for detection of rare sound events: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES
- Creators
- Weiran Wang - Amazon Alexa, 101 Main St, Cambridge, MA 02142 USA
- Chieh-chi Kao - Amazon Alexa, 101 Main St, Cambridge, MA 02142 USA
- Chao Wang - Amazon Alexa, 101 Main St, Cambridge, MA 02142 USA
- Resource Type
- Conference proceeding
- Publication Details
- 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6, pp.1344-1348
- Publisher
- Isca-Int Speech Communication Assoc
- Series
- Interspeech
- DOI
- 10.21437/Interspeech.2018-2338
- ISSN
- 2308-457X
- Number of pages
- 5
- Language
- English
- Date published
- 01/01/2018
- Academic Unit
- Computer Science
- Record Identifier
- 9984696571102771