Conference proceeding
Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm
INTERSPEECH 2024, pp.282-286
Interspeech
01/01/2024
DOI: 10.21437/Interspeech.2024-1349
Abstract
We propose a GPU/TPU-friendly implementation for contextual biasing based on the Knuth-Morris-Pratt (KMP) pattern matching algorithm. Our algorithms simulate classical search-based biasing approaches which are often implemented in the weighted finite state transducer (WFST) framework, with careful considerations on memory footprint and efficiency by vectorization. We design scoring mechanisms such that, during beam search, a token extension receives a bonus if it extends matching into a biasing phrase, and receives a penalty to cancel previously received bonus otherwise. Our methods could be incorporated in either the shallow fusion or on-the-fly rescoring manner, to trade off accuracy with efficiency. On a large-scale voice search dataset, our method achieves significant word error rate (WER) reductions on biasing test sets without introducing additional model parameters, and yields further performance gain when combined with a model-based biasing method.
Details
- Title: Subtitle
- Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm
- Creators
- Weiran Wang - Google (United States)Zelin Wu - Google (United States)Diamantino Caseiro - Google (United States)Tsendsuren Munkhdalai - Google (United States)Khe Chai Sim - Google (United States)Pat Rondon - Google (United States)Golan Pundak - Google (United States)Gan Song - Google (United States)Rohit Prabhavalkar - Google (United States)Zhong Meng - Google (United States)Ding Zhao - Google (United States)Tara Sainath - Google (United States)Yanzhang He - Google (United States)Pedro Moreno Mengibar - Google (United States)
- Resource Type
- Conference proceeding
- Publication Details
- INTERSPEECH 2024, pp.282-286
- Series
- Interspeech
- DOI
- 10.21437/Interspeech.2024-1349
- ISSN
- 2308-457X
- Publisher
- Isca-Int Speech Communication Assoc
- Number of pages
- 5
- Language
- English
- Date published
- 01/01/2024
- Academic Unit
- Computer Science
- Record Identifier
- 9984798224702771
Metrics
6 Record Views