A Neural Model for Contextual Biasing Score Learning and Filtering

Wanting Huang; Weiran Wang

doi:10.48550/arxiv.2510.23849

Back

A Neural Model for Contextual Biasing Score Learning and Filtering

Preprint

Open access

A Neural Model for Contextual Biasing Score Learning and Filtering

Wanting Huang and Weiran Wang

ArXiv.org

Cornell University

10/27/2025

DOI: 10.48550/arxiv.2510.23849

Files and links (1)

url

https://doi.org/10.48550/arxiv.2510.23849View

Preprint (Author's original)This preprint has not been evaluated by subject experts through peer review. Preprints may undergo extensive changes and/or become peer-reviewed journal articles. Open Access

Abstract

Contextual biasing improves automatic speech recognition (ASR) by integrating external knowledge, such as user-specific phrases or entities, during decoding. In this work, we use an attention-based biasing decoder to produce scores for candidate phrases based on acoustic information extracted by an ASR encoder, which can be used to filter out unlikely phrases and to calculate bonus for shallow-fusion biasing. We introduce a per-token discriminative objective that encourages higher scores for ground-truth phrases while suppressing distractors. Experiments on the Librispeech biasing benchmark show that our method effectively filters out majority of the candidate phrases, and significantly improves recognition accuracy under different biasing conditions when the scores are used in shallow fusion biasing. Our approach is modular and can be used with any ASR system, and the filtering mechanism can potentially boost performance of other biasing methods.

Computer Science - Artificial Intelligence

Computer Science - Computation and Language

Computer Science - Sound

Details

Title: Subtitle: A Neural Model for Contextual Biasing Score Learning and Filtering
Creators: Wanting Huang
Weiran Wang
Resource Type: Preprint
Publication Details: ArXiv.org
DOI: 10.48550/arxiv.2510.23849
ISSN: 2331-8422
Publisher: Cornell University; Ithaca, New York
Language: English
Date posted: 10/27/2025
Academic Unit: Computer Science
Record Identifier: 9985019035802771

Metrics

12 Record Views