Bayesian Low-Rank Factorization for Robust Model Adaptation

2510.18723v1 cs.CL, cs.LG, cs.SD, eess.AS 2025-10-23

Авторы:

Enes Yavuz Ugan, Ngoc-Quan Pham, Alexander Waibel

Abstract

Large speech foundation models achieve strong performance across many domains, but they often require adaptation to handle local needs such as code-switching, where speakers mix languages within the same utterance. Direct fine-tuning of these models risks overfitting to the target domain and overwriting the broad capabilities of the base model. To address this challenge, we explore Bayesian factorized adapters for speech foundation models, which place priors near zero to achieve sparser adaptation matrices and thereby retain general performance while adapting to specific domains. We apply our approach to the Whisper model and evaluate on different multilingual code-switching scenarios. Our results show only minimal adaptation loss while significantly reducing catastrophic forgetting of the base model. Compared to LoRA, our method achieves a backward gain of 54% with only a 4% drop on the new domain. These findings highlight the effectiveness of Bayesian adaptation for fine-tuning speech foundation models without sacrificing generalization.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Bayesian Low-Rank Factorization for Robust Model Adaptation

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Adapting Language Balance in Code-Switching Speech

CarelessWhisper: Turning Whisper into a Causal Streaming Model

Text to Speech System for Meitei Mayek Script

How Does a Deep Neural Network Look at Lexical Stress?

The State Of TTS: A Case Study with Human Fooling Rates

Навигация