Schrödinger Bridge Mamba for One-Step Speech Enhancement

2510.16834v1 cs.SD, cs.AI, cs.LG, eess.AS 2025-10-22

Авторы:

Jing Yang, Sirui Wang, Chao Wu, Fan Fan

Abstract

We propose Schr\"odinger Bridge Mamba (SBM), a new concept of training-inference framework motivated by the inherent compatibility between Schr\"odinger Bridge (SB) training paradigm and selective state-space model Mamba. We exemplify the concept of SBM with an implementation for generative speech enhancement. Experiments on a joint denoising and dereverberation task using four benchmark datasets demonstrate that SBM, with only 1-step inference, outperforms strong baselines with 1-step or iterative inference and achieves the best real-time factor (RTF). Beyond speech enhancement, we discuss the integration of SB paradigm and selective state-space model architecture based on their underlying alignment, which indicates a promising direction for exploring new deep generative models potentially applicable to a broad range of generative tasks. Demo page: https://sbmse.github.io

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Schrödinger Bridge Mamba for One-Step Speech Enhancement

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Advancing Marine Bioacoustics with Deep Generative Models: A Hybrid Augmentation...

Learning Linearity in Audio Consistency Autoencoders via Implicit Regularization

Automatic Music Sample Identification with Multi-Track Contrastive Learning

Leveraging Whisper Embeddings for Audio-based Lyrics Matching

SAGE-Music: Low-Latency Symbolic Music Generation via Attribute-Specialized Key-...

Навигация