Source Separation for A Cappella Music
2509.26580v1
cs.SD, cs.LG
2025-10-02
Авторы:
Luca A. Lanzendörfer, Constantin Pinkl, Florian Grötschla
Abstract
In this work, we study the task of multi-singer separation in a cappella
music, where the number of active singers varies across mixtures. To address
this, we use a power set-based data augmentation strategy that expands limited
multi-singer datasets into exponentially more training samples. To separate
singers, we introduce SepACap, an adaptation of SepReformer, a state-of-the-art
speaker separation model architecture. We adapt the model with periodic
activations and a composite loss function that remains effective when stems are
silent, enabling robust detection and separation. Experiments on the JaCappella
dataset demonstrate that our approach achieves state-of-the-art performance in
both full-ensemble and subset singer separation scenarios, outperforming
spectrogram-based baselines while generalizing to realistic mixtures with
varying numbers of singers.
Ссылки и действия
Дополнительные ресурсы: