Automatic Music Sample Identification with Multi-Track Contrastive Learning
2510.11507v1
cs.SD, cs.AI, cs.LG, eess.AS
2025-10-15
Authors:
Alain Riou, Joan Serrà, Yuki Mitsufuji
Abstract
Sampling, the technique of reusing pieces of existing audio tracks to create
new music content, is a very common practice in modern music production. In
this paper, we tackle the challenging task of automatic sample identification,
that is, detecting such sampled content and retrieving the material from which
it originates. To do so, we adopt a self-supervised learning approach that
leverages a multi-track dataset to create positive pairs of artificial mixes,
and design a novel contrastive learning objective. We show that this method
significantly outperforms previous state-of-the-art baselines, is robust
across genres, and scales well as the number of noise songs in the
reference database increases. In addition, we extensively analyze the
contribution of the different components of our training pipeline and
highlight, in particular, the need for high-quality separated stems for this
task.
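
As a rough illustration of the idea of building positive pairs of artificial mixes from multi-track data, the sketch below may help. It is not the authors' pipeline: the mixing rule (two mixes that share one stem and split the remaining stems between them), the function names `make_positive_pair` and `info_nce`, and the temperature value are all assumptions made for illustration. The loss shown is the standard InfoNCE objective, used here as a stand-in; the abstract does not specify the form of the novel objective.

```python
import numpy as np


def make_positive_pair(stems, rng):
    """Build two artificial mixes that share exactly one stem.

    stems: array of shape (n_stems, n_samples), the separated tracks of one song.
    Assumed scheme (hypothetical): one stem is shared by both mixes, and the
    remaining stems are randomly split between the two mixes.
    """
    n_stems = stems.shape[0]
    shared = rng.integers(n_stems)
    others = np.array([i for i in range(n_stems) if i != shared])
    in_a = rng.random(others.size) < 0.5  # random split of the other stems
    mix_a = stems[shared] + stems[others[in_a]].sum(axis=0)
    mix_b = stems[shared] + stems[others[~in_a]].sum(axis=0)
    return mix_a, mix_b


def info_nce(z_a, z_b, temperature=0.1):
    """Standard InfoNCE loss: row i of z_a should match row i of z_b.

    z_a, z_b: embeddings of shape (batch, dim). Other rows in the batch act
    as negatives. This is a generic contrastive loss, not the paper's own.
    """
    z_a = z_a / np.linalg.norm(z_a, axis=1, keepdims=True)
    z_b = z_b / np.linalg.norm(z_b, axis=1, keepdims=True)
    logits = z_a @ z_b.T / temperature
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-np.mean(np.diag(log_prob)))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    stems = rng.standard_normal((4, 1000))  # toy stand-in for 4 stems
    mix_a, mix_b = make_positive_pair(stems, rng)
    print(mix_a.shape, mix_b.shape)
```

In a real setting, an encoder would map each mix to an embedding before the loss is computed; the toy data above only exercises the shapes.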