Segment-Factorized Full-Song Generation on Symbolic Piano Music

2510.05881v1 cs.SD, cs.AI, cs.LG, cs.MM, eess.AS 2025-10-09
Авторы:

Ping-Yi Chen, Chih-Pin Tan, Yi-Hsuan Yang

Abstract

We propose the Segmented Full-Song Model (SFS) for symbolic full-song generation. The model accepts a user-provided song structure and an optional short seed segment that anchors the main idea around which the song is developed. By factorizing a song into segments and generating each one through selective attention to related segments, the model achieves higher quality and efficiency compared to prior work. To demonstrate its suitability for human-AI interaction, we further wrap SFS into a web application that enables users to iteratively co-create music on a piano roll with customizable structures and flexible ordering.

Ссылки и действия

Связанные статьи

On the de-duplication of the Lakh MIDI dataset

## Контекст Lakh MIDI Dataset (LMD) является одним из крупнейших общедоступных источников символической музыки. Он содер...

2025-09-24

The Name-Free Gap: Policy-Aware Stylistic Control in Music Generation

#### Контекст Текстово-музыкальные модели, такие как MusicGen, успешно подхватывают широкие атрибуты музыки, такие как ...

2025-09-05

From Discord to Harmony: Decomposed Consonance-based Training for Improved Audio...

## Контекст Аудио Чорд Эстимация (Audio Chord Estimation, ACE) — это ключевая задача в области музыкального информационн...

2025-09-05