An Agent-Based Framework for Automated Higher-Voice Harmony Generation
2509.24463v2
cs.SD, cs.AI
2025-10-02
Авторы:
Nia D'Souza Ganapathy, Arul Selvamani Shaja
Abstract
The generation of musically coherent and aesthetically pleasing harmony
remains a significant challenge in the field of algorithmic composition. This
paper introduces an innovative Agentic AI-enabled Higher Harmony Music
Generator, a multi-agent system designed to create harmony in a collaborative
and modular fashion. Our framework comprises four specialized agents: a
Music-Ingestion Agent for parsing and standardizing input musical scores; a
Chord-Knowledge Agent, powered by a Chord-Former (Transformer model), to
interpret and provide the constituent notes of complex chord symbols; a
Harmony-Generation Agent, which utilizes a Harmony-GPT and a Rhythm-Net (RNN)
to compose a melodically and rhythmically complementary harmony line; and an
Audio-Production Agent that employs a GAN-based Symbolic-to-Audio Synthesizer
to render the final symbolic output into high-fidelity audio. By delegating
specific tasks to specialized agents, our system effectively mimics the
collaborative process of human musicians. This modular, agent-based approach
allows for robust data processing, deep theoretical understanding, creative
composition, and realistic audio synthesis, culminating in a system capable of
generating sophisticated and contextually appropriate higher-voice harmonies
for given melodies.
Ссылки и действия
Дополнительные ресурсы: