SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models

2510.16917v1 cs.SD, cs.AI, cs.CL, eess.AS 2025-10-22

Authors:

Chih-Kai Yang, Yen-Ting Piao, Tzu-Wen Hsu, Szu-Wei Fu, Zhehuai Chen, Ke-Han Lu, Sung-Feng Huang, Chao-Han Huck Yang, Yu-Chiang Frank Wang, Yun-Nung Chen, Hung-yi Lee

Abstract

Knowledge editing offers an efficient way to update model knowledge without full retraining, but prior work has concentrated almost exclusively on textual or visual modalities. We introduce SAKE, the first benchmark specifically designed for editing auditory attribute knowledge in Large Audio-Language Models (LALMs). Unlike factual updates, SAKE targets several abstract auditory attributes, capturing knowledge types that go beyond conventional textual and visual domains. We benchmark seven editing methods on two LALMs along four dimensions: reliability, generality, audio/text locality, and portability. Results highlight challenges such as preserving intra-attribute knowledge unrelated to the edit, generalizing edits to multimodal reasoning, and maintaining edits under sequential updates. SAKE provides a principled framework to study how knowledge editing extends to the auditory modalities, opening new directions for maintaining and adapting LALMs in more diverse real-world scenarios.

Related articles

Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speake...

ControlAudio: Tackling Text-Guided, Timing-Indicated and Intelligible Audio Gene...

AudioMarathon: A Comprehensive Benchmark for Long-Context Audio Understanding an...

From Scores to Preferences: Redefining MOS Benchmarking for Speech Quality Rewar...

MDAR: A Multi-scene Dynamic Audio Reasoning Benchmark
