AWARE: Audio Watermarking with Adversarial Resistance to Edits

2510.17512v1 cs.SD, cs.LG, cs.MM, eess.AS 2025-10-22

Authors:

Kosta Pavlović, Lazar Stanarević, Petar Nedić, Slavko Kovačević, Igor Djurović

Abstract

Prevailing practice in learning-based audio watermarking is to pursue robustness by expanding the set of simulated distortions during training. However, such surrogates are narrow and prone to overfitting. This paper presents AWARE (Audio Watermarking with Adversarial Resistance to Edits), an alternative approach that avoids reliance on attack-simulation stacks and handcrafted differentiable distortions. Embedding is obtained via adversarial optimization in the time-frequency domain under a level-proportional perceptual budget. Detection employs a time-order-agnostic detector with a Bitwise Readout Head (BRH) that aggregates temporal evidence into one score per watermark bit, enabling reliable watermark decoding even under desynchronization and temporal cuts. Empirically, AWARE attains high audio quality and speech intelligibility (PESQ/STOI) and consistently low BER across various audio edits, often surpassing representative state-of-the-art learning-based audio watermarking systems.
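The abstract describes the detector only at a high level: temporal evidence is aggregated into one score per watermark bit in a way that does not depend on time order. The sketch below illustrates that general idea with NumPy. It is not the paper's Bitwise Readout Head: the mean-over-time pooling, the zero decision threshold, and the synthetic per-frame scores are all assumptions made for illustration, and the names `bitwise_readout` and `decode_bits` are hypothetical.

```python
import numpy as np

def bitwise_readout(frame_scores: np.ndarray) -> np.ndarray:
    """Pool per-frame evidence of shape (n_frames, n_bits) into one score per bit.

    Assumption: a plain mean over the time axis stands in for the readout.
    A mean is permutation-invariant, so frame order does not matter.
    """
    return frame_scores.mean(axis=0)

def decode_bits(frame_scores: np.ndarray) -> np.ndarray:
    """Threshold the pooled scores at zero (assumed decision rule)."""
    return (bitwise_readout(frame_scores) > 0).astype(int)

rng = np.random.default_rng(0)
n_frames, n_bits = 400, 16

# Hypothetical payload and synthetic per-frame detector outputs:
# weak signed evidence for each bit, buried in unit-variance noise.
bits = rng.integers(0, 2, size=n_bits)
signs = 2 * bits - 1
frames = 0.5 * signs + rng.normal(0.0, 1.0, size=(n_frames, n_bits))

# Simulated edits: reorder all frames, or keep only a middle segment.
shuffled = frames[rng.permutation(n_frames)]
cut = frames[100:300]

decoded_clean = decode_bits(frames)
decoded_shuffled = decode_bits(shuffled)
decoded_cut = decode_bits(cut)
```

Because the pooling ignores frame positions, reordering frames leaves the per-bit scores unchanged up to floating-point rounding, and a temporal cut only reduces the amount of evidence averaged rather than misaligning it.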
