Watermarking Discrete Diffusion Language Models

2511.02083v1 cs.CR, cs.AI, cs.CY 2025-11-06

Авторы:

Avi Bagchi, Akhil Bhimaraju, Moulik Choraria, Daniel Alabi, Lav R. Varshney

Abstract

Watermarking has emerged as a promising technique to track AI-generated content and differentiate it from authentic human creations. While prior work extensively studies watermarking for autoregressive large language models (LLMs) and image diffusion models, none address discrete diffusion language models, which are becoming popular due to their high inference throughput. In this paper, we introduce the first watermarking method for discrete diffusion models by applying the distribution-preserving Gumbel-max trick at every diffusion step and seeding the randomness with the sequence index to enable reliable detection. We experimentally demonstrate that our scheme is reliably detectable on state-of-the-art diffusion language models and analytically prove that it is distortion-free with an exponentially decaying probability of false detection in the token sequence length.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Watermarking Discrete Diffusion Language Models

Авторы:

Abstract

Ссылки и действия

Связанные статьи

A Taxonomy of Pix Fraud in Brazil: Attack Methodologies, AI-Driven Amplification...

Future-Back Threat Modeling: A Foresight-Driven Security Framework

Can AI Models be Jailbroken to Phish Elderly Victims? An End-to-End Evaluation

Covert Surveillance in Smart Devices: A SCOUR Framework Analysis of Youth Privac...

BadScientist: Can a Research Agent Write Convincing but Unsound Papers that Fool...

Навигация