Papers
Topics
Authors
Recent
2000 character limit reached

Watermarking Discrete Diffusion Language Models (2511.02083v1)

Published 3 Nov 2025 in cs.CR, cs.AI, and cs.CY

Abstract: Watermarking has emerged as a promising technique to track AI-generated content and differentiate it from authentic human creations. While prior work extensively studies watermarking for autoregressive LLMs and image diffusion models, none address discrete diffusion LLMs, which are becoming popular due to their high inference throughput. In this paper, we introduce the first watermarking method for discrete diffusion models by applying the distribution-preserving Gumbel-max trick at every diffusion step and seeding the randomness with the sequence index to enable reliable detection. We experimentally demonstrate that our scheme is reliably detectable on state-of-the-art diffusion LLMs and analytically prove that it is distortion-free with an exponentially decaying probability of false detection in the token sequence length.

Summary

We haven't generated a summary for this paper yet.

Whiteboard

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 1 tweet with 4 likes about this paper.