Lost in Overlap: Exploring Logit-based Watermark Collision in LLMs (2403.10020v3)
Abstract: The proliferation of LLMs in content generation raises concerns about text copyright. Watermarking methods, particularly logit-based approaches, embed imperceptible identifiers into generated text to address these challenges. However, the widespread use of watermarking across diverse LLMs has led to an inevitable issue known as watermark collision during common tasks such as paraphrasing or translation. In this paper, we introduce watermark collision as a novel and general philosophy for watermark attacks, aimed at enhancing attack performance on top of other attack methods. We also provide a comprehensive demonstration that watermark collision poses a threat to all logit-based watermark algorithms, impacting not only specific attack scenarios but also downstream applications.
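For context, the sketch below illustrates one common logit-based ("green list") watermark in the style of Kirchenbauer et al. (2023); the constants (GAMMA, DELTA), the hashing scheme, and the function names are illustrative assumptions rather than the exact scheme evaluated in this paper. Collision arises when text carrying one such bias is rewritten (e.g., paraphrased or translated) by a second watermarked model, so two independent green-list biases end up superimposed on the same passage.

```python
# Minimal sketch of a green-list logit watermark (assumed setup, not the
# paper's exact implementation): a pseudo-random subset of the vocabulary,
# seeded by the previous token, receives a small logit bonus before sampling.
import hashlib
import numpy as np

VOCAB_SIZE = 50_000   # assumed vocabulary size
GAMMA = 0.5           # fraction of the vocabulary placed on the green list
DELTA = 2.0           # logit bias added to green-list tokens

def green_list(prev_token_id: int, key: int = 42) -> np.ndarray:
    """Pseudo-randomly partition the vocabulary, seeded by the previous token."""
    seed = int(hashlib.sha256(f"{key}:{prev_token_id}".encode()).hexdigest(), 16) % (2**32)
    rng = np.random.default_rng(seed)
    ids = rng.permutation(VOCAB_SIZE)
    return ids[: int(GAMMA * VOCAB_SIZE)]

def watermark_logits(logits: np.ndarray, prev_token_id: int) -> np.ndarray:
    """Bias the next-token logits toward the green list before sampling."""
    biased = logits.copy()
    biased[green_list(prev_token_id)] += DELTA
    return biased
```

Roughly speaking, detection then counts how many tokens of a candidate text fall on their respective green lists and applies a statistical test; when a second watermarked model rewrites the text, its own bias perturbs exactly these counts, which is the overlap effect studied here.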