Papers
Topics
Authors
Recent
Search
2000 character limit reached

Authorship and the Politics and Ethics of LLM Watermarks

Published 11 Mar 2024 in cs.CY | (2403.06593v1)

Abstract: Recently, watermarking schemes for LLMs have been proposed to distinguish text generated by machines and by humans. The present paper explores philosophical, political, and ethical ramifications of implementing and using watermarking schemes. A definition of authorship that includes both machines (LLMs) and humans is proposed to serve as a backdrop. It is argued that private watermarks may provide private companies with sweeping rights to determine authorship, which is incompatible with traditional standards of authorship determination. Then, possible ramifications of the so-called entropy dependence of watermarking mechanisms are explored. It is argued that entropy may vary for different, socially salient groups. This could lead to group dependent rates at which machine generated text is detected. Specifically, groups more interested in low entropy text may face the challenge that it is harder to detect machine generated text that is of interest to them.

Authors (1)
Definition Search Book Streamline Icon: https://streamlinehq.com
References (26)
  1. Aaronson, S. 2023. Watermarking of LLMs. Slides, Simons Institute, Berkeley, August 17, 2023.
  2. The Stable Entropy Hypothesis and Entropy-Aware Decoding: An Analysis and Algorithm for Robust Natural Language Generation. arXiv preprint arXiv:2302.06784 .
  3. Undetectable Watermarks for Language Models. arXiv preprint arXiv:2306.09194 .
  4. Coeckelbergh, M. and D. J. Gunkel. 2023. ChatGPT: deconstructing the debate and moving it forward. AI & SOCIETY : 1–11.
  5. Cover, T. M. and J. A. Thomas. 2006. Elements of Information Theory. Wiley, second ed.
  6. de Haas, H. 2023. How Migration Really Works. Penguin.
  7. Eckert, S. 2015. The Guttenberg plagiarism scandal: myths through Germany’s leading news magazines. Journal of Communication Inquiry 39(3): 249–272.
  8. Ehret, K. 2016. An information-theoretic approach to language complexity: variation in naturalistic corpora. Ph.D. thesis, Dissertation, Albert-Ludwigs-Universität Freiburg, 2016.
  9. Publicly detectable watermarking for language models. arXiv preprint arXiv:2310.18491 .
  10. Foucault, M. 1998. What is an Author? In J. D. Faubion, ed., Aesthetics, Method, and Epistemology, vol. 2. New York: The New Press, pp. 205–222.
  11. An ugly truth: Inside Facebook’s battle for domination. Hachette UK.
  12. The ethical need for watermarks in machine-generated language. arXiv preprint arXiv:2209.03118 .
  13. Isaak, J. and M. J. Hanna. 2018. User data privacy: Facebook, Cambridge Analytica, and privacy protection. Computer 51(8): 56–59.
  14. A watermark for large language models. arXiv preprint arXiv:2301.10226 .
  15. Kontoyiannis, I. 1997. The complexity and entropy of literary styles. Tech. rep., Department of Statistics, Stanford University Stanford, CA, USA.
  16. Languages with more speakers tend to be harder to (machine-) learn. Scientific Reports 13(1): 18521.
  17. Robust distortion-free watermarks for language models. arXiv preprint arXiv:2307.15593 .
  18. MacKay, D. 2003. Information Theory, Inference, and Learning Algorithms. Cambridge University Press.
  19. Generative AI entails a credit–blame asymmetry. Nature Machine Intelligence : 1–4.
  20. Guttenberg soll bei Doktorarbeit abgeschrieben haben. Süddeutsche Zeitung.
  21. ChatGPT vs Human-authored Text: Insights into Controllable Text Summarization and Sentence Style Transfer. arXiv preprint arXiv:2306.07799 .
  22. Robust speech recognition via large-scale weak supervision. In International Conference on Machine Learning. PMLR, pp. 28492–28518.
  23. Rodriguez, M. C. 2010. History of supersymmetric extensions of the Standard Model. International Journal of Modern Physics A 25(06): 1091–1121.
  24. Authorship and ChatGPT: a Conservative View. Philosophy & Technology 37(1): 34.
  25. Ethical and social risks of harm from language models. arXiv preprint arXiv:2112.04359 .
  26. Watermarks in the sand: Impossibility of strong watermarking for generative models. arXiv preprint arXiv:2311.04378 .

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 1 tweet with 0 likes about this paper.