Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Steal My Artworks for Fine-tuning? A Watermarking Framework for Detecting Art Theft Mimicry in Text-to-Image Models (2311.13619v1)

Published 22 Nov 2023 in cs.CV and cs.CR

Abstract: The advancement in text-to-image models has led to astonishing artistic performances. However, several studios and websites illegally fine-tune these models using artists' artworks to mimic their styles for profit, which violates the copyrights of artists and diminishes their motivation to produce original works. Currently, there is a notable lack of research focusing on this issue. In this paper, we propose a novel watermarking framework that detects mimicry in text-to-image models through fine-tuning. This framework embeds subtle watermarks into digital artworks to protect their copyrights while still preserving the artist's visual expression. If someone takes watermarked artworks as training data to mimic an artist's style, these watermarks can serve as detectable indicators. By analyzing the distribution of these watermarks in a series of generated images, acts of fine-tuning mimicry using stolen victim data will be exposed. In various fine-tune scenarios and against watermark attack methods, our research confirms that analyzing the distribution of watermarks in artificially generated images reliably detects unauthorized mimicry.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (29)
  1. Catlora. https://civitai.com/models/99579?modelVersionId=106565. Accessed: 2023-09-05.
  2. Civitai. https://civitai.com/. Accessed: 2023-09-02.
  3. Ghostmix. https://civitai.com/models/36520/. Accessed: 2023-09-05.
  4. Midjourney. https://legacy.midjourney.com/showcase/recent/. Accessed: 2023-09-05.
  5. Ali Al-Haj. Combined dwt-dct digital image watermarking. Journal of computer science, 3(9):740–746, 2007.
  6. AUTOMATIC1111. Sd-webui. https://github.com/AUTOMATIC1111/stable-diffusion-webui. Accessed: 2023-09-03.
  7. Andy Baio. Invasive diffusion: how one unwilling illustrator found herself turned into an ai model. https://waxy.org/2022/11/invasive-diffusion-how-one-unwilling-illustrator-found-herself-turned-into-an-ai-model/. Accessed: 2023-09-02.
  8. Improving image generation with better captions.
  9. Blake Brittain. Ai-created images lose us copyrights in test for new technology. Reuters, February, 2023.
  10. Cogview: Mastering text-to-image generation via transformers. Advances in Neural Information Processing Systems, 34:19822–19835, 2021.
  11. Watermarking images in self-supervised latent spaces. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2022.
  12. A proposed digital image watermarking based on dwt-dct-svd. In 2018 2nd IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC), pages 1214–1218. IEEE, 2018.
  13. Lora: Low-rank adaptation of large language models. In International Conference on Learning Representations, 2021.
  14. Controllable text-to-image generation. Advances in Neural Information Processing Systems, 32, 2019.
  15. Generating images from captions with attention. In ICLR, 2016.
  16. Glide: Towards photorealistic image generation and editing with text-guided diffusion models. In International Conference on Machine Learning, pages 16784–16804. PMLR, 2022.
  17. Hierarchical text-conditional image generation with clip latents.
  18. Zero-shot text-to-image generation. In International Conference on Machine Learning, pages 8821–8831. PMLR, 2021.
  19. Generative adversarial text to image synthesis. In International conference on machine learning, pages 1060–1069. PMLR, 2016.
  20. High-resolution image synthesis with latent diffusion models, 2021.
  21. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10684–10695, 2022.
  22. Dreambooth: Fine tuning text-to-image diffusion models for subject-driven generation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 22500–22510, 2023.
  23. Photorealistic text-to-image diffusion models with deep language understanding. Advances in Neural Information Processing Systems, 35:36479–36494, 2022.
  24. Glaze: Protecting artists from style mimicry by Text-to-Image models. In 32nd USENIX Security Symposium (USENIX Security 23), pages 2187–2204, Anaheim, CA, 2023. USENIX Association.
  25. Nüwa: Visual synthesis pre-training for neural visual world creation. In European conference on computer vision, pages 720–736. Springer, 2022.
  26. Attngan: Fine-grained text to image generation with attentional generative adversarial networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 1316–1324, 2018.
  27. Scaling autoregressive models for content-rich text-to-image generation. Transactions on Machine Learning Research, 2022.
  28. Stackgan: Text to photo-realistic image synthesis with stacked generative adversarial networks. In Proceedings of the IEEE international conference on computer vision, pages 5907–5915, 2017.
  29. Robust invisible video watermarking with attention. 2019.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Ge Luo (8 papers)
  2. Junqiang Huang (5 papers)
  3. Manman Zhang (1 paper)
  4. Zhenxing Qian (54 papers)
  5. Sheng Li (219 papers)
  6. Xinpeng Zhang (86 papers)
Citations (5)

Summary

We haven't generated a summary for this paper yet.