Deep Learning-based Text-in-Image Watermarking (2404.13134v1)
Abstract: In this work, we introduce a novel deep learning-based approach to text-in-image watermarking, a method that embeds and extracts textual information within images to enhance data security and integrity. Leveraging the capabilities of deep learning, specifically through the use of Transformer-based architectures for text processing and Vision Transformers for image feature extraction, our method sets new benchmarks in the domain. The proposed method represents the first application of deep learning in text-in-image watermarking that improves adaptivity, allowing the model to intelligently adjust to specific image characteristics and emerging threats. Through testing and evaluation, our method has demonstrated superior robustness compared to traditional watermarking techniques, achieving enhanced imperceptibility that ensures the watermark remains undetectable across various image contents.
- A. K. Pandey, P. Singh, N. Agarwal, and B. Raman, “Secmed: A secure approach for proving rightful ownership of medical images in encrypted domain over cloud,” in 2018 IEEE Conference on Multimedia Information Processing and Retrieval (MIPR). IEEE, 2018, pp. 390–395.
- X. Zhong, P.-C. Huang, S. Mastorakis, and F. Y. Shih, “An automated and robust image watermarking scheme based on deep neural networks,” IEEE Transactions on Multimedia, vol. 23, pp. 1951–1961, 2020.
- J. Zhu, R. Kaplan, J. Johnson, and L. Fei-Fei, “Hidden: Hiding data with deep networks,” in Proceedings of the European conference on computer vision (ECCV), 2018, pp. 657–672.
- A. Das and X. Zhong, “A deep learning-based audio-in-image watermarking scheme,” in 2021 International Conference on Visual Communications and Image Processing (VCIP). IEEE, 2021, pp. 1–5.
- X. Zhong, A. Das, F. Alrasheedi, and A. Tanvir, “A brief, in-depth survey of deep learning-based image watermarking,” Applied Sciences, vol. 13, no. 21, p. 11852, 2023.
- C. Ou, “Text watermarking for text document copyright protection,” Computer Science, vol. 725, 2003.
- S. G. Rizzo, F. Bertini, and D. Montesi, “Fine-grain watermarking for intellectual property protection,” EURASIP Journal on Information Security, vol. 2019, pp. 1–20, 2019.
- N. S. Kamaruddin, A. Kamsin, L. Y. Por, and H. Rahman, “A review of text watermarking: theory, methods, and applications,” IEEE Access, vol. 6, pp. 8011–8028, 2018.
- M. T. Ahvanooey, Q. Li, X. Zhu, M. Alazab, and J. Zhang, “Anitw: A novel intelligent text watermarking technique for forensic identification of spurious information on social media,” Computers & Security, vol. 90, p. 101702, 2020.
- G. Gupta and A. Khunteta, “Hiding text data in image through image watermarking using dct & dwt: A research paper,” in 2017 IEEE International Conference on Power, Control, Signals and Instrumentation Engineering (ICPCSI). IEEE, 2017, pp. 447–450.
- M. Ahmadi, A. Norouzi, N. Karimi, S. Samavi, and A. Emami, “Redmark: Framework for residual diffusion watermarking based on deep networks,” Expert Systems with Applications, vol. 146, p. 113157, 2020.
- Z. Jia, H. Fang, and W. Zhang, “Mbrs: Enhancing robustness of dnn-based watermarking by mini-batch of real and simulated jpeg compression,” in Proceedings of the 29th ACM international conference on multimedia, 2021, pp. 41–49.
- A. K. Singh, “Improved hybrid algorithm for robust and imperceptible multiple watermarking using digital images,” Multimedia Tools and Applications, vol. 76, pp. 8881–8900, 2017.
- A. Anand and A. K. Singh, “An improved dwt-svd domain watermarking for medical information security,” Computer Communications, vol. 152, pp. 72–80, 2020.
- A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, “Attention is all you need,” Advances in neural information processing systems, vol. 30, 2017.
- J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, “Bert: Pre-training of deep bidirectional transformers for language understanding,” arXiv preprint arXiv:1810.04805, 2018.
- A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly et al., “An image is worth 16x16 words: Transformers for image recognition at scale,” arXiv preprint arXiv:2010.11929, 2020.
- S. Ge, Z. Xia, J. Fei, Y. Tong, J. Weng, and M. Li, “A robust document image watermarking scheme using deep neural network,” Multimedia Tools and Applications, pp. 1–24, 2023.
- T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollár, and C. L. Zitnick, “Microsoft coco: Common objects in context,” in Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13. Springer, 2014, pp. 740–755.
- D. Elliott, S. Frank, K. Sima’an, and L. Specia, “Multi30k: Multilingual english-german image descriptions,” arXiv preprint arXiv:1605.00459, 2016.