Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Restoring Ancient Ideograph: A Multimodal Multitask Neural Network Approach (2403.06682v1)

Published 11 Mar 2024 in cs.CL, cs.CV, and cs.CY

Abstract: Cultural heritage serves as the enduring record of human thought and history. Despite significant efforts dedicated to the preservation of cultural relics, many ancient artefacts have been ravaged irreversibly by natural deterioration and human actions. Deep learning technology has emerged as a valuable tool for restoring various kinds of cultural heritages, including ancient text restoration. Previous research has approached ancient text restoration from either visual or textual perspectives, often overlooking the potential of synergizing multimodal information. This paper proposes a novel Multimodal Multitask Restoring Model (MMRM) to restore ancient texts, particularly emphasising the ideograph. This model combines context understanding with residual visual information from damaged ancient artefacts, enabling it to predict damaged characters and generate restored images simultaneously. We tested the MMRM model through experiments conducted on both simulated datasets and authentic ancient inscriptions. The results show that the proposed method gives insightful restoration suggestions in both simulation experiments and real-world scenarios. To the best of our knowledge, this work represents the pioneering application of multimodal deep learning in ancient text restoration, which will contribute to the understanding of ancient society and culture in digital humanities fields.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (27)
  1. Restoring ancient text using deep learning: a case study on Greek epigraphy. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 6368–6375, Hong Kong, China. Association for Computational Linguistics.
  2. Restoring and attributing ancient texts using deep neural networks. Nature, 603(7900):280–283.
  3. Curriculum learning. In Proceedings of the 26th Annual International Conference on Machine Learning, ICML ’09, page 41–48, New York, NY, USA. Association for Computing Machinery.
  4. Dual discriminator gan: Restoring ancient yi characters. Transactions on Asian and Low-Resource Language Information Processing, 21(4):1–23.
  5. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 4171–4186, Minneapolis, Minnesota. Association for Computational Linguistics.
  6. Enabling language models to fill in the blanks. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 2492–2501, Online. Association for Computational Linguistics.
  7. Disentangling the cultural evolution of ancient china: a digital humanities perspective. Humanities and Social Sciences Communications, 10(1):1–15.
  8. Nacer Farajzadeh and Mahdi Hashemzadeh. 2021. A deep neural network based framework for restoring the damaged persian pottery via digital inpainting. Journal of Computational Science, 56:101486.
  9. Restoration of fragmentary babylonian texts using recurrent neural networks. Proceedings of the National Academy of Sciences, 117(37):22743–22751.
  10. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778.
  11. Restoring and mining the records of the Joseon dynasty via neural language modeling and machine translation. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 4031–4042, Online. Association for Computational Linguistics.
  12. Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
  13. Filling the gaps in Ancient Akkadian texts: A masked language modelling approach. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 4682–4691, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.
  14. Deep matching network for handwritten chinese character recognition. Pattern Recognition, 107:107471.
  15. Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692.
  16. Ancient text recognition: a review. Artificial Intelligence Review, 53:5517–5558.
  17. Cecilia Ostertag and Marie Beurton-Aimar. 2020. Matching ostraca fragments using a siamese neural network. Pattern Recognition Letters, 131:336–340.
  18. A generative model for the mycenaean linear b script and its application in infilling text from ancient tablets. J. Comput. Cult. Herit., 16(3).
  19. Blank language models. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 5186–5198, Online. Association for Computational Linguistics.
  20. Mohamed Ali Souibgui and Yousri Kessentini. 2022. De-gan: A conditional generative adversarial network for document enhancement. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44(3):1180–1191.
  21. A restoration method using dual generate adversarial networks for chinese ancient characters. Visual Informatics, 6(1):26–34.
  22. Text Extraction and Restoration of Old Handwritten Documents, pages 109–132. Springer International Publishing, Cham.
  23. Dunhuang mural restoration using deep learning. In SIGGRAPH Asia 2018 Technical Briefs, SA ’18, New York, NY, USA. Association for Computing Machinery.
  24. Evol project: a comprehensive online platform for quantitative analysis of ancient literature. Humanities and Social Sciences Communications, 11(1):1–13.
  25. Controllable digital restoration of ancient paintings using convolutional neural network and nearest neighbor. Pattern Recognition Letters, 133:158–164.
  26. Online and offline handwritten chinese character recognition: A comprehensive study and new benchmark. Pattern Recognition, 61:348–360.
  27. Virtual restoration of the colored paintings on weathered beams in the forbidden city using multiple deep learning algorithms. Advanced Engineering Informatics, 50:101421.
Citations (1)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com