Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

SikuGPT: A Generative Pre-trained Model for Intelligent Information Processing of Ancient Texts from the Perspective of Digital Humanities (2304.07778v1)

Published 16 Apr 2023 in cs.CL

Abstract: The rapid advance in artificial intelligence technology has facilitated the prosperity of digital humanities research. Against such backdrop, research methods need to be transformed in the intelligent processing of ancient texts, which is a crucial component of digital humanities research, so as to adapt to new development trends in the wave of AIGC. In this study, we propose a GPT model called SikuGPT based on the corpus of Siku Quanshu. The model's performance in tasks such as intralingual translation and text classification exceeds that of other GPT-type models aimed at processing ancient texts. SikuGPT's ability to process traditional Chinese ancient texts can help promote the organization of ancient information and knowledge services, as well as the international dissemination of Chinese ancient culture.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (11)
  1. Liu Chang (6 papers)
  2. Wang Dongbo (2 papers)
  3. Zhao Zhixiao (2 papers)
  4. Hu Die (1 paper)
  5. Wu Mengcheng (1 paper)
  6. Lin Litao (1 paper)
  7. Shen Si (2 papers)
  8. Li Bin (2 papers)
  9. Liu Jiangfeng (1 paper)
  10. Zhang Hai (2 papers)
  11. Zhao Lianzheng (1 paper)
Citations (8)

Summary

We haven't generated a summary for this paper yet.