CharPoet: A Chinese Classical Poetry Generation System Based on Token-free LLM (2401.03512v3)
Abstract: Automatic Chinese classical poetry generation has attracted much research interest, but achieving effective control over format and content simultaneously remains challenging. Traditional systems usually accept keywords as user inputs, resulting in limited control over content. LLMs improve content control by allowing unrestricted user instructions, but the token-by-token generation process frequently makes format errors. Motivated by this, we propose CharPoet, a Chinese classical poetry generation system based on token-free LLM, which provides effective control over both format and content. Our token-free architecture generates in a character-by-character manner, enabling precise control over the number of characters. Pruned from existing token-based LLMs, CharPoet inherits their pretrained capabilities and can generate poetry following instructions like "Write me a poem for my mother's birthday." CharPoet achieves format accuracy above 0.96, outperforming Jiuge-GPT-2 (0.91) and GPT-4 (0.38). In terms of content quality, CharPoet surpasses traditional systems including Jiuge, and is comparable to other LLMs. Our system is open source and available at https://modelscope.cn/models/CharPoet/CharPoet. A video demonstration of CharPoet is available at https://youtu.be/voZ25qEp3Dc.
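To make the notion of format control concrete, here is a minimal sketch of how a format check and a format-accuracy score could be computed for generated poems. It is not taken from the CharPoet code base: the template table `FORMATS`, the punctuation set, and the function names are illustrative assumptions, and the paper's exact metric definition may differ.

```python
import re
from typing import List

# Hypothetical templates (an assumption for illustration): the number of
# characters expected in each line of a given poem format.
FORMATS = {
    "five_char_quatrain": [5, 5, 5, 5],
    "seven_char_quatrain": [7, 7, 7, 7],
}

# Assumed separators: common Chinese and ASCII punctuation plus whitespace.
_SEPARATORS = re.compile(r"[，。！？；、,.!?;\s]+")


def line_lengths(poem: str) -> List[int]:
    """Split a poem on punctuation and count the characters in each line."""
    return [len(segment) for segment in _SEPARATORS.split(poem) if segment]


def matches_format(poem: str, fmt: str) -> bool:
    """True if every line has exactly the character count the template asks for."""
    return line_lengths(poem) == FORMATS[fmt]


def format_accuracy(poems: List[str], fmt: str) -> float:
    """Fraction of poems whose line lengths match the template."""
    if not poems:
        return 0.0
    return sum(matches_format(p, fmt) for p in poems) / len(poems)


if __name__ == "__main__":
    good = "床前明月光，疑是地上霜。举头望明月，低头思故乡。"
    bad = "床前明月光，疑是地上的霜。举头望明月，低头思故乡。"  # second line has 6 characters
    print(line_lengths(good), matches_format(good, "five_char_quatrain"))
    print(line_lengths(bad), matches_format(bad, "five_char_quatrain"))
    print(format_accuracy([good, bad], "five_char_quatrain"))
```

A token-based model can emit several characters in a single step, which makes it hard to hit an exact count such as five characters per line; generating one character per step lets the count be tracked directly during decoding.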
Authors: Chengyue Yu, Lei Zang, Jiaotuan Wang, Chenyi Zhuang, Jinjie Gu