Teaching the Pre-trained Model to Generate Simple Texts for Text Simplification (2305.12463v1)
Published 21 May 2023 in cs.CL and cs.AI
Abstract: Randomly masking text spans in ordinary texts during pre-training hardly allows models to acquire the ability to generate simple texts, which can hurt the performance of pre-trained models on text simplification tasks. In this paper, we propose a new continued pre-training strategy that teaches a pre-trained model to generate simple texts. We continue pre-training BART, a representative model, to obtain SimpleBART. SimpleBART consistently and significantly improves over BART on lexical simplification, sentence simplification, and document-level simplification. Finally, we compare SimpleBART with several representative large language models (LLMs).
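To make the continued pre-training idea concrete, below is a minimal sketch of continuing to pre-train BART on a simple-text corpus with span infilling, using Hugging Face Transformers. It is not the paper's exact recipe: the `simple_sentences` list, the masking ratio, the span-length draw (an exponential stand-in for BART's Poisson span sampling), and the learning rate are all illustrative assumptions.

```python
import random
import torch
from transformers import BartForConditionalGeneration, BartTokenizerFast

tokenizer = BartTokenizerFast.from_pretrained("facebook/bart-base")
model = BartForConditionalGeneration.from_pretrained("facebook/bart-base")
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-5)

def infill_mask(token_ids, mask_ratio=0.3, mean_span=3):
    """Replace random token spans with a single <mask> token (text infilling).

    Span lengths are drawn from an exponential distribution here as a
    simple stand-in for BART's Poisson(3) span sampling.
    """
    ids = list(token_ids)
    n_to_mask = int(len(ids) * mask_ratio)
    masked = 0
    while masked < n_to_mask and len(ids) > 2:
        span = max(1, int(random.expovariate(1.0 / mean_span)))
        start = random.randrange(1, max(2, len(ids) - span))  # keep <s> and </s>
        ids[start:start + span] = [tokenizer.mask_token_id]
        masked += span
    return ids

# Hypothetical stand-in for a corpus of simple texts (e.g., Simple Wikipedia).
simple_sentences = ["The cat sat on the mat.", "Dogs like to play outside."]

model.train()
for sentence in simple_sentences:
    target = tokenizer(sentence, return_tensors="pt")
    source = torch.tensor([infill_mask(target["input_ids"][0].tolist())])
    # The model reconstructs the original simple sentence from the masked input;
    # passing labels makes BartForConditionalGeneration return the LM loss.
    loss = model(input_ids=source, labels=target["input_ids"]).loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```

The key difference from BART's original pre-training is the data, not the objective: by infilling spans in already-simple texts, the decoder is pushed toward generating simple phrasing, which is what downstream simplification tasks reward.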