Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

In-sample Curriculum Learning by Sequence Completion for Natural Language Generation (2211.11297v2)

Published 21 Nov 2022 in cs.CL

Abstract: Curriculum learning has shown promising improvements in multiple domains by training machine learning models from easy samples to hard ones. Previous works which either design rules or train models for scoring the difficulty highly rely on task-specific expertise, and cannot generalize. Inspired by the "easy-to-hard" intuition, we propose to do in-sample curriculum learning for natural language generation tasks. Our learning strategy starts training the model to generate the last few words, i.e., do sequence completion, and gradually extends to generate the whole output sequence. Comprehensive experiments show that it generalizes well to different tasks and achieves significant improvements over strong baselines.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Qi Jia (42 papers)
  2. Yizhu Liu (9 papers)
  3. Haifeng Tang (20 papers)
  4. Kenny Q. Zhu (50 papers)
Citations (3)

Summary

We haven't generated a summary for this paper yet.