Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Improving Compositional Generalization in Classification Tasks via Structure Annotations (2106.10434v1)

Published 19 Jun 2021 in cs.LG and cs.CL

Abstract: Compositional generalization is the ability to generalize systematically to a new data distribution by combining known components. Although humans seem to have a great ability to generalize compositionally, state-of-the-art neural models struggle to do so. In this work, we study compositional generalization in classification tasks and present two main contributions. First, we study ways to convert a natural language sequence-to-sequence dataset to a classification dataset that also requires compositional generalization. Second, we show that providing structural hints (specifically, providing parse trees and entity links as attention masks for a Transformer model) helps compositional generalization.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Juyong Kim (4 papers)
  2. Pradeep Ravikumar (101 papers)
  3. Joshua Ainslie (32 papers)
  4. Santiago Ontañón (28 papers)
Citations (16)