Sample-efficient Linguistic Generalizations through Program Synthesis: Experiments with Phonology Problems (2106.06566v1)

Published 11 Jun 2021 in cs.CL

Abstract: Neural models excel at extracting statistical patterns from large amounts of data, but struggle to learn patterns or reason about language from only a few examples. In this paper, we ask: Can we learn explicit rules that generalize well from only a few examples? We explore this question using program synthesis. We develop a synthesis model to learn phonology rules as programs in a domain-specific language. We test the ability of our models to generalize from few training examples using our new dataset of problems from the Linguistics Olympiad, a challenging set of tasks that require strong linguistic reasoning ability. In addition to being highly sample-efficient, our approach generates human-readable programs, and allows control over the generalizability of the learnt programs.
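The paper's DSL for phonology rules is not reproduced here, so as a hypothetical illustration of what a synthesized rule-as-program might look like, the sketch below represents rewrite rules as regex (pattern, replacement) pairs and applies them in order. The specific rules (word-final obstruent devoicing) are a standard textbook example, not taken from the paper's dataset.

```python
import re

# Hypothetical rule representation: each rule is a (pattern, replacement)
# regex pair, standing in for a program in the paper's (unshown) DSL.
# Example process: word-final obstruent devoicing.
RULES = [
    (r"b$", "p"),
    (r"d$", "t"),
    (r"g$", "k"),
]

def apply_rules(word, rules):
    """Apply each rewrite rule to the word, in order."""
    for pattern, replacement in rules:
        word = re.sub(pattern, replacement, word)
    return word

print(apply_rules("hund", RULES))  # -> "hunt"
print(apply_rules("tag", RULES))   # -> "tak"
```

A program in this form is human-readable and auditable rule by rule, which is the kind of interpretability the abstract contrasts with opaque neural models.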

Authors (4)
  1. Saujas Vaduguru
  2. Aalok Sathe
  3. Monojit Choudhury
  4. Dipti Misra Sharma
Citations (2)