Mathematical Reasoning via Self-supervised Skip-tree Training (2006.04757v3)
Published 8 Jun 2020 in cs.LG, cs.AI, cs.PL, and stat.ML
Abstract: We examine whether self-supervised language modeling applied to mathematical formulas enables logical reasoning. We suggest several logical reasoning tasks that can be used to evaluate language models trained on formal mathematical statements, such as type inference, suggesting missing assumptions, and completing equalities. To train language models for formal mathematics, we propose a novel skip-tree task. We find that models trained on the skip-tree task show surprisingly strong mathematical reasoning abilities and outperform models trained on standard skip-sequence tasks. We also analyze the models' ability to formulate new conjectures by measuring how often the predictions are provable and useful in other proofs.
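The core idea behind skip-tree training is to mask out a syntactic subtree of a formal statement and train the model to reconstruct it, rather than masking an arbitrary token span. Below is a minimal Python sketch of such a masking step, under the assumption that formulas are serialized as S-expressions; the parser, the `<MASK>` token, and all function names are illustrative and not taken from the paper's actual pipeline.

```python
import random

def parse_sexp(text):
    """Parse an S-expression string into a nested list of tokens."""
    tokens = text.replace("(", " ( ").replace(")", " ) ").split()
    def read(pos):
        node = []
        while pos < len(tokens):
            tok = tokens[pos]
            if tok == "(":
                child, pos = read(pos + 1)
                node.append(child)
            elif tok == ")":
                return node, pos + 1
            else:
                node.append(tok)
                pos += 1
        return node, pos
    tree, _ = read(0)
    return tree[0] if len(tree) == 1 else tree

def to_sexp(node):
    """Serialize a nested list back into an S-expression string."""
    if isinstance(node, list):
        return "(" + " ".join(to_sexp(c) for c in node) + ")"
    return node

def subtrees(node, path=()):
    """Yield (path, subtree) pairs for every internal subtree."""
    if isinstance(node, list):
        yield path, node
        for i, child in enumerate(node):
            yield from subtrees(child, path + (i,))

def skip_tree_example(formula, rng=random):
    """Replace one randomly chosen subtree with a mask token.

    Returns (source, target): the masked formula and the removed
    subtree, which the model is trained to reconstruct.
    """
    tree = parse_sexp(formula)
    path, chosen = rng.choice(list(subtrees(tree)))
    target = to_sexp(chosen)
    if not path:  # the whole formula was chosen
        return "<MASK>", target
    # Walk to the parent of the chosen subtree and splice in the mask.
    parent = tree
    for i in path[:-1]:
        parent = parent[i]
    parent[path[-1]] = "<MASK>"
    return to_sexp(tree), target

src, tgt = skip_tree_example("(= (+ a (neg a)) zero)")
print(src)  # e.g. "(= (+ a <MASK>) zero)"
print(tgt)  # e.g. "(neg a)"
```

Because the mask always covers a well-formed subtree, the prediction target is itself a syntactically valid term, which is what makes the evaluation tasks above (type inference, missing assumptions, completing equalities) natural fits for this training signal; a standard skip-sequence mask, by contrast, can cut across tree boundaries.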