Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

LogiGAN: Learning Logical Reasoning via Adversarial Pre-training (2205.08794v2)

Published 18 May 2022 in cs.CL

Abstract: We present LogiGAN, an unsupervised adversarial pre-training framework for improving logical reasoning abilities of LLMs. Upon automatic identifying logical reasoning phenomena in massive text corpus via detection heuristics, we train LLMs to predict the masked-out logical statements. Inspired by the facilitation effect of reflective thinking in human learning, we analogically simulate the learning-thinking process with an adversarial Generator-Verifier architecture to assist logic learning. LogiGAN implements a novel sequential GAN approach that (a) circumvents the non-differentiable challenge of the sequential GAN by leveraging the Generator as a sentence-level generative likelihood scorer with a learning objective of reaching scoring consensus with the Verifier; (b) is computationally feasible for large-scale pre-training with arbitrary target length. Both base and large size LLMs pre-trained with LogiGAN demonstrate obvious performance improvement on 12 datasets requiring general reasoning abilities, revealing the fundamental role of logic in broad reasoning, as well as the effectiveness of LogiGAN. Ablation studies on LogiGAN components reveal the relative orthogonality between linguistic and logic abilities and suggest that reflective thinking's facilitation effect might also generalize to machine learning.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Xinyu Pi (7 papers)
  2. Wanjun Zhong (49 papers)
  3. Yan Gao (157 papers)
  4. Nan Duan (172 papers)
  5. Jian-Guang Lou (69 papers)
Citations (15)