Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Abstract Visual Reasoning Enabled by Language (2303.04091v3)

Published 7 Mar 2023 in cs.AI, cs.CL, and cs.LG

Abstract: While AI models have achieved human or even superhuman performance in many well-defined applications, they still struggle to show signs of broad and flexible intelligence. The Abstraction and Reasoning Corpus (ARC), a visual intelligence benchmark introduced by Fran\c{c}ois Chollet, aims to assess how close AI systems are to human-like cognitive abilities. Most current approaches rely on carefully handcrafted domain-specific program searches to brute-force solutions for the tasks present in ARC. In this work, we propose a general learning-based framework for solving ARC. It is centered on transforming tasks from the vision to the language domain. This composition of language and vision allows for pre-trained models to be leveraged at each stage, enabling a shift from handcrafted priors towards the learned priors of the models. While not yet beating state-of-the-art models on ARC, we demonstrate the potential of our approach, for instance, by solving some ARC tasks that have not been solved previously.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (16)
  1. Communicating natural programs to humans and machines, 2021.
  2. Roderic Guigo Corominas Alejandro de Miquel, Yuji Ariyasu. Arc kaggle competition, 2020.
  3. Neural-guided, bidirectional program search for abstraction and reasoning, 2021.
  4. Object-centric compositional imagination for visual abstract reasoning. In ICLR2022 Workshop on the Elements of Reasoning: Objects, Structure and Causality, 2022.
  5. Language models are few-shot learners, 2020.
  6. François Chollet. On the measure of intelligence, 2019.
  7. Abstraction and reasoning challenge, 2020.
  8. Vlad Golubev Ilia Larchenko. Abstract reasoning, 2020.
  9. Fast and flexible: Human program induction in abstract reasoning tasks, 2021.
  10. Lab42. Arc abstraction & reasoning corpus, 2022.
  11. Grounding language for transfer in deep reinforcement learning. Journal of Artificial Intelligence Research, 63:849–874, 2018.
  12. Bloom: A 176b-parameter open-access multilingual language model. arXiv preprint arXiv:2211.05100, 2022.
  13. Core knowledge. Dev. Sci., 10(1):89–96, Jan. 2007.
  14. Johan Sokrates Wind. Dsl solution to the arc challenge, 2020.
  15. Graphs, constraints, and search for the abstraction and reasoning corpus, 2022.
  16. Virel: Unsupervised visual relations discovery with graph-level analogy, 2022.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Giacomo Camposampiero (10 papers)
  2. Loic Houmard (2 papers)
  3. Benjamin Estermann (9 papers)
  4. Joël Mathys (11 papers)
  5. Roger Wattenhofer (212 papers)
Citations (10)