Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

PHYRE: A New Benchmark for Physical Reasoning (1908.05656v1)

Published 15 Aug 2019 in cs.LG, cs.AI, and stat.ML

Abstract: Understanding and reasoning about physics is an important ability of intelligent agents. We develop the PHYRE benchmark for physical reasoning that contains a set of simple classical mechanics puzzles in a 2D physical environment. The benchmark is designed to encourage the development of learning algorithms that are sample-efficient and generalize well across puzzles. We test several modern learning algorithms on PHYRE and find that these algorithms fall short in solving the puzzles efficiently. We expect that PHYRE will encourage the development of novel sample-efficient agents that learn efficient but useful models of physics. For code and to play PHYRE for yourself, please visit https://player.phyre.ai.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Anton Bakhtin (16 papers)
  2. Laurens van der Maaten (54 papers)
  3. Justin Johnson (56 papers)
  4. Laura Gustafson (11 papers)
  5. Ross Girshick (75 papers)
Citations (114)

Summary

We haven't generated a summary for this paper yet.