Evaluating LLMs on Real-World Forecasting Against Human Superforecasters (2507.04562v1)

Published 6 Jul 2025 in cs.LG, cs.AI, and cs.CL

Abstract: LLMs have demonstrated remarkable capabilities across diverse tasks, but their ability to forecast future events remains understudied. A year ago, LLMs struggled to come close to the accuracy of a human crowd. I evaluate state-of-the-art LLMs on 464 forecasting questions from Metaculus, comparing their performance against human superforecasters. Frontier models achieve Brier scores that ostensibly surpass the human crowd but still significantly underperform a group of superforecasters.
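
For readers unfamiliar with the metric named in the abstract, the following is a minimal sketch of the standard Brier score for binary questions: the mean squared difference between forecast probabilities and realized outcomes (0 or 1), where lower is better. The function name `brier_score` and the sample numbers are illustrative assumptions; the abstract does not state the paper's exact scoring procedure, so this may differ from what the author used.

```python
from typing import Sequence


def brier_score(probs: Sequence[float], outcomes: Sequence[int]) -> float:
    """Mean squared error between forecast probabilities and binary outcomes.

    Standard binary Brier score (assumed here; not taken from the paper).
    0.0 is a perfect score; always forecasting 0.5 yields 0.25.
    """
    if len(probs) != len(outcomes) or len(probs) == 0:
        raise ValueError("probs and outcomes must be non-empty and equal length")
    return sum((p - o) ** 2 for p, o in zip(probs, outcomes)) / len(probs)


# Hypothetical forecasts on three resolved questions (made-up numbers).
example_probs = [0.9, 0.2, 0.5]
example_outcomes = [1, 0, 1]
example_score = brier_score(example_probs, example_outcomes)
```

Under this definition, a forecaster who always answers 0.5 scores 0.25 regardless of outcomes, which gives a useful baseline when comparing model, crowd, and superforecaster scores.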


Authors (1)
