Can fully automated AI scientists generate paradigm-shifting ideas?
Determine whether fully automated scientific discovery agents built on large language models, specifically The AI Scientist framework described in this paper, can autonomously generate genuinely paradigm-shifting ideas in machine learning that are comparable in impact to landmark innovations such as diffusion models or transformer architectures.
References
While the current iteration of The AI Scientist demonstrates a strong ability to innovate on top of well-established ideas, such as Diffusion Modeling or Transformers, it is an open question whether such systems can ultimately propose genuinely paradigm-shifting ideas.
— The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
(2408.06292 - Lu et al., 2024) in Section 7, Discussion