Unknown downstream impacts on scientific decision-making from LLM-shaped review criteria
Characterize the downstream impacts on scientific decision-making (including paper acceptance, research incentives, and field trajectories) that result from systematic differences between the evaluation criteria applied in large language model–generated peer reviews and those applied in human-written peer reviews at venues such as ICLR.
References
These results demonstrate that the criteria under human review and LLM review are significantly different, which will have as-yet-unknown downstream impacts on the decisions made about what scientific work is valid and incentivized.
— How LLMs Distort Our Written Language
(2603.18161 - Abdulhai et al., 18 Mar 2026) in Section 4, Subsection "LLMs Distort Decisions Affecting Scientific Institutions"