Susceptibility of ChatGPT’s research quality estimates to training-data gaming
Determine the extent to which ChatGPT 4o-mini’s research quality scores can be manipulated by altering its training data (e.g., through targeted web content injection) so as to bias scores for specific articles, approaches, or institutions, and develop strategies to detect and mitigate such manipulation.
References
Moreover, the extent to which ChatGPT can be gamed through its training data to inflate or deflate article scores is unknown.
— In which fields can ChatGPT detect journal article quality? An evaluation of REF2021 results
(arXiv:2409.16695, Thelwall et al., 25 Sep 2024), in Conclusion