Disentangle displacement effects from population confounds in Turing test accuracy
Ascertain whether the reduction in adjudication accuracy observed for displaced human judges who read Turing test transcripts, relative to interactive human interrogators, is attributable to displacement per se rather than to differences in participant populations (social-media-recruited interactive interrogators versus undergraduate displaced adjudicators).
References
Interactive interrogators were recruited via social media while displaced participants were undergraduate students. We therefore cannot know whether this drop in accuracy is purely due to the effect of displacement.
— GPT-4 is judged more human than humans in displaced and inverted Turing tests
(2407.08853 - Rathi et al., 11 Jul 2024) in Section 3.3 Discussion