SPOT: Text Source Prediction from Originality Score Thresholding (2405.20505v1)

Published 30 May 2024 in cs.CL and cs.LG

Abstract: The wide acceptance of LLMs has unlocked new applications and social risks. Popular countermeasures aim at detecting misinformation, usually involve domain specific models trained to recognize the relevance of any information. Instead of evaluating the validity of the information, we propose to investigate LLM generated text from the perspective of trust. In this study, we define trust as the ability to know if an input text was generated by a LLM or a human. To do so, we design SPOT, an efficient method, that classifies the source of any, standalone, text input based on originality score. This score is derived from the prediction of a given LLM to detect other LLMs. We empirically demonstrate the robustness of the method to the architecture, training data, evaluation data, task and compression of modern LLMs.

PDF HTML Abstract

Summarize Bookmark Chat (Pro)

Authors (2)

Edouard Yvinec (19 papers)
Gabriel Kasser (1 paper)

Tweets

https://twitter.com/realmofresearch/status/1797617247319691361

SPOT: Text Source Prediction from Originality Score Thresholding (2405.20505v1)

Related Papers

Tweets