Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Detection of ChatGPT Fake Science with the xFakeSci Learning Algorithm (2308.11767v4)

Published 15 Aug 2023 in cs.CL and cs.IR

Abstract: Generative AI tools exemplified by ChatGPT are becoming a new reality. This study is motivated by the premise that ``AI generated content may exhibit a distinctive behavior that can be separated from scientific articles''. In this study, we show how articles can be generated using means of prompt engineering for various diseases and conditions. We then show how we tested this premise in two phases and prove its validity. Subsequently, we introduce xFakeSci, a novel learning algorithm, that is capable of distinguishing ChatGPT-generated articles from publications produced by scientists. The algorithm is trained using network models driven from both sources. As for the classification step, it was performed using 300 articles per condition. The actual label steps took place against an equal mix of 50 generated articles and 50 authentic PubMed abstracts. The testing also spanned publication periods from 2010 to 2024 and encompassed research on three distinct diseases: cancer, depression, and Alzheimer's. Further, we evaluated the accuracy of the xFakeSci algorithm against some of the classical data mining algorithms (e.g., Support Vector Machines, Regression, and Naive Bayes). The xFakeSci algorithm achieved F1 scores ranging from 80% to 94%, outperforming common data mining algorithms, which scored F1 values between 38% and 52%. We attribute the noticeable difference to the introduction of calibration and a proximity distance heuristic, which underscores this promising performance. Indeed, the prediction of fake science generated by ChatGPT presents a considerable challenge. Nonetheless, the introduction of the xFakeSci algorithm is a significant step on the way to combating fake science.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (57)
  1. Chatgpt. Online: https://chat.openai.com (2023). Accessed August 15, 2023.
  2. Citespace ii: visualization and knowledge discovery in bibliographic databases. In AMIA annual symposium proceedings, vol. 2005, 724 (American Medical Informatics Association, 2005).
  3. Holzinger, A. et al. On graph entropy measures for knowledge discovery from publication network data. In Availability, Reliability, and Security in Information Systems and HCI: IFIP WG 8.4, 8.9, TC 5 International Cross-Domain Conference, CD-ARES 2013, Regensburg, Germany, September 2-6, 2013. Proceedings 8, 354–362 (Springer, 2013).
  4. Knowledge discovery out of text data: a systematic review via text mining. \JournalTitleJournal of knowledge management 22, 1471–1488 (2018).
  5. Fish tales: Combating fake science in popular media. \JournalTitleOcean & Coastal Management 115, 88–91 (2015).
  6. Fake science and the knowledge crisis: ignorance can be fatal. \JournalTitleRoyal Society open science 6, 190161 (2019).
  7. Let’s nab fake science news: Predicting scientists’ support for interventions using the influence of presumed media influence model. \JournalTitleJournalism 23, 910–928 (2022).
  8. Addressing the big business of fake science. \JournalTitleMolecular Therapy 30, 2390 (2022).
  9. Rocha, Y. M. et al. The impact of fake news on social media and its influence on health during the covid-19 pandemic: A systematic review. \JournalTitleJournal of Public Health 1–10 (2021).
  10. Evaluating the impact of attempts to correct health misinformation on social media: A meta-analysis. \JournalTitleHealth communication 36, 1776–1784 (2021).
  11. Measuring the impact of covid-19 vaccine misinformation on vaccination intent in the uk and usa. \JournalTitleNature human behaviour 5, 337–348 (2021).
  12. Misinformation and its correction: Continued influence and successful debiasing. \JournalTitlePsychological science in the public interest 13, 106–131 (2012).
  13. Misinformation about vaccines. \JournalTitleVaccines for biodefense and emerging and neglected diseases 255–270 (2009).
  14. Government orders review into vitamin d’s role in covid-19. Online: https://www.dailymail.co.uk/news/article-8432321/Government-orders-review-vitamin-D-role-Covid-19.html (2020). Accessed on April 13, 2024.
  15. Fighting the covid-19 infodemic in news articles and false publications: The neonet text classifier, a supervised machine learning algorithm. \JournalTitleApplied Sciences 11, 7265 (2021).
  16. Safeguarding authenticity for mitigating the harms of generative ai: Issues, research agenda, and policies for detection, fact-checking, and ethical ai. \JournalTitleiScience 27, 108782, DOI: https://doi.org/10.1016/j.isci.2024.108782 (2024).
  17. Eysenbach, G. et al. The role of chatgpt, generative language models, and artificial intelligence in medical education: a conversation with chatgpt and a call for papers. \JournalTitleJMIR Medical Education 9, e46885 (2023).
  18. IEEE special issue on education in the world of ChatGPT and other generative AI. Online: https://ieee-edusociety.org/ieee-special-issue-education-world-chatgpt-and-other-generative-ai (2023). Accessed April 13, 2024.
  19. Financial innovation. Online: https://jfin-swufe.springeropen.com/special-issue—chatgpt-and-generative-ai-in-finance (2023). Accessed April 13, 2024.
  20. Special issue "language generation with pretrained models". Online: https://www.mdpi.com/journal/languages/special_issues/K1Z08ODH6V (Year). Accessed April 13, 2023.
  21. Call for papers for the special focus issue on ChatGPT and large language models (LLMs) in biomedicine and health. https://academic.oup.com/jamia/pages/call-for-papers-for-special-focus-issue (Year). Accessed July 4, 2023.
  22. Best practices for using ai tools as an author, peer reviewer, or editor. \JournalTitleJ Med Internet Res 25, e51584, DOI: 10.2196/51584 (2023).
  23. Null, N. The PNAS journals outline their policies for ChatGPT and generative AI. \JournalTitlePNAS Updates DOI: 10.1073/pnas-updates.2023-02-21 (2023). Published online.
  24. Brainard, J. As scientists explore ai-written text, journals hammer out policies. \JournalTitleScience 379, 740–741 (2023).
  25. Fuster, V. et al. Jacc journals’ pathway forward with ai tools: The future is now. \JournalTitleJACC: Advances 2, 100296, DOI: https://doi.org/10.1016/j.jacadv.2023.100296 (2023).
  26. Nonhuman “authors” and implications for the integrity of scientific publication and medical knowledge. \JournalTitleJama 329, 637–639 (2023).
  27. Chatgpt plugins. Online: https://openai.com/blog/chatgpt-plugins (2023). Accessed April 13, 2023.
  28. Gilson, A. et al. How does chatgpt perform on the united states medical licensing examination? the implications of large language models for medical education and knowledge assessment. \JournalTitleJMIR Medical Education 9, e45312 (2023).
  29. Chaka, C. Detecting ai content in responses generated by chatgpt, youchat, and chatsonic: The case of five ai content detection tools. \JournalTitleJournal of Applied Learning and Teaching 6 (2023).
  30. Vapnik, V. N. An overview of statistical learning theory. \JournalTitleIEEE transactions on neural networks 10, 988–999 (1999).
  31. Cingillioglu, I. Detecting ai-generated essays: the chatgpt challenge. \JournalTitleThe International Journal of Information and Learning Technology 40, 259–268 (2023).
  32. Copyleaks: AI & machine learning powered plagiarism checker. Online: https://copyleaks.com/. Accessed April 13, 2024.
  33. Crossplag: Online plagiarism checker. Online: https://crossplag.com/. Accessed April 13, 2024.
  34. Evaluating the efficacy of ai content detection tools in differentiating between human and ai-generated text. \JournalTitleInternational Journal for Educational Integrity 19, 17 (2023).
  35. Anderson, N. et al. Ai did not write this manuscript, or did it? can we trick the ai text detector into generated texts? the potential future of chatgpt and ai in sports & exercise medicine manuscript generation. \JournalTitleBMJ Open Sport & Exercise Medicine 9, DOI: 10.1136/bmjsem-2023-001568 (2023).
  36. The chatgpt conundrum: Human-generated scientific manuscripts misidentified as ai creations by ai text detection tool. \JournalTitleJournal of Pathology Informatics 14, 100342 (2023).
  37. NLM, N. L. o. M. National center of biotechnology information. Online: https://pubmed.ncbi.nlm.nih.gov/. Accessed on January 25, 2024.
  38. Wu, X. et al. Top 10 algorithms in data mining. \JournalTitleKnowledge and Information Systems 14, 1–37 (2008).
  39. Pedregosa, F. et al. Scikit-learn: Machine learning in Python. \JournalTitleJournal of Machine Learning Research 12, 2825–2830 (2011).
  40. Aizawa, A. An information-theoretic perspective of tf–idf measures. \JournalTitleInformation Processing & Management 39, 45–65 (2003).
  41. Text mining: use of tf-idf to examine the relevance of words to documents. \JournalTitleInternational Journal of Computer Applications 181, 25–29 (2018).
  42. Ramos, J. et al. Using tf-idf to determine word relevance in document queries. In Proceedings of the first instructional conference on machine learning, vol. 242,1, 29–48 (Citeseer, 2003).
  43. Knn with tf-idf based framework for text categorization. \JournalTitleProcedia Engineering 69, 1356–1364 (2014).
  44. Interpreting tf-idf term weights as making relevance decisions. \JournalTitleACM Transactions on Information Systems (TOIS) 26, 1–37 (2008).
  45. A comparative study of tf* idf, lsi and multi-words for text classification. \JournalTitleExpert systems with applications 38, 2758–2765 (2011).
  46. The use of bigrams to enhance text categorization. \JournalTitleInformation processing & management 38, 529–546 (2002).
  47. Bigrams of syntactic labels for authorship discrimination of short texts. \JournalTitleLiterary and Linguistic Computing 22, 405–417 (2007).
  48. Giant strongly connected component of directed networks. \JournalTitlePhysical Review E 64, 025101 (2001).
  49. Kitsak, M. et al. Stability of a giant connected component in a complex network. \JournalTitlePhysical Review E 97, 012309 (2018).
  50. Improving network robustness by edge modification. \JournalTitlePhysica A: Statistical Mechanics and its Applications 357, DOI: 10.1016/j.physa.2005.03.040 (2005).
  51. Network stability, connectivity and innovation output. \JournalTitleTechnological Forecasting and Social Change 114, DOI: 10.1016/j.techfore.2016.09.004 (2017).
  52. Bellingeri, M. et al. Link and node removal in real social networks: A review. \JournalTitleFrontiers in Physics 8, DOI: 10.3389/fphy.2020.00228 (2020).
  53. Large-scale bayesian logistic regression for text categorization. \JournalTitletechnometrics 49, 291–304 (2007).
  54. Feng, X. et al. Overfitting reduction of text classification based on adabelm. \JournalTitleEntropy 19, 330 (2017).
  55. Feature selection for text classification: A review. \JournalTitleMultimedia Tools and Applications 78, 3797–3816, DOI: 10.1007/s11042-018-6083-5 (2019).
  56. Optimal feature selection for imbalanced text classification. \JournalTitleIEEE Transactions on Artificial Intelligence 4, 135–147, DOI: 10.1109/TAI.2022.3144651 (2023).
  57. Conroy, G. How chatgpt and other ai tools could disrupt scientific publishing. \JournalTitleNature 622, 234–236 (2023).
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Ahmed Abdeen Hamed (6 papers)
  2. Xindong Wu (49 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.