Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Distinguishing Human Generated Text From ChatGPT Generated Text Using Machine Learning (2306.01761v1)

Published 26 May 2023 in cs.CL, cs.AI, and cs.LG

Abstract: ChatGPT is a conversational artificial intelligence that is a member of the generative pre-trained transformer of the LLM family. This text generative model was fine-tuned by both supervised learning and reinforcement learning so that it can produce text documents that seem to be written by natural intelligence. Although there are numerous advantages of this generative model, it comes with some reasonable concerns as well. This paper presents a machine learning-based solution that can identify the ChatGPT delivered text from the human written text along with the comparative analysis of a total of 11 machine learning and deep learning algorithms in the classification process. We have tested the proposed model on a Kaggle dataset consisting of 10,000 texts out of which 5,204 texts were written by humans and collected from news and social media. On the corpus generated by GPT-3.5, the proposed algorithm presents an accuracy of 77%.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Niful Islam (5 papers)
  2. Debopom Sutradhar (1 paper)
  3. Humaira Noor (1 paper)
  4. Jarin Tasnim Raya (2 papers)
  5. Monowara Tabassum Maisha (1 paper)
  6. Dewan Md Farid (31 papers)
Citations (16)