Sharif-MGTD at SemEval-2024 Task 8: A Transformer-Based Approach to Detect Machine Generated Text (2407.11774v1)
Abstract: Detecting Machine-Generated Text (MGT) has emerged as a significant area of study within Natural Language Processing. When LLMs generate text, they often leave discernible traces, which can be scrutinized using either traditional feature-based methods or more advanced neural language models. In this research, we explore the effectiveness of fine-tuning a RoBERTa-base transformer, a powerful neural architecture, to address MGT detection as a binary classification task. Focusing specifically on Subtask A (Monolingual-English) within the SemEval-2024 competition framework, our proposed system achieves an accuracy of 78.9% on the test dataset, placing us 57th among the participants. Our study addresses this challenge under limited hardware resources, resulting in a system that excels at identifying human-written texts but struggles to accurately discern MGTs.
- Seyedeh Fatemeh Ebrahimi
- Karim Akhavan Azari
- Amirmasoud Iravani
- Arian Qazvini
- Pouya Sadeghi
- Zeinab Sadat Taghavi
- Hossein Sameti
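
As a rough illustration of the fine-tuning setup described in the abstract, the sketch below trains `roberta-base` as a binary human-vs-machine classifier with the Hugging Face `transformers` library. The tiny in-memory dataset, hyperparameters, and output directory are placeholder assumptions for illustration, not the authors' actual configuration or data.

```python
# Minimal sketch: fine-tuning RoBERTa-base for binary MGT detection.
# Dataset contents, hyperparameters, and paths below are illustrative assumptions.
from datasets import Dataset
from transformers import (
    AutoTokenizer,
    AutoModelForSequenceClassification,
    Trainer,
    TrainingArguments,
)

# Placeholder data: label 0 = human-written, 1 = machine-generated.
train_data = Dataset.from_dict({
    "text": [
        "I wrote this note to a friend after our trip last summer.",
        "The proposed method leverages synergistic paradigms to optimize outcomes.",
    ],
    "label": [0, 1],
})

tokenizer = AutoTokenizer.from_pretrained("roberta-base")
model = AutoModelForSequenceClassification.from_pretrained("roberta-base", num_labels=2)

def tokenize(batch):
    # Truncate/pad to RoBERTa's 512-token input limit.
    return tokenizer(batch["text"], truncation=True, padding="max_length", max_length=512)

train_data = train_data.map(tokenize, batched=True)

training_args = TrainingArguments(
    output_dir="mgt-roberta",        # assumed output directory
    num_train_epochs=3,              # assumed hyperparameters
    per_device_train_batch_size=8,   # small batch size for limited hardware
    learning_rate=2e-5,
)

trainer = Trainer(model=model, args=training_args, train_dataset=train_data)
trainer.train()

# Inference: classify a new text as human-written (0) or machine-generated (1).
inputs = tokenizer("Some unseen text to classify.", return_tensors="pt", truncation=True)
inputs = {k: v.to(model.device) for k, v in inputs.items()}
pred = model(**inputs).logits.argmax(dim=-1).item()
print("machine-generated" if pred == 1 else "human-written")
```

In a realistic run, the placeholder dataset would be replaced by the SemEval-2024 Task 8 Subtask A training split, and an evaluation set would be passed to the Trainer to monitor accuracy during fine-tuning.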