Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Eating Garlic Prevents COVID-19 Infection: Detecting Misinformation on the Arabic Content of Twitter (2101.05626v1)

Published 9 Jan 2021 in cs.IR, cs.CY, cs.LG, and cs.SI

Abstract: The rapid growth of social media content during the current pandemic provides useful tools for disseminating information which has also become a root for misinformation. Therefore, there is an urgent need for fact-checking and effective techniques for detecting misinformation in social media. In this work, we study the misinformation in the Arabic content of Twitter. We construct a large Arabic dataset related to COVID-19 misinformation and gold-annotate the tweets into two categories: misinformation or not. Then, we apply eight different traditional and deep machine learning models, with different features including word embeddings and word frequency. The word embedding models (\textsc{FastText} and word2vec) exploit more than two million Arabic tweets related to COVID-19. Experiments show that optimizing the area under the curve (AUC) improves the models' performance and the Extreme Gradient Boosting (XGBoost) presents the highest accuracy in detecting COVID-19 misinformation online.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Sarah Alqurashi (2 papers)
  2. Btool Hamoui (2 papers)
  3. Abdulaziz Alashaikh (4 papers)
  4. Ahmad Alhindi (2 papers)
  5. Eisa Alanazi (10 papers)
Citations (35)