Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Classification Aware Neural Topic Model and its Application on a New COVID-19 Disinformation Corpus (2006.03354v2)

Published 5 Jun 2020 in cs.LG, cs.CL, cs.SI, and stat.ML

Abstract: The explosion of disinformation accompanying the COVID-19 pandemic has overloaded fact-checkers and media worldwide, and brought a new major challenge to government responses worldwide. Not only is disinformation creating confusion about medical science amongst citizens, but it is also amplifying distrust in policy makers and governments. To help tackle this, we developed computational methods to categorise COVID-19 disinformation. The COVID-19 disinformation categories could be used for a) focusing fact-checking efforts on the most damaging kinds of COVID-19 disinformation; b) guiding policy makers who are trying to deliver effective public health messages and counter effectively COVID-19 disinformation. This paper presents: 1) a corpus containing what is currently the largest available set of manually annotated COVID-19 disinformation categories; 2) a classification-aware neural topic model (CANTM) designed for COVID-19 disinformation category classification and topic discovery; 3) an extensive analysis of COVID-19 disinformation categories with respect to time, volume, false type, media type and origin source.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Xingyi Song (30 papers)
  2. Johann Petrak (4 papers)
  3. Ye Jiang (22 papers)
  4. Iknoor Singh (10 papers)
  5. Diana Maynard (12 papers)
  6. Kalina Bontcheva (64 papers)
Citations (19)