Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Monant Medical Misinformation Dataset: Mapping Articles to Fact-Checked Claims (2204.12294v1)

Published 26 Apr 2022 in cs.CL, cs.CY, cs.IR, and cs.LG

Abstract: False information has a significant negative influence on individuals as well as on the whole society. Especially in the current COVID-19 era, we witness an unprecedented growth of medical misinformation. To help tackle this problem with machine learning approaches, we are publishing a feature-rich dataset of approx. 317k medical news articles/blogs and 3.5k fact-checked claims. It also contains 573 manually and more than 51k automatically labelled mappings between claims and articles. Mappings consist of claim presence, i.e., whether a claim is contained in a given article, and article stance towards the claim. We provide several baselines for these two tasks and evaluate them on the manually labelled part of the dataset. The dataset enables a number of additional tasks related to medical misinformation, such as misinformation characterisation studies or studies of misinformation diffusion between sources.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Ivan Srba (28 papers)
  2. Branislav Pecher (12 papers)
  3. Matus Tomlein (4 papers)
  4. Robert Moro (22 papers)
  5. Elena Stefancova (23 papers)
  6. Jakub Simko (18 papers)
  7. Maria Bielikova (27 papers)
Citations (15)