NELA-GT-2019: A Large Multi-Labelled News Dataset for The Study of Misinformation in News Articles (2003.08444v2)
Published 18 Mar 2020 in cs.CY
Abstract: In this paper, we present an updated version of the NELA-GT-2018 dataset (Nørregaard, Horne, and Adalı 2019), entitled NELA-GT-2019. NELA-GT-2019 contains 1.12M news articles from 260 sources collected between January 1st 2019 and December 31st 2019. Just as with NELA-GT-2018, these sources come from a wide range of mainstream news sources and alternative news sources. Included with the dataset are source-level ground truth labels from 7 different assessment sites covering multiple dimensions of veracity. The NELA-GT-2019 dataset can be found at: https://doi.org/10.7910/DVN/O7FWPO
- Maurício Gruppi
- Benjamin D. Horne
- Sibel Adalı