A Comprehensive Low and High-level Feature Analysis for Early Rumor Detection on Twitter (1711.00726v3)
Abstract: Recent work have done a good job in modeling rumors and detecting them over microblog streams. However, the performance of their automatic approaches are not relatively high when looking early in the diffusion. A first intuition is that, at early stage, most of the aggregated rumor features (e.g., propagation features) are not mature and distinctive enough. The objective of rumor debunking in microblogs, however, are to detect these misinformation as early as possible. In this work, we leverage neural models in learning the hidden representations of individual rumor-related tweets at the very beginning of a rumor. Our extensive experiments show that the resulting signal improves our classification performance over time, significantly within the first 10 hours. To deepen the understanding of these low and high-level features in contributing to the model performance over time, we conduct an extensive study on a wide range of high impact rumor features for the 48 hours range. The end model that engages these features are shown to be competitive, reaches over 90% accuracy and out-performs strong baselines in our carefully cured dataset.
- The power of a good idea: Quantitative modeling of the spread of ideas from epidemiological models. Physica A, 364, 2006.
- Information credibility on twitter. In Proceedings of WWW, pages 675–684. ACM, 2011.
- Rumor cascades. 2014.
- Tweetcred: Real-time credibility assessment of content on twitter. In SocInfo. Springer, 2014.
- S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural computation, 9(8):1735–1780, 1997.
- Epidemiological modeling of news and rumors on twitter. In Proceedings of SNA-KDD, 2013.
- Bag of tricks for efficient text classification. arXiv preprint arXiv:1607.01759, 2016.
- Y. Kim. Convolutional neural networks for sentence classification. arXiv preprint arXiv:1408.5882, 2014.
- Prominent features of rumor propagation in online social media. In Proceedings of ICDM, 2013.
- Real-time rumor debunking on twitter. In Proceedings of CIKM, pages 1867–1870. ACM, 2015.
- Detecting rumors from microblogs with recurrent neural networks.
- Detect rumors using time series of social context information on microblogging websites. In Proceedings of CIKM, 2015.
- Rise and fall patterns of information diffusion. In KDD.
- Building a large-scale corpus for evaluating event detection on twitter. In Proceedings of CIKM, 2013.
- Degeneracy-based real-time sub-event detection in twitter stream. In Proceedings of ICWSM, 2015.
- Twitter under crisis: can we trust what we rt? In Proceedings of the first workshop on social media analytics, pages 71–79. ACM, 2010.
- On early-stage debunking rumors on twitter: Leveraging the wisdom of weak learners. CoRR, abs/1709.04402, 2017.
- Rumor has it: Identifying misinformation in microblogs. In Proceedings of EMNLP, 2011.
- Rumors, false flags, and digital vigilantes: Misinformation on twitter after the 2013 boston marathon bombing. iConference 2014 Proceedings, 2014.
- False rumors detection on sina weibo by propagation structures. In Proceedings of ICDE, pages 651–662. IEEE, 2015.
- Automatic detection of rumor on sina weibo. In Proceedings of MDS. ACM, 2012.
- Enquiring minds: Early detection of rumors in social media from enquiry posts. In Proceedings of WWW, 2015.
- A c-lstm neural network for text classification. arXiv preprint arXiv:1511.08630, 2015.