Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Susceptibility of Communities against Low-Credibility Content in Social News Websites (2403.10705v1)

Published 15 Mar 2024 in cs.SI

Abstract: Social news websites, such as Reddit, have evolved into prominent platforms for sharing and discussing news. A key issue on social news websites sites is the formation of echo chambers, which often lead to the spread of highly biased or uncredible news. We develop a method to identify communities within a social news website that are prone to uncredible or highly biased news. We employ a user embedding pipeline that detects user communities based on their stances towards posts and news sources. We then project each community onto a credibility-bias space and analyze the distributional characteristics of each projected community to identify those that have a high risk of adopting beliefs with low credibility or high bias. This approach also enables the prediction of individual users' susceptibility to low credibility content, based on their community affiliation. Our experiments show that latent space clusters effectively indicate the credibility and bias levels of their users, with significant differences observed across clusters -- a $34\%$ difference in the users' susceptibility to low-credibility content and a $8.3\%$ difference in the users' susceptibility to high political bias.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (37)
  1. Anon. 2023. Stance Detection with Fine-Tuned Large Language Models. OpenReview.
  2. Topic-Guided Sampling For Data-Efficient Multi-Domain Stance Detection. In Annual Meeting of the Association for Computational Linguistics, 13448–13464. ACL.
  3. Stance Detection with Bidirectional Conditional Encoding. In Conference on Empirical Methods in Natural Language Processing, 876–885. ACL.
  4. A large annotated corpus for learning natural language inference. In Conference on Empirical Methods in Natural Language Processing. ACM.
  5. Supervised Learning of Universal Sentence Representations from Natural Language Inference Data. In Conference on Empirical Methods in Natural Language Processing, 670–680. ACL.
  6. Simple, direct and efficient multi-way spectral clustering. Information and Inference: A Journal of the IMA, 8: 181–203.
  7. No echo in the chambers of political interactions on Reddit. Scientific reports, 11(1): 2818.
  8. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv:1810.04805.
  9. A Survey of User Profiling: State-of-the-Art, Challenges, and Solutions. IEEE Access, 7: 144907–144924.
  10. Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models. arXiv:2307.11224.
  11. A Survey on Stance Detection for Mis- and Disinformation Identification. In Findings of the Association for Computational Linguistics: NAACL 2022, 1259–1277. ACL.
  12. A Repository of Conversational Datasets. arXiv preprint arxiv:1904.06472.
  13. Bad Actor, Good Advisor: Exploring the Role of Large Language Models in Fake News Detection. arXiv:2309.12247.
  14. LoRA: Low-Rank Adaptation of Large Language Models. In International Conference on Learning Representations.
  15. Representation Learning for Text-level Discourse Parsing. In Toutanova, K.; and Wu, H., eds., Annual Meeting of the Association for Computational Linguistics, volume 1, 13–24. ACL.
  16. Skip-Thought Vectors. In Advances in Neural Information Processing Systems, volume 28. Curran Associates, Inc.
  17. Stance Detection: A Survey. ACM Computing Surveys, 53(1).
  18. Social Media and News Fact Sheet. Pew Research Center.
  19. RoBERTa: A Robustly Optimized BERT Pretraining Approach. arXiv:1907.11692.
  20. Improving (Dis)agreement Detection with Inductive Social Relation Information From Comment-Reply Interactions. arXiv:2302.03950.
  21. UMAP: Uniform Manifold Approximation and Projection. The Journal of Open Source Software, 3(29): 861.
  22. Fake News Detection on Social Media using Geometric Deep Learning. arXiv:1902.06673.
  23. Toward a standard approach for echo chamber detection: Reddit case study. Applied Sciences, 11(12): 5390.
  24. The Development and Psychometric Properties of LIWC2015.
  25. DEBAGREEMENT: A comment-reply dataset for (dis)agreement detection in online debates. In Neural Information Processing Systems.
  26. Rumor has it: Identifying Misinformation in Microblogs. In Conference on Empirical Methods in Natural Language Processing, 1589–1599.
  27. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. In Conference on Empirical Methods in Natural Language Processing. ACL.
  28. FACTOID: A New Dataset for Identifying Misinformation Spreaders and Political Bias. In Language Resources and Evaluation Conference, 3231–3241. European Language Resources Association.
  29. Understanding User Profiles on Social Media for Fake News Detection. In IEEE Conference on Multimedia Information Processing and Retrieval, 430–435.
  30. The Role of User Profiles for Fake News Detection. In ASONAM, 436–439. New York, NY, USA: ACM.
  31. Dropout: A Simple Way to Prevent Neural Networks from Overfitting. Journal of Machine Learning Research, 15: 1929–1958.
  32. Llama 2: Open Foundation and Fine-Tuned Chat Models. arXiv preprint arxiv:2307.09288.
  33. Attention is All you Need. In Guyon, I.; Luxburg, U. V.; Bengio, S.; Wallach, H.; Fergus, R.; Vishwanathan, S.; and Garnett, R., eds., Advances in Neural Information Processing Systems, volume 30. Curran Associates, Inc.
  34. A Comprehensive Survey on Graph Neural Networks. IEEE Transactions on Neural Networks and Learning Systems, 32(1): 4–24.
  35. Multiclass spectral clustering. In IEEE International Conference on Computer Vision, 313–319.
  36. Self-Tuning Spectral Clustering. In Saul, L.; Weiss, Y.; and Bottou, L., eds., Advances in Neural Information Processing Systems, volume 17. MIT Press.
  37. Fake News Early Detection: A Theory-Driven Model. Digital Threats, 1(2).
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Yigit Ege Bayiz (5 papers)
  2. Arash Amini (55 papers)
  3. Radu Marculescu (48 papers)
  4. Ufuk Topcu (287 papers)
Citations (2)