DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection (2402.10426v2)
Abstract: LLMs are limited by challenges in factuality and hallucinations to be directly employed off-the-shelf for judging the veracity of news articles, where factual accuracy is paramount. In this work, we propose DELL that identifies three key stages in misinformation detection where LLMs could be incorporated as part of the pipeline: 1) LLMs could \emph{generate news reactions} to represent diverse perspectives and simulate user-news interaction networks; 2) LLMs could \emph{generate explanations} for proxy tasks (e.g., sentiment, stance) to enrich the contexts of news articles and produce experts specializing in various aspects of news understanding; 3) LLMs could \emph{merge task-specific experts} and provide an overall prediction by incorporating the predictions and confidence scores of varying experts. Extensive experiments on seven datasets with three LLMs demonstrate that DELL outperforms state-of-the-art baselines by up to 16.8\% in macro f1-score. Further analysis reveals that the generated reactions and explanations are greatly helpful in misinformation detection, while our proposed LLM-guided expert merging helps produce better-calibrated predictions.
- Out of one, many: Using language models to simulate human samples. Political Analysis, 31(3):337–351.
- Self-rag: Learning to retrieve, generate, and critique through self-reflection. arXiv preprint arXiv:2310.11511.
- On the dangers of stochastic parrots: Can language models be too big? In FAccT ’21: 2021 ACM Conference on Fairness, Accountability, and Transparency, Virtual Event / Toronto, Canada, March 3-10, 2021, pages 610–623. ACM.
- Twitter-comms: Detecting climate, covid, and military multimodal misinformation. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 1530–1549.
- Rumor detection on social media with bi-directional graph convolutional networks. In Proceedings of the AAAI conference on artificial intelligence, volume 34, pages 549–556.
- Improving language models by retrieving from trillions of tokens. In International conference on machine learning, pages 2206–2240. PMLR.
- Cody Buntain and Jennifer Golbeck. 2017. Automatically identifying fake news in popular twitter threads. In 2017 IEEE International Conference on Smart Cloud (SmartCloud), pages 208–215. IEEE.
- The media frames corpus: Annotations of frames across issues. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), pages 438–444.
- The media frames corpus: Annotations of frames across issues. In Proceedings of ACL.
- Beyond detection: A defend-and-summarize strategy for robust and interpretable rumor analysis on social media. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 11538–11556.
- Canyu Chen and Kai Shu. 2023. Can llm-generated misinformation be detected? arXiv preprint arXiv:2309.13788.
- Can large language models understand content and propagation for misinformation detection: An empirical study. arXiv preprint arXiv:2311.12699.
- Dense x retrieval: What retrieval granularity should we use? arXiv preprint arXiv:2312.06648.
- Exploring the potential of large language models (llms) in learning on graphs. arXiv preprint arXiv:2307.03393.
- Causal intervention and counterfactual reasoning for multi-modal fake news detection. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 627–638.
- Cheng-Han Chiang and Hung-yi Lee. 2023. Can large language models be an alternative to human evaluations? arXiv preprint arXiv:2305.01937.
- Eun Cheol Choi and Emilio Ferrara. 2023. Automated claim matching with large language models: Empowering fact-checkers in the fight against misinformation. arXiv preprint arXiv:2310.09223.
- Editing factual knowledge in language models. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 6491–6506.
- A survey of natural language generation. ACM Computing Surveys, 55(8):1–38.
- User preference-aware fake news detection. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 2051–2055.
- Quantifying and attributing the hallucination of large language models via association analysis. arXiv preprint arXiv:2309.05217.
- Kan: Knowledge-aware attention network for fake news detection. In Proceedings of the AAAI conference on artificial intelligence, volume 35, pages 81–89.
- Paul Ekman et al. 1999. Basic emotions. Handbook of cognition and emotion, 98(45-60):16.
- Robert Entman. 1993. Framing: Toward clarification of a fractured paradigm. The Journal of Communication, 43:51–58.
- Knowledge solver: Teaching llms to search for domain knowledge from knowledge graphs. arXiv preprint arXiv:2309.03118.
- From pretraining data to language models to downstream tasks: Tracking the trails of political biases leading to unfair NLP models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2023, Toronto, Canada, July 9-14, 2023, pages 11737–11762. Association for Computational Linguistics.
- Cook: Empowering general-purpose language models with modular and collaborative knowledge. arXiv preprint arXiv:2305.09955.
- Knowledge card: Filling llms’ knowledge gaps with plug-in specialized language models.
- Don’t hallucinate, abstain: Identifying llm knowledge gaps via multi-llm collaboration. arXiv preprint arXiv:2402.00367.
- KALM: knowledge-aware integration of local, document, and global contexts for long document understanding. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2023, Toronto, Canada, July 9-14, 2023, pages 2116–2138. Association for Computational Linguistics.
- Infosurgeon: Cross-media fine-grained information consistency checking for fake news detection. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 1683–1698.
- Misinfo reaction frames: Reasoning about readers’ reactions to news headlines. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 3108–3127.
- News and political information consumption in mexico: Mapping the 2018 mexican presidential election on twitter and facebook.
- Generative language models and automated influence operations: Emerging threats and potential mitigations. arXiv preprint arXiv:2301.04246.
- MDP P Goonathilake and PPN V Kumara. 2020. Cnn, rnn-lstm based hybrid approach to detect state-of-the-art stance-based fake news on social media. In 2020 20th International Conference on Advances in ICT for Emerging Regions (ICTer), pages 23–28. IEEE.
- Public wisdom matters! discourse-aware hyperbolic fourier co-attention for social text classification. Advances in Neural Information Processing Systems, 35:9417–9431.
- On calibration of modern neural networks. In International conference on machine learning, pages 1321–1330. PMLR.
- Philipp Hartl and Udo Kruschwitz. 2022. Applying automatic text summarization for fake news detection. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 2702–2713.
- Do language models have beliefs? methods for detecting, updating, and visualizing model beliefs. arXiv preprint arXiv:2111.13654.
- Reinforcement learning-based counter-misinformation response generation: a case study of covid-19 vaccine misinformation. In Proceedings of the ACM Web Conference 2023, pages 2698–2709.
- Deberta: decoding-enhanced bert with disentangled attention. In 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021. OpenReview.net.
- Harnessing explanations: Llm-to-lm interpreter for enhanced text-attributed graph representation learning. arXiv preprint arXiv:2305.19523.
- Bert model for fake news detection based on social bot activities in the covid-19 pandemic. In 2021 IEEE 12th Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON), pages 0103–0109. IEEE.
- Bad actor, good advisor: Exploring the role of large language models in fake news detection. arXiv preprint arXiv:2309.12247.
- Compare to the knowledge: Graph neural fake news detection with external knowledge. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 754–763.
- Can llms effectively leverage graph structural information: when and why. arXiv preprint arXiv:2309.16595.
- Faking fake news for real fake news detection: Propaganda-loaded training data generation. arXiv preprint arXiv:2203.05386.
- Faking fake news for real fake news detection: Propaganda-loaded training data generation. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2023, Toronto, Canada, July 9-14, 2023, pages 14571–14589. Association for Computational Linguistics.
- Survey of hallucination in natural language generation. ACM Computing Surveys, 55(12):1–38.
- Mistral 7b. arXiv preprint arXiv:2310.06825.
- Disinformation detection: An evolving challenge in the age of llms. arXiv preprint arXiv:2309.15847.
- Raucg: Retrieval-augmented unsupervised counter narrative generation for hate speech. arXiv preprint arXiv:2310.05650.
- On transferability of bias mitigation effects in language model fine-tuning. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 3770–3783.
- Towards fine-grained reasoning for fake news detection. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, pages 5746–5754.
- Caution: Rumors ahead—a case study on the debunking of false information on twitter. Big Data & Society, 7(2):2053951720980127.
- Large language models struggle to learn long-tail knowledge. In International Conference on Machine Learning, pages 15696–15707. PMLR.
- Covid-19 vaccine misinformation in middle income countries. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 3903–3915.
- Prometheus: Inducing fine-grained evaluation capability in language models. arXiv preprint arXiv:2310.08491.
- Thomas N. Kipf and Max Welling. 2017. Semi-supervised classification with graph convolutional networks. In 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24-26, 2017, Conference Track Proceedings. OpenReview.net.
- Evaluating the factual consistency of abstractive text summarization. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 9332–9346.
- A systematic media frame analysis of 1.5 million new york times articles from 2000 to 2017. In Proceedings of the 12th ACM Conference on Web Science, pages 305–314.
- Detecting misinformation with llm-predicted credibility signals and weak supervision. arXiv preprint arXiv:2309.07601.
- A survey of graph meets large language model: Progress and future directions. arXiv preprint arXiv:2311.12399.
- A revisit of fake news dataset with augmented fact-checking by chatgpt. arXiv preprint arXiv:2312.11870.
- Zero-shot rumor detection with propagation structure via prompt learning. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 37, pages 5213–5221.
- Interpretable multimodal misinformation detection with logic reasoning. In Findings of the Association for Computational Linguistics: ACL 2023, Toronto, Canada, July 9-14, 2023, pages 9781–9796. Association for Computational Linguistics.
- Yi-Ju Lu and Cheng-Te Li. 2020. Gcan: Graph-aware co-attention networks for explainable fake news detection on social media. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 505–514.
- Fighting fire with fire: The dual role of llms in crafting and detecting elusive disinformation. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 14279–14305.
- Chain of history: Learning and forecasting with llms for temporal knowledge graph completion. arXiv preprint arXiv:2401.06072.
- Propagation structure fusion for rumor detection based on node-level contrastive learning. IEEE Transactions on Neural Networks and Learning Systems.
- Kapalm: Knowledge graph enhanced language models for fake news detection. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 3999–4009.
- Rumor detection on twitter with tree-structured recursive neural networks. Association for Computational Linguistics.
- When not to trust language models: Investigating effectiveness of parametric and non-parametric memories. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 9802–9822.
- Semeval-2020 task 11: Detection of propaganda techniques in news articles (version semeval-2020). https://doi.org/10.5281/zenodo.3952415. Accessed on YYYY-MM-DD.
- Tackling fake news detection by continually improving social context representations using graph neural networks. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1363–1380.
- Modeling framing in immigration discourse on social media. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 2219–2263.
- Human-in-the-loop evaluation for early misinformation detection: A case study of COVID-19 treatments. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2023, Toronto, Canada, July 9-14, 2023, pages 15817–15835. Association for Computational Linguistics.
- The role of the crowd in countering misinformation: A case study of the covid-19 infodemic. In 2020 IEEE international Conference on big data (big data), pages 748–757. IEEE.
- Erxue Min and Sophia Ananiadou. 2023. Pesto: a post-user fusion network for rumour detection on social media. In Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, Sentiment, & Social Media Analysis, pages 1–10.
- Using llm for improving key event discovery: Temporal-guided news stream clustering with event summaries. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 4162–4173.
- Fang: Leveraging social context for fake news detection using graph representation. In Proceedings of the 29th ACM international conference on information & knowledge management, pages 1165–1174.
- Training language models to follow instructions with human feedback. Advances in Neural Information Processing Systems, 35:27730–27744.
- Understanding factuality in abstractive summarization with frank: A benchmark for factuality metrics. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 4812–4829.
- Large language models and knowledge graphs: Opportunities and challenges. TGDK, 1(1):2:1–2:38.
- On the risk of misinformation pollution with large language models. In Findings of the Association for Computational Linguistics: EMNLP 2023, Singapore, December 6-10, 2023, pages 1389–1403. Association for Computational Linguistics.
- The surprising performance of simple baselines for misinformation detection. In Proceedings of the Web Conference 2021, pages 3432–3441.
- Towards reliable misinformation mitigation: Generalization, uncertainty, and GPT-4. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, EMNLP 2023, Singapore, December 6-10, 2023, pages 6399–6429. Association for Computational Linguistics.
- Fake news detection: A survey of graph neural network methods. Applied Soft Computing, page 110235.
- Semeval-2023 task 3: Detecting the category, the framing, and the persuasion techniques in online news in a multi-lingual setup. In Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 2343–2361.
- Probing llms for hate speech detection: strengths and vulnerabilities. arXiv preprint arXiv:2310.12860.
- Learning to retrieve prompts for in-context learning. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 2655–2671.
- Countering misinformation via emotional response generation. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 11476–11492.
- Fake news detection using stance extracted multimodal fusion-based hybrid neural network. IEEE Transactions on Computational Social Systems.
- On second thought, let’s not think step by step! bias and toxicity in zero-shot reasoning. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2023, Toronto, Canada, July 9-14, 2023, pages 4454–4470. Association for Computational Linguistics.
- Zoom out and observe: News environment perception for fake news detection. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 4543–4556.
- Replug: Retrieval-augmented black-box language models. arXiv preprint arXiv:2301.12652.
- Iftekharul Islam Shovon and Seokjoo Shin. 2023. The performance of graph neural network in detecting fake news from social media feeds. In 2023 International Conference on Information Networking (ICOIN), pages 560–564. IEEE.
- defend: Explainable fake news detection. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, pages 395–405.
- The role of user profiles for fake news detection. In Proceedings of the 2019 IEEE/ACM international conference on advances in social networks analysis and mining, pages 436–439.
- Value kaleidoscope: Engaging ai with pluralistic human values, rights, and duties. arXiv preprint arXiv:2309.00779.
- Harald Stiff and Fredrik Johansson. 2022. Detecting computer-generated disinformation. International Journal of Data Science and Analytics, 13(4):363–383.
- Zlpr: A novel loss for multi-label classification. arXiv preprint arXiv:2208.02955.
- Adapting fake news detection to the era of large language models. arXiv preprint arXiv:2311.04917.
- Fake news detectors are biased against texts generated by large language models. arXiv preprint arXiv:2309.08674.
- From chaos to clarity: Claim normalization to empower fact-checking. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 6594–6609.
- Not all fake news is written: A dataset and analysis of misleading video headlines. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, EMNLP 2023, Singapore, December 6-10, 2023, pages 16241–16258. Association for Computational Linguistics.
- Just ask for calibration: Strategies for eliciting calibrated confidence scores from language models fine-tuned with human feedback. arXiv preprint arXiv:2305.14975.
- Duck: Rumour detection on social media by modelling user and comment propagation networks. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 4939–4949.
- Llama 2: Open foundation and fine-tuned chat models. arXiv preprint arXiv:2307.09288.
- Jiaying Wu and Bryan Hooi. 2023a. Decor: Degree-corrected social graph refinement for fake news detection. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pages 2582–2593.
- Jiaying Wu and Bryan Hooi. 2023b. Fake news in sheep’s clothing: Robust fake news detection against llm-empowered style attacks. arXiv preprint arXiv:2310.10830.
- Cross-document misinformation detection based on event graph reasoning. In 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2022, pages 543–558. Association for Computational Linguistics (ACL).
- Leveraging contrastive learning and knowledge distillation for incomplete modality rumor detection. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 13492–13503.
- How powerful are graph neural networks? In 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019. OpenReview.net.
- The earth is flat because…: Investigating llms’ belief towards misinformation via persuasive conversation. arXiv preprint arXiv:2312.09085.
- Evidence-aware fake news detection with graph neural networks. In Proceedings of the ACM Web Conference 2022, pages 2501–2510.
- Rumor detection on social media with crowd intelligence and chatgpt-assisted networks. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 5705–5717.
- WSDMS: debunk fake news via weakly supervised detection of misinforming sentences with contextualized social wisdom. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, EMNLP 2023, Singapore, December 6-10, 2023, pages 1525–1538. Association for Computational Linguistics.
- Rumor detection on social media with graph structured adversarial learning. In Proceedings of the twenty-ninth international conference on international joint conferences on artificial intelligence, pages 1417–1423.
- Metaadapt: Domain adaptive few-shot misinformation detection via meta learning. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2023, Toronto, Canada, July 9-14, 2023, pages 5223–5239. Association for Computational Linguistics.
- Defending against neural fake news. Advances in neural information processing systems, 32.
- Fengzhu Zeng and Wei Gao. 2022. Early rumor detection using neural hawkes process with a new benchmark dataset. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 4105–4117.
- Bayesian graph local extrema convolution with long-tail strategy for misinformation detection. ACM Transactions on Knowledge Discovery from Data.
- Mining dual emotion for fake news detection. In Proceedings of the web conference 2021, pages 3465–3476.
- Thrust: Adaptively propels large language models with external knowledge. arXiv preprint arXiv:2307.10442.
- Herun Wan (15 papers)
- Shangbin Feng (53 papers)
- Zhaoxuan Tan (35 papers)
- Heng Wang (136 papers)
- Yulia Tsvetkov (142 papers)
- Minnan Luo (61 papers)