Fake News in Sheep's Clothing: Robust Fake News Detection Against LLM-Empowered Style Attacks (2310.10830v2)
Abstract: It is commonly perceived that fake news and real news exhibit distinct writing styles, such as the use of sensationalist versus objective language. However, we emphasize that style-related features can also be exploited for style-based attacks. Notably, the advent of powerful LLMs has empowered malicious actors to mimic the style of trustworthy news sources, doing so swiftly, cost-effectively, and at scale. Our analysis reveals that LLM-camouflaged fake news content significantly undermines the effectiveness of state-of-the-art text-based detectors (up to 38% decrease in F1 Score), implying a severe vulnerability to stylistic variations. To address this, we introduce SheepDog, a style-robust fake news detector that prioritizes content over style in determining news veracity. SheepDog achieves this resilience through (1) LLM-empowered news reframings that inject style diversity into the training process by customizing articles to match different styles; (2) a style-agnostic training scheme that ensures consistent veracity predictions across style-diverse reframings; and (3) content-focused veracity attributions that distill content-centric guidelines from LLMs for debunking fake news, offering supplementary cues and potential intepretability that assist veracity prediction. Extensive experiments on three real-world benchmarks demonstrate SheepDog's style robustness and adaptability to various backbones.
- Sentiment Aware Fake News Detection on Online Social Networks. In ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2507–2511.
- Defining and Measuring News Media Quality: Comparing the Content Perspective and the Audience Perspective. The International Journal of Press/Politics 27 (03 2021).
- Language Models are Few-Shot Learners. In Advances in Neural Information Processing Systems, Vol. 33. 1877–1901.
- Canyu Chen and Kai Shu. 2023. Can LLM-Generated Misinformation Be Detected? arXiv:2309.13788
- DETERRENT: Knowledge Guided Graph Attention Network for Detecting Healthcare Misinformation. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 492–502.
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL. 4171–4186.
- KAN: Knowledge-aware Attention Network for Fake News Detection. Proceedings of the AAAI Conference on Artificial Intelligence 35, 1 (2021), 81–89.
- What label should be applied to content produced by generative AI? https://doi.org/10.31234/osf.io/v4mfz
- PETGEN: Personalized Text Generation Attack on Deep Sequence Embedding-Based Classification Models. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 575–584.
- DeBERTa: Decoding-enhanced BERT with Disentangled Attention. In International Conference on Learning Representations.
- Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning. arXiv:2305.19523 [cs.LG]
- Stochastic Parrots Looking for Stochastic Parrots: LLMs are Easy to Fine-Tune and Hard to Detect with other LLMs. arXiv:2304.08968 [cs.CL]
- Nolan Higdon. 2020. What is Fake News? A Foundational Question for Developing Effective Critical News Literacy Education. Democratic Communiqué 279 (03 2020). Issue 1.
- Robust Fake News Detection Over Time and Attack. ACM Trans. Intell. Syst. Technol. 11, 1, Article 7 (2019), 23 pages.
- Learn over Past, Evolve for Future: Forecasting Temporal Trends for Fake News Detection. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track). 116–125.
- Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2225–2240.
- Jie Huang and Kevin Chen-Chuan Chang. 2023. Towards Reasoning in Large Language Models: A Survey. In Findings of the Association for Computational Linguistics: ACL 2023. 1049–1065.
- Faking Fake News for Real Fake News Detection: Propaganda-Loaded Training Data Generation. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 14571–14589.
- How Vulnerable Are Automatic Fake News Detection Methods to Adversarial Attacks? arXiv:2107.07970 [cs.CL]
- MALCOM: Generating Malicious Comments to Attack Neural Fake News Detection Models. In 2020 IEEE International Conference on Data Mining (ICDM). 282–291.
- RoBERTa: A Robustly Optimized BERT Pretraining Approach. arXiv:1907.11692 [cs.CL]
- Yi-Ju Lu and Cheng-Te Li. 2020. GCAN: Graph-aware Co-Attention Networks for Explainable Fake News Detection on Social Media. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 505–514.
- Interpretable and Effective Reinforcement Learning for Attacking against Graph-based Rumor Detection. In 2023 International Joint Conference on Neural Networks (IJCNN). 1–9.
- Sachit Menon and Carl Vondrick. 2023. Visual Classification via Description from Large Language Models. In The Eleventh International Conference on Learning Representations.
- Contrasting Linguistic Patterns in Human and LLM-Generated Text. arXiv:2308.09067 [cs.CL]
- MDFEND: Multi-domain fake news detection. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 3343–3347.
- Improving Fake News Detection of Influential Domain via Domain- and Instance-Level Transfer. In Proceedings of the 29th International Conference on Computational Linguistics. 2834–2848.
- FANG: Leveraging Social Context for Fake News Detection Using Graph Representation. In Proceedings of the 29th ACM International Conference on Information Knowledge Management. 1165–1174.
- OpenAI. 2022. ChatGPT: Optimizing language models for dialogue.
- OpenAI. 2023. GPT-4 Technical Report. arXiv:2303.08774 [cs.CL] https://arxiv.org/pdf/2303.08774.pdf
- Training language models to follow instructions with human feedback. In Advances in Neural Information Processing Systems, Vol. 35. Curran Associates, Inc., 27730–27744.
- Fact-Checking Complex Claims with Program-Guided Reasoning. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 6981–7004.
- The Surprising Performance of Simple Baselines for Misinformation Detection. In Proceedings of the Web Conference 2021. 3432–3441.
- A Stylometric Inquiry into Hyperpartisan and Fake News. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 231–240.
- Truth of Varying Shades: Analyzing Language in Fake News and Political Fact-Checking. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 2931–2937.
- CSI: A Hybrid Deep Model for Fake News Detection. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. 797–806.
- Timo Schick and Hinrich Schütze. 2021. Exploiting Cloze-Questions for Few-Shot Text Classification and Natural Language Inference. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. 255–269.
- Zoom Out and Observe: News Environment Perception for Fake News Detection. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 4543–4556.
- DEFEND: Explainable Fake News Detection. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 395–405.
- FakeNewsNet: A Data Repository with News Content, Social Context, and Spatiotemporal Information for Studying Fake News on Social Media. Big Data 8, 3 (2020), 171–188.
- Llama 2: Open Foundation and Fine-Tuned Chat Models. arXiv:2307.09288 [cs.CL]
- Do Sentence Interactions Matter? Leveraging Sentence Level Representations for Fake News Classification. In Proceedings of the Thirteenth Workshop on Graph-Based Methods for Natural Language Processing (TextGraphs-13). 134–139.
- Attacking Fake News Detectors via Manipulating News Social Engagement. In Proceedings of the ACM Web Conference 2023. 3978–3986.
- Emergent Abilities of Large Language Models. Transactions on Machine Learning Research (2022).
- Frank Wilcoxon. 1945. Individual Comparisons by Ranking Methods. Biometrics Bulletin 1 (1945), 80–83.
- Transformers: State-of-the-Art Natural Language Processing. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. 38–45.
- Jiaying Wu and Bryan Hooi. 2023. DECOR: Degree-Corrected Social Graph Refinement for Fake News Detection. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2582–2593.
- Unsupervised Data Augmentation for Consistency Training. In Advances in Neural Information Processing Systems, Vol. 33. 6256–6268.
- Defending Against Neural Fake News. In Advances in Neural Information Processing Systems, H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett (Eds.), Vol. 32. Curran Associates, Inc. https://proceedings.neurips.cc/paper_files/paper/2019/file/3e9f0fc9b2f89e043bc6233994dfcf76-Paper.pdf
- Mining Dual Emotion for Fake News Detection. In Proceedings of the Web Conference 2021. 3465–3476.
- Haoyi Zheng and Huichun Zhan. 2023. ChatGPT in Scientific Writing: A Cautionary Tale. The American Journal of Medicine 136, 8 (2023), 725–726.e6.
- SAFE: Similarity-Aware Multi-modal Fake News Detection. In Advances in Knowledge Discovery and Data Mining. 354–367.
- Fake News Detection via NLP is Vulnerable to Adversarial Attacks. In Proceedings of the 11th International Conference on Agents and Artificial Intelligence.
- Generalizing to the Future: Mitigating Entity Bias in Fake News Detection. In Proceedings of the 45nd International ACM SIGIR Conference on Research and Development in Information Retrieval. Association for Computing Machinery.
- Memory-Guided Multi-View Multi-Domain Fake News Detection. IEEE Transactions on Knowledge and Data Engineering (2022).
- Jiaying Wu (18 papers)
- Bryan Hooi (159 papers)
- Jiafeng Guo (161 papers)