Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Fake News in Sheep's Clothing: Robust Fake News Detection Against LLM-Empowered Style Attacks (2310.10830v2)

Published 16 Oct 2023 in cs.CL

Abstract: It is commonly perceived that fake news and real news exhibit distinct writing styles, such as the use of sensationalist versus objective language. However, we emphasize that style-related features can also be exploited for style-based attacks. Notably, the advent of powerful LLMs has empowered malicious actors to mimic the style of trustworthy news sources, doing so swiftly, cost-effectively, and at scale. Our analysis reveals that LLM-camouflaged fake news content significantly undermines the effectiveness of state-of-the-art text-based detectors (up to 38% decrease in F1 Score), implying a severe vulnerability to stylistic variations. To address this, we introduce SheepDog, a style-robust fake news detector that prioritizes content over style in determining news veracity. SheepDog achieves this resilience through (1) LLM-empowered news reframings that inject style diversity into the training process by customizing articles to match different styles; (2) a style-agnostic training scheme that ensures consistent veracity predictions across style-diverse reframings; and (3) content-focused veracity attributions that distill content-centric guidelines from LLMs for debunking fake news, offering supplementary cues and potential intepretability that assist veracity prediction. Extensive experiments on three real-world benchmarks demonstrate SheepDog's style robustness and adaptability to various backbones.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (55)
  1. Sentiment Aware Fake News Detection on Online Social Networks. In ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). 2507–2511.
  2. Defining and Measuring News Media Quality: Comparing the Content Perspective and the Audience Perspective. The International Journal of Press/Politics 27 (03 2021).
  3. Language Models are Few-Shot Learners. In Advances in Neural Information Processing Systems, Vol. 33. 1877–1901.
  4. Canyu Chen and Kai Shu. 2023. Can LLM-Generated Misinformation Be Detected? arXiv:2309.13788
  5. DETERRENT: Knowledge Guided Graph Attention Network for Detecting Healthcare Misinformation. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 492–502.
  6. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In NAACL. 4171–4186.
  7. KAN: Knowledge-aware Attention Network for Fake News Detection. Proceedings of the AAAI Conference on Artificial Intelligence 35, 1 (2021), 81–89.
  8. What label should be applied to content produced by generative AI? https://doi.org/10.31234/osf.io/v4mfz
  9. PETGEN: Personalized Text Generation Attack on Deep Sequence Embedding-Based Classification Models. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 575–584.
  10. DeBERTa: Decoding-enhanced BERT with Disentangled Attention. In International Conference on Learning Representations.
  11. Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning. arXiv:2305.19523 [cs.LG]
  12. Stochastic Parrots Looking for Stochastic Parrots: LLMs are Easy to Fine-Tune and Hard to Detect with other LLMs. arXiv:2304.08968 [cs.CL]
  13. Nolan Higdon. 2020. What is Fake News? A Foundational Question for Developing Effective Critical News Literacy Education. Democratic Communiqué 279 (03 2020). Issue 1.
  14. Robust Fake News Detection Over Time and Attack. ACM Trans. Intell. Syst. Technol. 11, 1, Article 7 (2019), 23 pages.
  15. Learn over Past, Evolve for Future: Forecasting Temporal Trends for Fake News Detection. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track). 116–125.
  16. Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2225–2240.
  17. Jie Huang and Kevin Chen-Chuan Chang. 2023. Towards Reasoning in Large Language Models: A Survey. In Findings of the Association for Computational Linguistics: ACL 2023. 1049–1065.
  18. Faking Fake News for Real Fake News Detection: Propaganda-Loaded Training Data Generation. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 14571–14589.
  19. How Vulnerable Are Automatic Fake News Detection Methods to Adversarial Attacks? arXiv:2107.07970 [cs.CL]
  20. MALCOM: Generating Malicious Comments to Attack Neural Fake News Detection Models. In 2020 IEEE International Conference on Data Mining (ICDM). 282–291.
  21. RoBERTa: A Robustly Optimized BERT Pretraining Approach. arXiv:1907.11692 [cs.CL]
  22. Yi-Ju Lu and Cheng-Te Li. 2020. GCAN: Graph-aware Co-Attention Networks for Explainable Fake News Detection on Social Media. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 505–514.
  23. Interpretable and Effective Reinforcement Learning for Attacking against Graph-based Rumor Detection. In 2023 International Joint Conference on Neural Networks (IJCNN). 1–9.
  24. Sachit Menon and Carl Vondrick. 2023. Visual Classification via Description from Large Language Models. In The Eleventh International Conference on Learning Representations.
  25. Contrasting Linguistic Patterns in Human and LLM-Generated Text. arXiv:2308.09067 [cs.CL]
  26. MDFEND: Multi-domain fake news detection. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 3343–3347.
  27. Improving Fake News Detection of Influential Domain via Domain- and Instance-Level Transfer. In Proceedings of the 29th International Conference on Computational Linguistics. 2834–2848.
  28. FANG: Leveraging Social Context for Fake News Detection Using Graph Representation. In Proceedings of the 29th ACM International Conference on Information Knowledge Management. 1165–1174.
  29. OpenAI. 2022. ChatGPT: Optimizing language models for dialogue.
  30. OpenAI. 2023. GPT-4 Technical Report. arXiv:2303.08774 [cs.CL] https://arxiv.org/pdf/2303.08774.pdf
  31. Training language models to follow instructions with human feedback. In Advances in Neural Information Processing Systems, Vol. 35. Curran Associates, Inc., 27730–27744.
  32. Fact-Checking Complex Claims with Program-Guided Reasoning. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 6981–7004.
  33. The Surprising Performance of Simple Baselines for Misinformation Detection. In Proceedings of the Web Conference 2021. 3432–3441.
  34. A Stylometric Inquiry into Hyperpartisan and Fake News. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 231–240.
  35. Truth of Varying Shades: Analyzing Language in Fake News and Political Fact-Checking. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 2931–2937.
  36. CSI: A Hybrid Deep Model for Fake News Detection. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. 797–806.
  37. Timo Schick and Hinrich Schütze. 2021. Exploiting Cloze-Questions for Few-Shot Text Classification and Natural Language Inference. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. 255–269.
  38. Zoom Out and Observe: News Environment Perception for Fake News Detection. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 4543–4556.
  39. DEFEND: Explainable Fake News Detection. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 395–405.
  40. FakeNewsNet: A Data Repository with News Content, Social Context, and Spatiotemporal Information for Studying Fake News on Social Media. Big Data 8, 3 (2020), 171–188.
  41. Llama 2: Open Foundation and Fine-Tuned Chat Models. arXiv:2307.09288 [cs.CL]
  42. Do Sentence Interactions Matter? Leveraging Sentence Level Representations for Fake News Classification. In Proceedings of the Thirteenth Workshop on Graph-Based Methods for Natural Language Processing (TextGraphs-13). 134–139.
  43. Attacking Fake News Detectors via Manipulating News Social Engagement. In Proceedings of the ACM Web Conference 2023. 3978–3986.
  44. Emergent Abilities of Large Language Models. Transactions on Machine Learning Research (2022).
  45. Frank Wilcoxon. 1945. Individual Comparisons by Ranking Methods. Biometrics Bulletin 1 (1945), 80–83.
  46. Transformers: State-of-the-Art Natural Language Processing. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. 38–45.
  47. Jiaying Wu and Bryan Hooi. 2023. DECOR: Degree-Corrected Social Graph Refinement for Fake News Detection. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2582–2593.
  48. Unsupervised Data Augmentation for Consistency Training. In Advances in Neural Information Processing Systems, Vol. 33. 6256–6268.
  49. Defending Against Neural Fake News. In Advances in Neural Information Processing Systems, H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett (Eds.), Vol. 32. Curran Associates, Inc. https://proceedings.neurips.cc/paper_files/paper/2019/file/3e9f0fc9b2f89e043bc6233994dfcf76-Paper.pdf
  50. Mining Dual Emotion for Fake News Detection. In Proceedings of the Web Conference 2021. 3465–3476.
  51. Haoyi Zheng and Huichun Zhan. 2023. ChatGPT in Scientific Writing: A Cautionary Tale. The American Journal of Medicine 136, 8 (2023), 725–726.e6.
  52. SAFE: Similarity-Aware Multi-modal Fake News Detection. In Advances in Knowledge Discovery and Data Mining. 354–367.
  53. Fake News Detection via NLP is Vulnerable to Adversarial Attacks. In Proceedings of the 11th International Conference on Agents and Artificial Intelligence.
  54. Generalizing to the Future: Mitigating Entity Bias in Fake News Detection. In Proceedings of the 45nd International ACM SIGIR Conference on Research and Development in Information Retrieval. Association for Computing Machinery.
  55. Memory-Guided Multi-View Multi-Domain Fake News Detection. IEEE Transactions on Knowledge and Data Engineering (2022).
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Jiaying Wu (18 papers)
  2. Bryan Hooi (159 papers)
  3. Jiafeng Guo (161 papers)
Citations (33)

Summary

We haven't generated a summary for this paper yet.