
Efficacy of Utilizing Large Language Models to Detect Public Threat Posted Online (2401.02974v1)

Published 29 Dec 2023 in cs.CL, cs.AI, and cs.IR

Abstract: This paper examines the efficacy of using large language models (LLMs) to detect public threats posted online. Amid rising concerns over the spread of threatening rhetoric and advance notices of violence, automated content analysis techniques may aid in early identification and moderation. Custom data collection tools were developed to collect post titles from a popular Korean online community, yielding 500 non-threat examples and 20 threats. Several LLMs (GPT-3.5, GPT-4, PaLM) were prompted to classify individual posts as either "threat" or "safe." Statistical analysis found that all models demonstrated strong accuracy, passing chi-square goodness-of-fit tests for both threat and non-threat identification. GPT-4 performed best overall, with 97.9% non-threat accuracy and 100% threat accuracy. An affordability analysis also showed PaLM API pricing to be highly cost-efficient. The findings indicate that LLMs can effectively augment human content moderation at scale to help mitigate emerging online risks. However, biases, transparency, and ethical oversight remain vital considerations before real-world implementation.
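The abstract names two evaluation steps, per-class accuracy and a chi-square goodness-of-fit test, without giving the exact procedure. The following is a minimal Python sketch of how such an evaluation could be computed, not the authors' implementation. It assumes the test compares the counts of predicted labels against the counts of true labels over two categories (one degree of freedom). The function names and all counts in the usage lines are hypothetical illustrations, not results from the paper.

```python
import math


def per_class_accuracy(labels, predictions):
    """Return the fraction of correctly classified posts for each true label."""
    totals, correct = {}, {}
    for truth, predicted in zip(labels, predictions):
        totals[truth] = totals.get(truth, 0) + 1
        if truth == predicted:
            correct[truth] = correct.get(truth, 0) + 1
    return {label: correct.get(label, 0) / totals[label] for label in totals}


def chi_square_gof(observed, expected):
    """Chi-square goodness-of-fit for two categories (1 degree of freedom).

    Returns (statistic, p_value). The p-value uses the identity that the
    survival function of a chi-square variable with 1 degree of freedom
    is erfc(sqrt(x / 2)).
    """
    if len(observed) != 2 or len(expected) != 2:
        raise ValueError("this sketch handles exactly two categories")
    if sum(observed) != sum(expected):
        raise ValueError("observed and expected totals must match")
    statistic = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    p_value = math.erfc(math.sqrt(statistic / 2))
    return statistic, p_value


# Hypothetical toy labels, for illustration only.
truth = ["safe", "safe", "safe", "safe", "threat", "threat"]
preds = ["safe", "safe", "safe", "threat", "threat", "threat"]
accuracy = per_class_accuracy(truth, preds)

# Expected counts follow the dataset split in the abstract (500 safe, 20 threat).
# The observed counts are hypothetical model outputs, not reported results.
statistic, p_value = chi_square_gof(observed=[497, 23], expected=[500, 20])
passes = p_value > 0.05  # fail to reject the hypothesis that the counts match
```

Under this setup, a model "passes" when the p-value exceeds the chosen significance level (0.05 here, also an assumption), meaning the distribution of its predicted labels is not significantly different from the true label distribution. With only 20 threat examples, a handful of misclassifications moves the statistic substantially, so per-class accuracy is worth reporting alongside the test.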
