Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Counterspeakers' Perspectives: Unveiling Barriers and AI Needs in the Fight against Online Hate (2403.00179v1)

Published 29 Feb 2024 in cs.HC

Abstract: Counterspeech, i.e., direct responses against hate speech, has become an important tool to address the increasing amount of hate online while avoiding censorship. Although AI has been proposed to help scale up counterspeech efforts, this raises questions of how exactly AI could assist in this process, since counterspeech is a deeply empathetic and agentic process for those involved. In this work, we aim to answer this question, by conducting in-depth interviews with 10 extensively experienced counterspeakers and a large scale public survey with 342 everyday social media users. In participant responses, we identified four main types of barriers and AI needs related to resources, training, impact, and personal harms. However, our results also revealed overarching concerns of authenticity, agency, and functionality in using AI tools for counterspeech. To conclude, we discuss considerations for designing AI assistants that lower counterspeaking barriers without jeopardizing its meaning and purpose.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (151)
  1. Carolina Are. 2022. The Shadowban Cycle: An Autoethnography of Pole Dancing, Nudity and Censorship on Instagram. Feminist Media Studies 22, 8 (2022), 2002–2019. https://doi.org/10.1080/14680777.2021.1928259
  2. Mana Ashida and Mamoru Komachi. 2022. Towards Automatic Generation of Messages Countering Online Hate Speech and Microaggressions. In Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH). Association for Computational Linguistics, Seattle, Washington (Hybrid), 11–23. https://doi.org/10.18653/v1/2022.woah-1.2
  3. Michael Baggs. 2021. Online hate speech rose 20% during pandemic: ’We’ve normalised it’. BBC (Nov. 2021).
  4. Fabienne Baider. 2023. Accountability Issues, Online Covert Hate Speech, and the Efficacy of Counter‐Speech. Politics and Governance 11, 2 (2023). https://www.cogitatiopress.com/politicsandgovernance/article/view/6465
  5. Considerations for successful counterspeech. Dangerous speech project (2016).
  6. Classification and Its Consequences for Online Harassment: Design Insights from HeartMob. Proceedings of the ACM on Human-Computer Interaction 1, CSCW (Dec. 2017), 24:1–24:19. https://doi.org/10.1145/3134659
  7. Stereotyping Norwegian Salmon: An Inventory of Pitfalls in Fairness Benchmark Datasets. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Chengqing Zong, Fei Xia, Wenjie Li, and Roberto Navigli (Eds.). Association for Computational Linguistics, Online, 1004–1015. https://doi.org/10.18653/v1/2021.acl-long.81
  8. Anti-Solutionist Strategies: Seriously Silly Design Fiction. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (San Jose California USA, 2016-05-07). ACM, 4968–4978. https://doi.org/10.1145/2858036.2858482
  9. Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings. arXiv:1607.06520 [cs.CL]
  10. The Community Builder (CoBi): Helping Students to Develop Better Small Group Collaborative Learning Skills. In Companion Publication of the 2023 Conference on Computer Supported Cooperative Work and Social Computing (Minneapolis, MN, USA) (CSCW ’23 Companion). Association for Computing Machinery, New York, NY, USA, 376–380. https://doi.org/10.1145/3584931.3607498
  11. To Trust or to Think: Cognitive Forcing Functions Can Reduce Overreliance on AI in AI-Assisted Decision-Making. Proc. ACM Hum.-Comput. Interact. 5, CSCW1, Article 188 (apr 2021), 21 pages. https://doi.org/10.1145/3449287
  12. Catherine Buerger. 2021a. #iamhere: Collective Counterspeech and the Quest to Improve Online Discourse. Social Media + Society 7, 4 (2021), 20563051211063843. https://doi.org/10.1177/20563051211063843
  13. Catherine Buerger. 2021b. Speech as a Driver of Intergroup Violence: A Literature Review. https://doi.org/10.2139/ssrn.4066876
  14. Catherine Buerger. 2022. Why They Do It: Counterspeech Theories of Change. https://doi.org/10.2139/ssrn.4245211
  15. Joy Buolamwini and Timnit Gebru. 2018. Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification. In Proceedings of the 1st Conference on Fairness, Accountability and Transparency (Proceedings of Machine Learning Research, Vol. 81), Sorelle A. Friedler and Christo Wilson (Eds.). PMLR, 77–91. https://proceedings.mlr.press/v81/buolamwini18a.html
  16. The Stimulators of Social Media Fatigue Among Students: Role of Moral Disengagement. Journal of Educational Computing Research 57, 5 (2019), 1083–1107. https://doi.org/10.1177/0735633118781907 arXiv:https://doi.org/10.1177/0735633118781907
  17. Internet, Social Media and Online Hate Speech. Systematic Review. Aggression and Violent Behavior 58 (2021), 101608. https://doi.org/10.1016/j.avb.2021.101608
  18. Future teachers confronting extremism and hate speech. Humanities and Social Sciences Communications 9, 1 (15 Jun 2022), 201. https://doi.org/10.1057/s41599-022-01222-4
  19. Pew Research Center. 2023. Public Awareness of Artificial Intelligence in Everyday Activities.
  20. Counterspeech. Philosophy Compass 18, 1 (2023), e12890. https://doi.org/10.1111/phc3.12890
  21. K. Charmaz. 2006. Constructing Grounded Theory: A Practical Guide Through Qualitative Analysis. SAGE Publications. https://books.google.com/books?id=v1qP1KbXz1AC
  22. Kyla Chasalow and Karen Levy. 2021. Representativeness in Statistics, Politics, and Machine Learning. arXiv:2101.03827 [cs.CY]
  23. Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Anna Rogers, Jordan Boyd-Graber, and Naoaki Okazaki (Eds.). Association for Computational Linguistics, Toronto, Canada, 1504–1532. https://doi.org/10.18653/v1/2023.acl-long.84
  24. Understanding Counterspeech for Online Harm Mitigation. arXiv:2307.04761 [cs.CL]
  25. CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Florence, Italy, 2819–2829. https://doi.org/10.18653/v1/P19-1271
  26. Empowering NGOs in countering online hate messages. Online Social Networks and Media 24 (2021), 100150. https://doi.org/10.1016/j.osnem.2021.100150
  27. Danielle Keats Citron and Helen Norton. 2011. Intermediaries and hate speech: Fostering digital citizenship for our information age. BUL Rev. 91 (2011), 1435.
  28. All That’s ‘Human’ Is Not Gold: Evaluating Human Evaluation of Generated Text. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Association for Computational Linguistics, Online, 7282–7296. https://doi.org/10.18653/v1/2021.acl-long.565
  29. Jennifer Cobbe. 2021. Algorithmic Censorship by Social Platforms: Power and Resistance. Philosophy & Technology 34, 4 (dec 2021), 739–766. https://doi.org/10.1007/s13347-020-00429-0
  30. Design Frictions for Mindful Interactions: The Case for Microboundaries. In Proceedings of the 2016 CHI Conference Extended Abstracts on Human Factors in Computing Systems (San Jose, California, USA) (CHI EA ’16). Association for Computing Machinery, New York, NY, USA, 1389–1397. https://doi.org/10.1145/2851581.2892410
  31. KATE CRAWFORD. 2021. The Atlas of AI: Power, Politics, and the Planetary Costs of Artificial Intelligence. Yale University Press. http://www.jstor.org/stable/j.ctv1ghv45t
  32. Stefano Cresci. 2020. A Decade of Social Bot Detection. Commun. ACM 63, 10 (sep 2020), 72–83. https://doi.org/10.1145/3409116
  33. John W Creswell and Vicki L Plano Clark. 2017. Designing and conducting mixed methods research. Sage publications.
  34. Intervening against online hate speech: A case for automated Counterspeech. IEAI Research Brief (2022), 1–8.
  35. John Danaher. 2019. The rise of the robots and the crisis of moral patiency. AI & SOCIETY 34, 1 (01 Mar 2019), 129–136. https://doi.org/10.1007/s00146-017-0773-9
  36. Hate Speech in Online Social Media. SIGWEB Newsl. 2020, Autumn, Article 4 (nov 2020), 8 pages. https://doi.org/10.1145/3427478.3427482
  37. Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled Corpus. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, Marie-Francine Moens, Xuanjing Huang, Lucia Specia, and Scott Wen-tau Yih (Eds.). Association for Computational Linguistics, Online and Punta Cana, Dominican Republic, 1286–1305. https://doi.org/10.18653/v1/2021.emnlp-main.98
  38. Oren Etzioni. 2017. Opinion — How to Regulate Artificial Intelligence. The New York Times (09 2017). https://www.nytimes.com/2017/09/01/opinion/artificial-intelligence-regulations-rules.html
  39. Mirko Farina and Andrea Lavazza. 2023. ChatGPT in society: emerging issues. Frontiers in Artificial Intelligence 6 (2023). https://doi.org/10.3389/frai.2023.1130913
  40. The Rise of Social Bots. Commun. ACM 59, 7 (jun 2016), 96–104. https://doi.org/10.1145/2818717
  41. Paula Fortuna and Sérgio Nunes. 2018. A Survey on Automatic Detection of Hate Speech in Text. Comput. Surveys 51, 4 (July 2018), 85:1–85:30. https://doi.org/10.1145/3232676
  42. Collective Civic Moderation for Deliberation? Exploring the Links between Citizens’ Organized Engagement in Comment Sections and the Deliberative Quality of Online Discussions. Political Communication 38, 5 (2021), 624–646. https://doi.org/10.1080/10584609.2020.1830322
  43. Countering hate on social media: Large scale classification of hate and counter speech. In Proceedings of the Fourth Workshop on Online Abuse and Harms. Association for Computational Linguistics, Online, 102–112. https://doi.org/10.18653/v1/2020.alw-1.13
  44. Impact and dynamics of hate and counter speech online. EPJ Data Science 11, 1 (24 Jan 2022), 3. https://doi.org/10.1140/epjds/s13688-021-00314-6
  45. Towards Possibilities & Impossibilities of AI-generated Text Detection: A Survey. arXiv:2310.15264 [cs.CL]
  46. Tarleton Gillespie. 2020. Content Moderation, AI, and the Question of Scale. Big Data & Society 7, 2 (July 2020), 2053951720943234. https://doi.org/10.1177/2053951720943234
  47. B.G. Glaser and A.L. Strauss. 1967. The Discovery of Grounded Theory: Strategies for Qualitative Research. Aldine Transaction. https://books.google.com/books?id=oUxEAQAAIAAJ
  48. Ella Glikson and Omri Asscher. 2023. AI-mediated apology in a multilingual work context: Implications for perceived authenticity and willingness to forgive. Computers in Human Behavior 140 (2023), 107592. https://doi.org/10.1016/j.chb.2022.107592
  49. Jury Learning: Integrating Dissenting Voices into Machine Learning Models. In CHI Conference on Human Factors in Computing Systems (CHI ’22). ACM. https://doi.org/10.1145/3491102.3502004
  50. ”You Have to Prove the Threat is Real”: Understanding the Needs of Female Journalists and Activists to Document and Report Online Harassment. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (New Orleans, LA, USA) (CHI ’22). Association for Computing Machinery, New York, NY, USA, Article 242, 17 pages. https://doi.org/10.1145/3491102.3517517
  51. Investigating African-American Vernacular English in Transformer-Based Text Generation. arXiv:2010.02510 [cs.CL]
  52. Seva Gunitsky. 2015. Corrupting the Cyber-Commons: Social Media as a Tool of Autocratic Stability. Perspectives on Politics 13, 1 (2015), 42–54. https://doi.org/10.1017/S1537592714003120
  53. Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generation. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Anna Rogers, Jordan Boyd-Graber, and Naoaki Okazaki (Eds.). Association for Computational Linguistics, Toronto, Canada, 5792–5809. https://doi.org/10.18653/v1/2023.acl-long.318
  54. How Transgender People and Communities Were Involved in Trans Technology Design Processes. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (Hamburg, Germany) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 294, 16 pages. https://doi.org/10.1145/3544548.3580972
  55. WokeGPT: Improving Counterspeech Generation Against Online Hate Speech by Intelligently Augmenting Datasets Using a Novel Metric. In 2023 International Joint Conference on Neural Networks (IJCNN) (2023-06). 1–10. https://doi.org/10.1109/IJCNN54540.2023.10191114
  56. Is Civility Contagious? Examining the Impact of Modeling in Online Political Discussions. Social Media + Society 4, 3 (2018), 2056305118793404. https://doi.org/10.1177/2056305118793404
  57. Xiaochuang Han and Yulia Tsvetkov. 2020. Fortifying Toxic Speech Detectors Against Veiled Toxicity. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Online, 7732–7739. https://doi.org/10.18653/v1/2020.emnlp-main.622
  58. AI-Mediated Communication: Definition, Research Agenda, and Ethical Considerations. Journal of Computer-Mediated Communication 25, 1 (01 2020), 89–100. https://doi.org/10.1093/jcmc/zmz022 arXiv:https://academic.oup.com/jcmc/article-pdf/25/1/89/32961176/zmz022.pdf
  59. Empathy-based counterspeech can reduce racist hate speech in a social media field experiment. Proceedings of the National Academy of Sciences 118, 50 (2021), e2116310118. https://doi.org/10.1073/pnas.2116310118 arXiv:https://www.pnas.org/doi/pdf/10.1073/pnas.2116310118
  60. Towards a critical race methodology in algorithmic fairness. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency (Barcelona, Spain) (FAT* ’20). Association for Computing Machinery, New York, NY, USA, 501–512. https://doi.org/10.1145/3351095.3372826
  61. Working at the Intersection of Race, Disability and Accessibility. In Proceedings of the 25th International ACM SIGACCESS Conference on Computers and Accessibility (New York, NY, USA) (ASSETS ’23). Association for Computing Machinery, New York, NY, USA, Article 26, 18 pages. https://doi.org/10.1145/3597638.3608389
  62. ToxiGen: Controlling Language Models to Generate Implied and Adversarial Toxicity. In ACL. https://arxiv.org/abs/2203.09509
  63. Sabit Hassan and Malihe Alikhani. 2023. DisCGen: A Framework for Discourse-Informed Counterspeech Generation. arXiv:2311.18147 [cs.CL]
  64. Jess Hohenstein and Malte Jung. 2020. AI as a moral crumple zone: The effects of AI-mediated communication on attribution and trust. Computers in Human Behavior 106 (2020), 106190. https://doi.org/10.1016/j.chb.2019.106190
  65. Artificial intelligence in communication impacts language and social relationships. Scientific Reports 13, 1 (April 2023), 5487.
  66. Jonathan Horowitz. 2017. Who Is This “We” You Speak of? Grounding Activist Identity in Social Psychology. Socius 3 (2017), 2378023117717819. https://doi.org/10.1177/2378023117717819 arXiv:https://doi.org/10.1177/2378023117717819 PMID: 30221196.
  67. Jeffrey W Howard. 2021. Terror, Hate and the Demands of Counter-Speech. Br. J. Polit. Sci. 51, 3 (July 2021), 924–939.
  68. Women’s Perspectives on Harm and Justice after Online Harassment. Proceedings of the ACM on Human-Computer Interaction 6, CSCW2 (Nov. 2022), 355:1–355:23. https://doi.org/10.1145/3555775
  69. Michael Inzlicht and Aidan V. Campbell. 2022. Effort feels meaningful. Trends in Cognitive Sciences 26, 12 (2022), 1035–1037. https://doi.org/10.1016/j.tics.2022.09.016
  70. Co-Writing with Opinionated Language Models Affects Users’ Views. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. ACM. https://doi.org/10.1145/3544548.3581196
  71. AI-Mediated Communication: How the Perception That Profile Text Was Written by AI Affects Trustworthiness. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (Glasgow, Scotland Uk) (CHI ’19). Association for Computing Machinery, New York, NY, USA, 1–13. https://doi.org/10.1145/3290605.3300469
  72. Systematic review of empirical studies on cyberbullying in adults: What we know and what we should investigate. Aggression and Violent Behavior 38 (2018), 113–122. https://doi.org/10.1016/j.avb.2017.12.003
  73. A Just and Comprehensive Strategy for Using NLP to Address Online Abuse. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Anna Korhonen, David Traum, and Lluís Màrquez (Eds.). Association for Computational Linguistics, Florence, Italy, 3658–3666. https://doi.org/10.18653/v1/P19-1357
  74. Hyunjin Kang and Chen Lou. 2022. AI agency vs. human agency: understanding human–AI interactions on TikTok and their implications for user engagement. Journal of Computer-Mediated Communication 27, 5 (08 2022), zmac014. https://doi.org/10.1093/jcmc/zmac014 arXiv:https://academic.oup.com/jcmc/article-pdf/27/5/zmac014/45473652/zmac014.pdf
  75. David Kaye. 2019. Speech Police: The Global Struggle to Govern the Internet. Columbia Global Reports. http://www.jstor.org/stable/j.ctv1fx4h8v
  76. The shape of and solutions to the MTurk quality crisis. Political Science Research and Methods 8, 4 (2020), 614–629. https://doi.org/10.1017/psrm.2020.6
  77. Barbara A Kitchenham and Shari L Pfleeger. 2008. Personal opinion surveys. In Guide to advanced empirical software engineering. Springer, 63–92.
  78. ChatGPT’s inconsistent moral advice influences users’ judgment. Scientific Reports 13, 1 (April 2023), 4569.
  79. Enes Kulenović. 2022. Should Democracies Ban Hate Speech? Hate Speech Laws and Counterspeech. Ethical Theory and Moral Practice (Nov. 2022). https://doi.org/10.1007/s10677-022-10336-2
  80. What drives unverified information sharing and cyberchondria during the COVID-19 pandemic? European Journal of Information Systems 29, 3 (2020), 288–305. https://doi.org/10.1080/0960085X.2020.1770632 arXiv:https://doi.org/10.1080/0960085X.2020.1770632
  81. Rae Langton. 2018. 144Blocking as Counter-Speech. In New Work on Speech Acts. Oxford University Press. https://doi.org/10.1093/oso/9780198738831.003.0006 arXiv:https://academic.oup.com/book/0/chapter/155957982/chapter-ag-pdf/44951161/book_9256_section_155957982.ag.pdf
  82. Anti-Defamation League. 2023. Online Hate and Harassment: The American Experience. https://www.adl.org/resources/report/online-hate-and-harassment-american-experience-2023
  83. Kalev Leetaru. 2019. Online Toxicity Is As Old As The Web Itself But The Return To Communities May Help. Forbes Magazine (May 2019).
  84. Olivier Lemeire. 2021. Falsifying generic stereotypes. Philosophical Studies 178, 7 (01 Jul 2021), 2293–2312. https://doi.org/10.1007/s11098-020-01555-3
  85. Maxime Lepoutre. 2022. Hateful Counterspeech. Ethical Theory and Moral Practice (27 Oct 2022). https://doi.org/10.1007/s10677-022-10323-7
  86. Duri Long and Brian Magerko. 2020. What is AI Literacy? Competencies and Design Considerations. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (Honolulu, HI, USA) (CHI ’20). Association for Computing Machinery, New York, NY, USA, 1–16. https://doi.org/10.1145/3313831.3376727
  87. Stable Bias: Evaluating Societal Representations in Diffusion Models. In Thirty-seventh Conference on Neural Information Processing Systems Datasets and Benchmarks Track. https://openreview.net/forum?id=qVXYU3F017
  88. Sana Maqsood and Sonia Chiasson. 2021. “They Think It’s Totally Fine to Talk to Somebody on the Internet They Don’t Know”: Teachers’ Perceptions and Mitigation Strategies of Tweens’ Online Risks. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama, Japan) (CHI ’21). Association for Computing Machinery, New York, NY, USA, Article 688, 17 pages. https://doi.org/10.1145/3411764.3445224
  89. Slowing It Down: Towards Facilitating Interpersonal Mindfulness in Online Polarizing Conversations Over Social Media. Proc. ACM Hum.-Comput. Interact. 7, CSCW1, Article 90 (apr 2023), 27 pages. https://doi.org/10.1145/3579523
  90. Hate Begets Hate: A Temporal Study of Hate Speech. Proc. ACM Hum.-Comput. Interact. 4, CSCW2, Article 92 (oct 2020), 24 pages. https://doi.org/10.1145/3415163
  91. Thou Shalt Not Hate: Countering Online Hate Speech. ICWSM 13 (July 2019), 369–380. http://arxiv.org/abs/1808.04409
  92. A Survey on Bias and Fairness in Machine Learning. ACM Comput. Surv. 54, 6, Article 115 (jul 2021), 35 pages. https://doi.org/10.1145/3457607
  93. Countering Hate Speech on Facebook: The Case of the Roma Minority in Slovakia. Social Science Computer Review 38, 2 (April 2020), 128–146. https://doi.org/10.1177/0894439318791786
  94. A Measurement Study of Hate Speech in Social Media. In Proceedings of the 28th ACM Conference on Hypertext and Social Media (Prague, Czech Republic) (HT ’17). Association for Computing Machinery, New York, NY, USA, 85–94. https://doi.org/10.1145/3078714.3078723
  95. Beyond Denouncing Hate: Strategies for Countering Implied Biases and Stereotypes in Language. arXiv:2311.00161 [cs.CL]
  96. Sarah Myers West. 2018. Censored, suspended, shadowbanned: User interpretations of content moderation on social media platforms. New Media & Society 20, 11 (2018), 4366–4383.
  97. Dawn Carla Nunziato. 2021. The Varieties of Counterspeech and Censorship on Social Media Symposium: Cheap Speech Twenty-Five Years Later: Democracy & Public Discourse in the Digital Age. UC Davis Law Review 54, 5 (2021), 2491–2552. https://heinonline.org/HOL/P?h=hein.journals/davlr54&i=2509
  98. Comparing the Perceived Legitimacy of Content Moderation Processes: Contractors, Algorithms, Expert Panels, and Digital Juries. Proc. ACM Hum.-Comput. Interact. CSCW (Oct. 2022).
  99. Bhikhu Parekh. 2012. Is There a Case for Banning Hate Speech? In The Content and Context of Hate Speech: Rethinking Regulation and Responses, Michael Herz and Peter Molnar (Eds.). Cambridge University Press, Cambridge, 37–56. https://doi.org/10.1017/CBO9781139042871.006
  100. Sara Parker and Derek Ruths. 2023. Is hate speech detection the solution the world wants? Proceedings of the National Academy of Sciences 120, 10 (2023), e2209384120. https://doi.org/10.1073/pnas.2209384120 arXiv:https://www.pnas.org/doi/pdf/10.1073/pnas.2209384120
  101. Characterizations of Online Harassment: Comparing Policies Across Social Media Platforms. In Proceedings of the 2016 ACM International Conference on Supporting Group Work (GROUP ’16). Association for Computing Machinery, New York, NY, USA, 369–374. https://doi.org/10.1145/2957276.2957297
  102. Hate Speech: A Systematized Review. SAGE Open 10, 4 (2020), 2158244020973022. https://doi.org/10.1177/2158244020973022 arXiv:https://doi.org/10.1177/2158244020973022
  103. Resources and benchmark corpora for hate speech detection: a systematic review. Language Resources and Evaluation 55, 2 (01 Jun 2021), 477–523. https://doi.org/10.1007/s10579-020-09502-8
  104. Loretta J Ross. 2019. Speaking up without tearing down. Teaching Tolerance 61 (2019), 19–22.
  105. Systematic Review of Determinants and Consequences of Bystander Interventions in Online Hate and Cyberbullying among Adults. Behaviour & Information Technology 42, 5 (April 2023), 527–544. https://doi.org/10.1080/0144929X.2022.2027013
  106. Can AI-Generated Text be Reliably Detected? arXiv:2303.11156 [cs.CL]
  107. On the Rise of Fear Speech in Online Social Media. Proceedings of the National Academy of Sciences 120, 11 (march 2023), e2212270120. https://doi.org/10.1073/pnas.2212270120
  108. CounterGeDi: A controllable approach to generate polite, detoxified and emotional counterspeech. (May 2022). arXiv:2205.04304 [cs.CL]
  109. NLPositionality: Characterizing Design Biases of Datasets and Models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Toronto, Canada, 9080–9102. https://doi.org/10.18653/v1/2023.acl-long.505
  110. The Risk of Racial Bias in Hate Speech Detection. In ACL. https://www.aclweb.org/anthology/P19-1163.pdf
  111. Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language Detection. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, Seattle, United States, 5884–5906. https://doi.org/10.18653/v1/2022.naacl-main.431
  112. Annotators with Attitudes: How Annotator Beliefs And Identities Bias Toxic Language Detection. arXiv:2111.07997 [cs.CL]
  113. Julia Sasse and Jens Grossklags. 2023. Breaking the Silence: Investigating Which Types of Moderation Reduce Negative Effects of Sexist Social Media Content. Proc. ACM Hum.-Comput. Interact. 7, CSCW2, Article 327 (oct 2023), 26 pages. https://doi.org/10.1145/3610176
  114. How Computers See Gender: An Evaluation of Gender Classification in Commercial Facial Analysis Services. Proc. ACM Hum.-Comput. Interact. 3, CSCW, Article 144 (nov 2019), 33 pages. https://doi.org/10.1145/3359246
  115. Let’s Talk About Race: Identity, Chatbots, and AI. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems (Montreal QC, Canada) (CHI ’18). Association for Computing Machinery, New York, NY, USA, 1–14. https://doi.org/10.1145/3173574.3173889
  116. Proactive Moderation of Online Discussions: Existing Practices and the Potential for Algorithmic Support. Proc. ACM Hum.-Comput. Interact. 6, CSCW2, Article 370 (nov 2022), 27 pages. https://doi.org/10.1145/3555095
  117. Cyberbullying and Mental Health in Adults: The Moderating Role of Social Media Use and Gender. Frontiers in Psychiatry 12 (2021). https://doi.org/10.3389/fpsyt.2021.674298
  118. Online Harassment in Majority Contexts: Examining Harms and Remedies across Countries. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (¡conf-loc¿, ¡city¿Hamburg¡/city¿, ¡country¿Germany¡/country¿, ¡/conf-loc¿) (CHI ’23). Association for Computing Machinery, New York, NY, USA, Article 485, 16 pages. https://doi.org/10.1145/3544548.3581020
  119. Judith Schoonenboom and R. Burke Johnson. 2017. How to Construct a Mixed Methods Research Design. KZfSS Kölner Zeitschrift für Soziologie und Sozialpsychologie 69, 2 (01 Oct 2017), 107–131. https://doi.org/10.1007/s11577-017-0454-1
  120. Social identity as a key concept for connecting transformative societal change with individual environmental activism. Journal of Environmental Psychology 72 (2020), 101525. https://doi.org/10.1016/j.jenvp.2020.101525
  121. Ava Elizabeth Scott. 2023. To Do or Not To Do? Managing Intentions with Technology. In Extended Abstracts of the 2023 CHI Conference on Human Factors in Computing Systems (Hamburg, Germany) (CHI EA ’23). Association for Computing Machinery, New York, NY, USA, Article 504, 7 pages. https://doi.org/10.1145/3544549.3577046
  122. Joseph Seering. 2020. Reconsidering Self-Moderation: The Role of Research in Supporting Community-Based Models for Online Content Moderation. Proc. ACM Hum.-Comput. Interact. 4, CSCW2, Article 107 (oct 2020), 28 pages. https://doi.org/10.1145/3415178
  123. Reflective Design. In Proceedings of the 4th Decennial Conference on Critical Computing: Between Sense and Sensibility (Aarhus, Denmark) (CC ’05). Association for Computing Machinery, New York, NY, USA, 49–58. https://doi.org/10.1145/1094562.1094569
  124. AI and the quest for diversity and inclusion: a systematic literature review. AI and Ethics (13 Nov 2023). https://doi.org/10.1007/s43681-023-00362-w
  125. Human–AI collaboration enables more empathic conversations in text-based peer-to-peer mental health support. Nature Machine Intelligence 5, 1 (01 Jan 2023), 46–57. https://doi.org/10.1038/s42256-022-00593-2
  126. Jana Siebert and Johannes Ulrich Siebert. 2023. Effective mitigation of the belief perseverance bias after the retraction of misinformation: Awareness training and counter-speech. PLOS ONE 18, 3 (03 2023), 1–22. https://doi.org/10.1371/journal.pone.0282202
  127. What’s Race Got To Do With It? Engaging in Race in HCI. In Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems (Honolulu, HI, USA) (CHI EA ’20). Association for Computing Machinery, New York, NY, USA, 1–8. https://doi.org/10.1145/3334480.3375156
  128. A Roadmap to Pluralistic Alignment. arXiv:2402.05070 [cs.AI]
  129. Artificial Intelligence and Life in 2030. http://ai100.stanford.edu/2016-report
  130. Barry Stricke. 2019. People v. Robots: A Roadmap for Enforcing California’s New Online Bot Disclosure Act. Vanderbilt Journal of Entertainment & Technology Law 22, 4 (2019), 839–894.
  131. Krista Thomason. 2021. The Moral Risks of Online Shaming. In Oxford Handbook of Digital Ethics. Oxford University Press.
  132. Stefanie Ullmann and Marcus Tomalin. 2023. Counterspeech: Multidisciplinary Perspectives on Countering Dangerous Speech. Taylor & Francis.
  133. United Nations, Human Rights Council. 2021. Recommendations made by the Forum on Minority Issues at its thirteenth session on the theme “Hate speech, social media and minorities”. Human Rights Council, Forty-sixth session, Agenda item 5. Available from https://undocs.org/A/HRC/46/58.
  134. Human-Centered Artificial Intelligence: Designing for User Empowerment and Ethical Considerations. In 2023 5th International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA). 1–7. https://doi.org/10.1109/HORA58378.2023.10156761
  135. Explanations Can Reduce Overreliance on AI Systems During Decision-Making. Proc. ACM Hum.-Comput. Interact. 7, CSCW1, Article 129 (apr 2023), 38 pages. https://doi.org/10.1145/3579605
  136. Artificial Artificial Artificial Intelligence: Crowd Workers Widely Use Large Language Models for Text Production Tasks. arXiv preprint arXiv:2306.07899 (2023).
  137. Emily A Vogels. 2021. The state of online harassment. Pew Research Center 13 (2021), 625.
  138. John Frank Weaver. 2018. We Need the California Bot Bill, but We Need It to Be Better Everything Is Not Terminator. RAIL: The Journal of Robotics, Artificial Intelligence & Law 1, 6 (2018), [vi]–438. https://heinonline.org/HOL/P?h=hein.journals/rail1&i=444
  139. Ethical and social risks of harm from Language Models. CoRR abs/2112.04359 (2021). arXiv:2112.04359 https://arxiv.org/abs/2112.04359
  140. What Makes Online Communities ‘Better’? Measuring Values, Consensus, and Conflict across Thousands of Subreddits. Proceedings of the International AAAI Conference on Web and Social Media 16, 1 (May 2022), 1121–1132. https://doi.org/10.1609/icwsm.v16i1.19363
  141. Suzanne Whitten. 2023. A Republican Conception of Counterspeech. Ethical Theory and Moral Practice (28 Jul 2023). https://doi.org/10.1007/s10677-023-10409-w
  142. Claudia Wilhelm and Sven Joeckel. 2019. Gendered Morality and Backlash Effects in Online Discussions: An Experimental Study on How Users Respond to Hate Speech Comments Against Women and Sexual Minorities. Sex Roles 80, 7 (April 2019), 381–392. https://doi.org/10.1007/s11199-018-0941-5
  143. Mickie Wong-Lo and Lyndal M. Bullock. 2014. Digital Metamorphosis: Examination of the Bystander Culture in Cyberbullying. Aggression and Violent Behavior 19, 4 (July 2014), 418–422. https://doi.org/10.1016/j.avb.2014.06.007
  144. Tianyi Xie and Renee V. Galliher. 2023. Responding to Microaggressions: Social Cost of Bystander Intervention Strategies. The Counseling Psychologist 51, 2 (2023), 242–269. https://doi.org/10.1177/00110000221140482 arXiv:https://doi.org/10.1177/00110000221140482
  145. Guobin Yang. [n. d.]. Narrative Agency in Hashtag Activism: The Case of #BlackLivesMatter. Media and Communication 4, 472 ([n. d.]), 13–17. https://repository.upenn.edu/handle/20.500.14332/2135
  146. Scalable and Generalizable Social Bot Detection through Data Selection. Proceedings of the AAAI Conference on Artificial Intelligence 34, 01 (Apr. 2020), 1096–1103. https://doi.org/10.1609/aaai.v34i01.5460
  147. Hate Speech and Counter Speech Detection: Conversational Context Does Matter. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Seattle, United States, 2022-07). Association for Computational Linguistics, 5918–5930. https://doi.org/10.18653/v1/2022.naacl-main.433
  148. Assessing the potential of GPT-4 to perpetuate racial and gender biases in health care: a model evaluation study. The Lancet Digital Health 6, 1 (2024), e12–e22. https://doi.org/10.1016/S2589-7500(23)00225-X
  149. Wanzheng Zhu and Suma Bhat. 2021. Generate, Prune, Select: A Pipeline for Counterspeech Generation against Online Hate Speech. (June 2021). arXiv:2106.01625 [cs.CL]
  150. Racism is a Virus: Anti-Asian Hate and Counterhate in Social Media during the COVID-19 Crisis. CoRR abs/2005.12423 (2020). arXiv:2005.12423 https://arxiv.org/abs/2005.12423
  151. Homophobic Hate Speech Affects Well-Being of Highly Identified LGBT People. Journal of Language and Social Psychology 42, 4 (2023), 453–463. https://doi.org/10.1177/0261927X231174569 arXiv:https://doi.org/10.1177/0261927X231174569
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Jimin Mun (7 papers)
  2. Cathy Buerger (1 paper)
  3. Jenny T. Liang (11 papers)
  4. Joshua Garland (35 papers)
  5. Maarten Sap (86 papers)
Citations (7)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com