Decentralised Moderation for Interoperable Social Networks: A Conversation-based Approach for Pleroma and the Fediverse (2404.03048v2)

Published 3 Apr 2024 in cs.CY and cs.CL

Abstract: The recent development of decentralised and interoperable social networks (such as the "fediverse") creates new challenges for content moderators. This is because millions of posts generated on one server can easily "spread" to another, even if the recipient server has very different moderation policies. An obvious solution would be to leverage moderation tools to automatically tag (and filter) posts that contravene moderation policies, e.g. those related to toxic speech. Recent work has exploited the conversational context of a post to improve this automatic tagging, e.g. using the replies to a post to help classify whether it contains toxic speech. This has shown particular potential in environments with large training sets that contain complete conversations. In a decentralised context, however, this creates challenges, as a single conversation may be fragmented across multiple servers. Each server therefore has only a partial view of a conversation, because conversations are often federated across servers in a non-synchronised fashion. To address this, we propose a decentralised conversation-aware content moderation approach suitable for the fediverse. Our approach employs a graph deep learning model (GraphNLI) trained locally on each server. The model uses only local data and combines post and conversational information, captured through random walks, to detect toxicity. We evaluate our approach with data from Pleroma, a major decentralised and interoperable micro-blogging network containing 2 million conversations. Our model effectively detects toxicity on larger instances when trained exclusively on their local post information (0.8837 macro-F1). Our approach has considerable scope to improve moderation in decentralised and interoperable social networks such as Pleroma or Mastodon.
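
To illustrate how conversational context might be gathered through random walks when a server holds only part of a conversation, here is a minimal Python sketch. It is not the paper's implementation. The function name `sample_context`, the parameters `walk_length` and `p_parent`, and the rule of stepping to the parent with a fixed probability are all assumptions made for illustration. The reply graph is given as two plain dictionaries covering only the posts the local server knows about, so a walk simply stops when it reaches a post whose neighbours were never federated to this server.

```python
import random
from typing import Dict, List, Optional


def sample_context(
    post_id: str,
    parent_of: Dict[str, str],
    children_of: Dict[str, List[str]],
    walk_length: int = 4,
    p_parent: float = 0.75,
    rng: Optional[random.Random] = None,
) -> List[str]:
    """Collect context posts for `post_id` with one random walk.

    Hypothetical helper: at each step the walk moves to the parent post
    with probability `p_parent` (or always, if there are no known replies),
    otherwise to a randomly chosen reply. Only locally known edges are used.
    """
    rng = rng or random.Random()
    context: List[str] = []
    current = post_id
    for _ in range(walk_length):
        parent = parent_of.get(current)    # None if the parent is not stored locally
        kids = children_of.get(current, [])
        if parent is not None and (not kids or rng.random() < p_parent):
            nxt = parent
        elif kids:
            nxt = rng.choice(kids)
        else:
            break                          # no locally known neighbour: partial view ends here
        if nxt != post_id and nxt not in context:
            context.append(nxt)
        current = nxt
    return context


# Toy conversation a <- b <- c (c replies to b, b replies to a).
parent_of = {"b": "a", "c": "b"}
children_of = {"a": ["b"], "b": ["c"]}
walk_up = sample_context("c", parent_of, children_of, walk_length=2, p_parent=1.0)
```

In a full pipeline, the text of the sampled posts would be passed to the classifier together with the target post. That step is omitted here because the abstract does not specify it.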
