Mathematical Framework for Online Social Media Auditing (2209.05550v2)
Abstract: Social media platforms (SMPs) use algorithmic filtering (AF) to select the content that makes up a user's feed, with the aim of maximizing the platforms' rewards. Selectively choosing the content shown in a user's feed can influence the user's decision-making, to a minor or major degree, compared with what it would have been under a natural/fair content selection. As we have witnessed over the past decade, algorithmic filtering can cause detrimental side effects, ranging from biasing individual decisions to shaping those of society as a whole, for example, diverting users' attention from the question of whether to get the COVID-19 vaccine or inducing the public to choose a particular presidential candidate. Government attempts to regulate the adverse effects of AF are often complicated by bureaucracy, legal affairs, and financial considerations. SMPs, on the other hand, seek to monitor their own algorithmic activities to avoid being fined for exceeding the allowable threshold. In this paper, we mathematically formalize this framework and use it to construct a data-driven statistical auditing procedure that keeps AF from deflecting users' beliefs over time, along with sample complexity guarantees. This state-of-the-art algorithm can be used either by authorities acting as external regulators or by SMPs for self-auditing.
Authors: Wasim Huleihel, Yehonathan Refael