Algorithmic Collective Action in Machine Learning (2302.04262v3)
Abstract: We initiate a principled study of algorithmic collective action on digital platforms that deploy machine learning algorithms. We propose a simple theoretical model of a collective interacting with a firm's learning algorithm. The collective pools the data of participating individuals and executes an algorithmic strategy by instructing participants how to modify their own data to achieve a collective goal. We investigate the consequences of this model in three fundamental learning-theoretic settings: the case of a nonparametric optimal learning algorithm, a parametric risk minimizer, and gradient-based optimization. In each setting, we devise coordinated algorithmic strategies and characterize natural success criteria as a function of the collective's size. Complementing our theory, we conduct systematic experiments on a skill classification task involving tens of thousands of resumes from a gig platform for freelancers. Through more than two thousand model training runs of a BERT-like language model, we see a striking correspondence emerge between our empirical observations and the predictions made by our theory. Taken together, our theory and experiments broadly support the conclusion that algorithmic collectives of exceedingly small fractional size can exert significant control over a platform's learning algorithm.