Public-private funding models in open source software development: A case study on scikit-learn (2404.06484v5)
Abstract: Governments are increasingly funding open source software (OSS) development to support software security, digital sovereignty, and national competitiveness in science and innovation, among other goals. However, little is known about how OSS developers evaluate the relative benefits and drawbacks of governmental funding for OSS. This study explores this question through a case study on scikit-learn, a Python library for machine learning, funded by public research grants, commercial sponsorship, micro-donations, and a 32 million euro grant announced in France's artificial intelligence strategy. Through 25 interviews with scikit-learn's maintainers and funders, this study makes two key contributions. First, it contributes empirical findings about the benefits and drawbacks of public and private funding in an impactful OSS project, and the governance protocols employed by the maintainers to balance the diverse interests of their community and funders. Second, it offers practical lessons on funding for OSS developers, governments, and companies based on the experience of scikit-learn. The paper concludes with key recommendations for practitioners and future research directions.