The Participatory Ceiling in Foundation Models
The paper entitled "Participation in the Age of Foundation Models" by Harini Suresh et al., presented at the ACM Conference on Fairness, Accountability, and Transparency (FAccT '24), explores the intersection of participatory ML approaches and the deployment of foundation models. This analysis explores how to empower historically marginalized communities which are often disproportionately affected by power imbalances and potential harms induced by these models.
Participatory Machine Learning and Foundation Models
Participatory ML has gained attention as a pathway to reduce biases and redistribute power by incorporating diverse stakeholder perspectives. Traditionally, this approach has been employed in specific applications where the context, stakeholders, and tasks are well-defined. However, the universal applicability and vast scope of foundation models bring new challenges to participatory approaches.
Foundation models represent a shift in ML paradigms, aimed at providing generalizable solutions across a myriad of tasks. Such models, trained on extensive datasets often scraped from the internet, are primarily developed and controlled by large tech companies. This centralization of control presents significant barriers to meaningful participation from affected communities.
Analysis Using the Parameters of Participation
The paper evaluates existing participatory efforts in foundation model development using the "Parameters of Participation" framework by Delgado et al. This framework examines the goals, stakes, scope, and forms of participation along a spectrum ranging from consultation to ownership. Several mechanisms were reviewed, including reinforcement learning with human feedback (RLHF), the development of rulesets and policies, and red teaming.
- RLHF: Primarily involves gathering feedback from crowd workers to refine models through preference modeling. This method usually only achieves the consultation mode of participation as it focuses on improving model performance without engaging deeper with impacted communities.
- Rulesets and Policies: This includes efforts like Anthropic's Collective Constitutional AI, where public polling is used to determine values that guide the model’s behavior. These efforts often remain at the inclusion mode due to limited stakes and forms of engagement, as participants do not effectively shape foundational decisions.
- Red Teaming: Involves domain experts identifying vulnerabilities in models. While useful, these initiatives are mostly confined to consultative roles without extending into collaboration or ownership modes due to their limited impact on decision-making.
The Participatory Ceiling and Centralization of Control
The paper identifies a "participatory ceiling," a conceptual threshold limiting the efficacy of participatory efforts in universally applicable foundation models. This ceiling exists primarily due to two factors:
- Corporate Control and Incentives: Foundation model developers, often large tech companies, lack sufficient incentives to share control or engage deeply with affected communities. These companies are driven by profit motives and risk aversion, limiting transparency and meaningful stakeholder engagement.
- Context-Agnostic Models: Foundation models are designed to be generalizable, making it challenging to address specific, context-sensitive issues through participatory means. Participation necessitates context-specificity to be meaningful, which is at odds with the universal applicability of foundation models.
Blueprint for Participatory Foundation Models
To overcome these challenges, the paper proposes a three-layer framework to facilitate meaningful participation:
- Foundation Layer: The basic, universally applicable model developed by tech companies or other institutions. This layer should ensure foundational utility and be amenable to inputs from subsequent layers.
- Subfloor Layer: This intermediate layer focuses on domain-specific technical infrastructure, norms, and governance. For instance, in the healthcare domain, this could involve patient advocacy groups pooling data to audit speech-to-text models for inclusivity and fairness.
- Surface Layer: This layer pertains to specific downstream applications, where localized participatory methods can be used to co-create solutions tailored to a particular context, such as a clinic deploying a transcription tool for patient interactions.
Case Studies
The authors provide case studies to illustrate the application of this blueprint:
- Clinical Care: By engaging patient advocacy groups to collect dialect-specific data, the speech-to-text model can be fine-tuned for inclusivity, and localized use cases can benefit from domain-specific audits and guidelines.
- Journalism: Addressing copyright concerns, entities like newsrooms can form collectives at the subfloor layer to assert collective rights and negotiate fair use agreements with foundation model developers.
- Financial Services: Establishing robust reporting and recourse mechanisms at the subfloor layer can help better capture discriminatory patterns and inform more equitable fraud detection systems.
Conclusion
The proposed framework aims to break the participatory ceiling through domain- and application-specific engagements that collectively shape foundation models while recognizing local expertise and needs. This stratification enables more robust and meaningful participation, balancing the scale and generalizability of foundation models with the necessity for contextual sensitivity and community empowerment.
The paper underscores the importance of continuing to explore mechanisms that allow for equitable and just governance of AI technologies. It calls for further research into scoping subfloors, fostering accountability, and ensuring that participatory efforts genuinely redistribute power to those most affected by technological advancements.