Preserving depth of insight in scalable participation models

Determine the extent to which lower-touch or asynchronous participatory methods can preserve the depth of insights into participants’ normative preferences for cultural representation and the level of participant buy-in achieved by synchronous, in-depth engagements when systematizing community-informed rubrics for evaluating the cultural appropriateness of AI-generated images of cultural artifacts.

Background

The paper engages three communities (blind and low-vision individuals in the UK, and residents of Kerala and Tamil Nadu) to co-design rubrics that systematize what constitutes culturally appropriate depictions of salient artifacts in AI-generated images. These rubrics are intended for later operationalization in MLLM-as-a-judge pipelines.

While the study demonstrates the benefits of synchronous, in-depth workshops for eliciting lived and normative perspectives (including participant buy-in and nuanced criteria), the authors also discuss pragmatic pressures to scale participation. They outline lighter-weight approaches (e.g., asynchronous elicitation methods, interfaces to help users design LLM-as-a-judge criteria, or starting from LLM-generated rubrics and refining with communities), and raise an explicit open question about whether such approaches can preserve the depth achieved by synchronous engagements.

References

While such lower-touch approaches may improve scalability, our study suggests that synchronous engagement plays an important role in fostering participant buy-in and surfacing deeper insights into participants' more normative desires for cultural representation (the higher-level themes) , highlighting an open question about how much of this depth can be preserved in more lightweight models of participation.

Evaluating AI-Generated Images of Cultural Artifacts with Community-Informed Rubrics  (2604.02406 - Johnson et al., 2 Apr 2026) in Section 6.2, Defining the scope and purpose of community-centered measurement (Discussion)