Dice Question Streamline Icon: https://streamlinehq.com

Consensus process for retiring deprecated benchmark components

Establish a formal, field-wide consensus process for retiring deprecated data-generating mechanisms, methods, and performance measures from living synthetic benchmarks, specifying governance procedures (e.g., voting mechanisms) that are appropriate to the benchmark’s organizational structure.

Information Square Streamline Icon: https://streamlinehq.com

Background

Living synthetic benchmarks will inevitably accumulate outdated or practically irrelevant components over time. To maintain clarity of reporting and conserve computational resources, such components may need to be removed.

The paper notes that deciding how to reach consensus on deprecating components depends on how a benchmark is organized and is currently unresolved, implying the need for defined governance procedures—potentially including voting or other decision mechanisms—to guide removals in a neutral, transparent manner.

References

How exactly such a consensus should be reached (e.g., by voting) strongly depends on how a benchmark is organized. This is still an open question, which we will discuss further in Section~\ref{sec:discussion}.

Living Synthetic Benchmarks: A Neutral and Cumulative Framework for Simulation Studies (2510.19489 - Bartoš et al., 22 Oct 2025) in Section 2, Subsection 'Retiring Deprecated Benchmark Components'