Disambiguating the causes of unsuccessful outcomes after intervention
Ascertain whether unsuccessful trial outcomes after applying DoVer’s orchestrator-level interventions in LLM-based multi-agent systems arise from incorrect failure attribution hypotheses or from system limitations that prevent faithful execution of the intervention, and develop diagnostics that reliably differentiate between these two causes to resolve the ambiguity in intervention evaluation.
Sponsor
References
The Inconclusive category is necessary because we frequently observe that agents fail to follow the intervened instruction, resulting in unsuccessful trials. In such cases, it is unclear whether the outcome stems from an incorrect failure hypothesis or from other limitations of the system that prevent the intervention from being carried out.