Trust and Verification of ArachNet-Generated Workflows Without Expert Ground Truth
Establish validation methodologies and correctness guarantees for workflows generated by ArachNet when addressing novel queries in the absence of expert ground truth, and develop mechanisms to verify that a generated workflow applies the appropriate measurement methodology for the given query.
References
While our case studies demonstrate functional equivalence to expert solutions in specific scenarios, several verification questions remain open. How do we validate that generated workflows are correct for novel queries without expert ground truth? What guarantees can we provide about workflow correctness? However, the core challenge of verifying that a workflow uses the right measurement methodology for a given query remains open.
— Towards an Agentic Workflow for Internet Measurement Research
(2511.10611 - Ramanathan et al., 13 Nov 2025) in Section: Research Challenges — Trust and Verification