Verifying declared workloads from network-tap logs
Develop analysis methods that, given sampled inter-accelerator communication logs (including GPU kernels) captured by mutually vetted network taps in an AI compute cluster, can verify whether the cluster executed only the declared high-level code and no undeclared workloads.
References
Analyzes the kernels, other logged data, and Prover declarations to verify that the compute cluster executed (only) the declared high-level code. This is an unsolved problem.
— Verifying International Agreements on AI: Six Layers of Verification for Rules on Large-Scale AI Development and Deployment
(2507.15916 - Baker et al., 21 Jul 2025) in Appendix A.3 (Network Taps and Analysis)