Design secure, comprehensive research APIs for white‑/de facto white‑box auditing
Develop structured research application programming interfaces (APIs) that enable external auditors to run arbitrary white‑box analyses on proprietary AI models while preventing parameter leakage and model reconstruction, achieving sufficient comprehensiveness, flexibility, and security for rigorous auditing of AI systems.
References
Overall, while conceptually simple, designing APIs that simultaneously provide the comprehensiveness, flexibility, and security required for rigorous auditing is an open area of research.
— Black-Box Access is Insufficient for Rigorous AI Audits
(2401.14446 - Casper et al., 25 Jan 2024) in Section 6, Methods to Address Security Risks – Technical: API access