Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
125 tokens/sec
GPT-4o
47 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Distributionally Robust Statistical Verification with Imprecise Neural Networks (2308.14815v3)

Published 28 Aug 2023 in cs.AI, cs.LG, and cs.RO

Abstract: A particularly challenging problem in AI safety is providing guarantees on the behavior of high-dimensional autonomous systems. Verification approaches centered around reachability analysis fail to scale, and purely statistical approaches are constrained by the distributional assumptions about the sampling process. Instead, we pose a distributionally robust version of the statistical verification problem for black-box systems, where our performance guarantees hold over a large family of distributions. This paper proposes a novel approach based on a combination of active learning, uncertainty quantification, and neural network verification. A central piece of our approach is an ensemble technique called Imprecise Neural Networks, which provides the uncertainty to guide active learning. The active learning uses an exhaustive neural-network verification tool Sherlock to collect samples. An evaluation on multiple physical simulators in the openAI gym Mujoco environments with reinforcement-learned controllers demonstrates that our approach can provide useful and scalable guarantees for high-dimensional systems.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (71)
  1. Additivity of uncertainty measures on credal sets. International Journal of General Systems, 34(6):691–713, 2005.
  2. Quantitative verification with adaptive uncertainty reduction. Journal of Systems and Software, 188:111275, June 2022. ISSN 0164-1212. 10.1016/j.jss.2022.111275. URL https://www.sciencedirect.com/science/article/pii/S016412122200036X.
  3. Probabilistically safe motion planning to avoid dynamic obstacles with uncertain motion patterns. Autonomous Robots, 35(1):51–76, July 2013. ISSN 1573-7527. 10.1007/s10514-013-9334-3. URL https://doi.org/10.1007/s10514-013-9334-3.
  4. DeepAbstract: Neural Network Abstraction for Accelerating Verification. In Dang Van Hung and Oleg Sokolsky, editors, Automated Technology for Verification and Analysis, Lecture Notes in Computer Science, pages 92–107, Cham, 2020. Springer International Publishing. ISBN 978-3-030-59152-6. 10.1007/978-3-030-59152-6_5.
  5. Context-Specific Validation of Data-Driven Models. arXiv:1802.04929 [cs], February 2018. URL http://arxiv.org/abs/1802.04929. arXiv: 1802.04929.
  6. OpenAI Gym, June 2016. URL http://arxiv.org/abs/1606.01540. arXiv:1606.01540 [cs].
  7. Active learning for regression based on query by committee. In Hujun Yin, Peter Tino, Emilio Corchado, Will Byrne, and Xin Yao, editors, Intelligent Data Engineering and Automated Learning - IDEAL 2007, pages 209–218, Berlin, Heidelberg, 2007. Springer Berlin Heidelberg. ISBN 978-3-540-77226-2.
  8. Neural predictive monitoring under partial observability. In Lu Feng and Dana Fisman, editors, Runtime Verification, pages 121–141, Cham, 2021. Springer International Publishing. ISBN 978-3-030-88494-9.
  9. Imprecise Bayesian neural networks. Submitted to AAAI 2023, 2022.
  10. A novel Bayes’ theorem for upper probabilities. Available at \hrefhttps://arxiv.org/abs/2307.06831arXiv:2307.06831, 2023.
  11. Statistical Guarantees for the Robustness of Bayesian Neural Networks. pages 5693–5700, 2019. URL https://www.ijcai.org/proceedings/2019/789.
  12. Pointwise Feasibility of Gaussian Process-based Safety-Critical Control under Model Uncertainty. In 2021 60th IEEE Conference on Decision and Control (CDC), pages 6762–6769, December 2021. 10.1109/CDC45484.2021.9683743. ISSN: 2576-2370.
  13. Discovering Closed-Loop Failures of Vision-Based Controllers via Reachability Analysis. IEEE Robotics and Automation Letters, 8(5):2692–2699, May 2023. ISSN 2377-3766. 10.1109/LRA.2023.3258719. Conference Name: IEEE Robotics and Automation Letters.
  14. Risk verification of stochastic systems with neural network controllers. Artificial Intelligence, 313:103782, 2022. ISSN 0004-3702. https://doi.org/10.1016/j.artint.2022.103782. URL https://www.sciencedirect.com/science/article/pii/S0004370222001229.
  15. Frank P. A. Coolen. Imprecise highest density regions related to intervals of measures. Memorandum COSOR, 9254, 1992.
  16. A Survey of Algorithms for Black-Box Safety Validation of Cyber-Physical Systems. Journal of Artificial Intelligence Research, 72:377–428, January 2022. ISSN 1076-9757. 10.1613/jair.1.12716. URL https://dl.acm.org/doi/10.1613/jair.1.12716.
  17. Thierry Denoeux. An evidential neural network model for regression based on random fuzzy numbers. Available at \hrefhttps://arxiv.org/abs/2208.00647arXiv:2208.00647, 2022.
  18. From intelligent agents to trustworthy human-centred multiagent systems. AI Communications, 35(4):443–457, January 2022. ISSN 0921-7126. 10.3233/AIC-220127. URL https://doi.org/10.3233/AIC-220127.
  19. VERIFAI: A Toolkit for the Design and Analysis of Artificial Intelligence-Based Systems. arXiv:1902.04245 [cs], February 2019. URL http://arxiv.org/abs/1902.04245. arXiv: 1902.04245.
  20. VOS: Learning What You Don’t Know by Virtual Outlier Synthesis. 2022. URL https://openreview.net/forum?id=TW7d65uYu5M.
  21. Output Range Analysis for Deep Feedforward Neural Networks. In Aaron Dutle, César Muñoz, and Anthony Narkawicz, editors, NASA Formal Methods, Lecture Notes in Computer Science, pages 121–138, Cham, 2018a. Springer International Publishing. ISBN 978-3-319-77935-5. 10.1007/978-3-319-77935-5_9.
  22. Output range analysis for deep feedforward neural networks. In Aaron Dutle, César Muñoz, and Anthony Narkawicz, editors, NASA Formal Methods, pages 121–138, Cham, 2018b. Springer International Publishing. ISBN 978-3-319-77935-5.
  23. Sherlock - a tool for verification of neural network feedback systems: Demo abstract. HSCC ’19, page 262–263, New York, NY, USA, 2019a. Association for Computing Machinery. ISBN 9781450362825. 10.1145/3302504.3313351. URL https://doi.org/10.1145/3302504.3313351.
  24. Reachability analysis for neural feedback systems using regressive polynomial rule inference. In Proceedings of the 22nd ACM International Conference on Hybrid Systems: Computation and Control, HSCC ’19, pages 157–168, New York, NY, USA, April 2019b. Association for Computing Machinery. ISBN 978-1-4503-6282-5. 10.1145/3302504.3311807. URL https://doi.org/10.1145/3302504.3311807.
  25. Distributionally robust statistical verification with imprecise neural networks, 2023. URL https://arxiv.org/abs/2308.14815.
  26. An Abstraction-Based Framework for Neural Network Verification. October 2019. URL http://arxiv.org/abs/1910.14574. arXiv: 1910.14574.
  27. Task-Driven Out-of-Distribution Detection with Statistical Guarantees for Robot Learning. In Proceedings of the 5th Conference on Robot Learning, pages 970–980. PMLR, January 2022. URL https://proceedings.mlr.press/v164/farid22a.html. ISSN: 2640-3498.
  28. Probabilistic Verification and Reachability Analysis of Neural Networks via Semidefinite Programming. In 2019 IEEE 58th Conference on Decision and Control (CDC), pages 2726–2731, December 2019. 10.1109/CDC40024.2019.9029310. ISSN: 2576-2370.
  29. AI2: Safety and Robustness Certification of Neural Networks with Abstract Interpretation. In 2018 IEEE Symposium on Security and Privacy (SP), pages 3–18, May 2018. 10.1109/SP.2018.00058.
  30. Judicious judgment meets unsettling updating: dilation, sure loss, and Simpson’s paradox. Statistical Science, 36(2):169–190, 2021.
  31. On calibration of modern neural networks. In Proceedings of the 34th International Conference on Machine Learning - Volume 70, ICML’17, pages 1321–1330, Sydney, NSW, Australia, August 2017. JMLR.org.
  32. Distribution-free binary classification: prediction sets, confidence intervals and calibration. arXiv:2006.10564 [cs, math, stat], February 2022. URL http://arxiv.org/abs/2006.10564. arXiv: 2006.10564.
  33. Robust statistics. Wiley Series in Probability and Statistics. Hoboken, New Jersey : Wiley, 2nd edition, 2009.
  34. Verisig 2.0: Verification of Neural Network Controllers Using Taylor Model Preconditioning. In Computer Aided Verification, pages 249–262, Cham, 2021. Springer International Publishing. ISBN 978-3-030-81685-8.
  35. A new definition of entropy of belief functions in the Dempster–Shafer theory. International Journal of Approximate Reasoning, 92:49–65, 2018.
  36. Reluplex: An Efficient SMT Solver for Verifying Deep Neural Networks. arXiv:1702.01135 [cs], February 2017. URL http://arxiv.org/abs/1702.01135. arXiv: 1702.01135.
  37. Automatic Abstraction Refinement in Neural Network Verification using Sensitivity Analysis. In Proceedings of the 26th ACM International Conference on Hybrid Systems: Computation and Control, HSCC ’23, pages 1–13, New York, NY, USA, May 2023. Association for Computing Machinery. ISBN 9798400700330. 10.1145/3575870.3587129. URL https://dl.acm.org/doi/10.1145/3575870.3587129.
  38. Simple and scalable predictive uncertainty estimation using deep ensembles. In Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS’17, pages 6405–6416, Red Hook, NY, USA, December 2017. Curran Associates Inc. ISBN 978-1-5108-6096-4.
  39. Statistical Model Checking Past, Present, and Future. In Tiziana Margaria and Bernhard Steffen, editors, Leveraging Applications of Formal Methods, Verification and Validation. Specialized Techniques and Applications, Lecture Notes in Computer Science, pages 135–142, Berlin, Heidelberg, 2014. Springer. ISBN 978-3-662-45231-8. 10.1007/978-3-662-45231-8_10.
  40. A simple and efficient sampling-based algorithm for general reachability analysis. In Roya Firoozi, Negar Mehr, Esen Yel, Rika Antonova, Jeannette Bohg, Mac Schwager, and Mykel Kochenderfer, editors, Proceedings of The 4th Annual Learning for Dynamics and Control Conference, volume 168 of Proceedings of Machine Learning Research, pages 1086–1099. PMLR, 23–24 Jun 2022. URL https://proceedings.mlr.press/v168/lew22a.html.
  41. When Gaussian Process Meets Big Data: A Review of Scalable GPs. IEEE Transactions on Neural Networks and Learning Systems, 31(11):4405–4423, November 2020. ISSN 2162-2388. 10.1109/TNNLS.2019.2957109. Conference Name: IEEE Transactions on Neural Networks and Learning Systems.
  42. Sample-efficient safety assurances using conformal prediction. In Steven M. LaValle, Jason M. O’Kane, Michael Otte, Dorsa Sadigh, and Pratap Tokekar, editors, Algorithmic Foundations of Robotics XV, pages 149–169, Cham, 2023. Springer International Publishing. ISBN 978-3-031-21090-7.
  43. Uncertainty Quantification with Statistical Guarantees in End-to-End Autonomous Driving Control. 2020 IEEE International Conference on Robotics and Automation (ICRA), 2020. 10.1109/ICRA40945.2020.9196844.
  44. Revisiting the Calibration of Modern Neural Networks. In Advances in Neural Information Processing Systems, volume 34, pages 15682–15694. Curran Associates, Inc., 2021.
  45. Sayan Mitra. Verifying Cyber-Physical Systems: A Path to Safe Autonomy. The MIT Press, Cambridge, Massachusetts, February 2021. ISBN 978-0-262-04480-6.
  46. Bayesian Safety Validation for Black-Box Systems. In Conference proceedings of the 2023 AIAA AVIATION Forum, May 2023. 10.48550/arXiv.2305.02449. URL http://arxiv.org/abs/2305.02449. arXiv:2305.02449 [cs, stat].
  47. Multi-agent reachability calibration with conformal prediction, 2023.
  48. Interval neural networks: Uncertainty scores. ArXiv, abs/2003.11566, 2020.
  49. Inductive confidence machines for regression. In Machine Learning: ECML 2002: 13th European Conference on Machine Learning Helsinki, Finland, August 19–23, 2002 Proceedings 13, pages 345–356. Springer, 2002.
  50. PAC Confidence Predictions for Deep Neural Network Classifiers. ICLR, 2021.
  51. HiddenGems: Efficient safety boundary detection with active learning. In 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 5147–5154, October 2022. 10.1109/IROS47612.2022.9982243. ISSN: 2153-0866.
  52. Statistical Verification of Autonomous Systems using Surrogate Models and Conformal Inference. In Proc. of ICCPS’22, July 2021. URL http://arxiv.org/abs/2004.00279. arXiv: 2004.00279.
  53. Statistical verification using surrogate models and conformal inference and a comparison with risk-aware verification. ACM Trans. Cyber-Phys. Syst., dec 2023. ISSN 2378-962X. 10.1145/3635160. URL https://doi.org/10.1145/3635160. Just Accepted.
  54. Carl Edward Rasmussen and Christopher K. I. Williams. Gaussian Processes for Machine Learning. The MIT Press, Cambridge, Mass, November 2005. ISBN 978-0-262-18253-9.
  55. Toward verified artificial intelligence. Communications of the ACM, 65(7):46–55, June 2022. ISSN 0001-0782. 10.1145/3503914. URL https://dl.acm.org/doi/10.1145/3503914.
  56. A Tutorial on Conformal Prediction. J. Mach. Learn. Res., 9:371–421, June 2008. ISSN 1532-4435. URL http://dl.acm.org/citation.cfm?id=1390681.1390693.
  57. OVERT: an algorithm for safety verification of neural network control policies for nonlinear systems. The Journal of Machine Learning Research, 23(1):117:5090–117:5134, January 2022. ISSN 1532-4435.
  58. A System-Level View on Out-of-Distribution Data in Robotics, 2022.
  59. Ralph C. Smith. Uncertainty Quantification: Theory, Implementation, and Applications. SIAM, December 2013. ISBN 978-1-61197-321-1. Google-Books-ID: 4c1GAgAAQBAJ.
  60. When Cyber-Physical Systems Meet AI: A Benchmark, an Evaluation, and a Way Forward. In Proceedings of the 44th International Conference on Software Engineering: Software Engineering in Practice, pages 343–352, May 2022. 10.1145/3510457.3513049. URL http://arxiv.org/abs/2111.04324. arXiv:2111.04324 [cs].
  61. Sullivan. Introduction to Uncertainty Quantification. Springer, New York, NY, 1st ed. 2015 edition edition, December 2015. ISBN 978-3-319-23394-9.
  62. NNV: The Neural Network Verification Tool for Deep Neural Networks and Learning-Enabled Cyber-Physical Systems. In Computer Aided Verification, 2020.
  63. Lower Previsions. Chichester, United Kingdom : John Wiley and Sons, 2014.
  64. Algorithmic Learning in a Random World. Springer, New York, 2005 edition edition, March 2005. ISBN 978-0-387-00152-4.
  65. Peter Walley. Statistical Reasoning with Imprecise Probabilities, volume 42 of Monographs on Statistics and Applied Probability. London : Chapman and Hall, 1991.
  66. Probabilistic conformance for cyber-physical systems. In Proceedings of the ACM/IEEE 12th International Conference on Cyber-Physical Systems, ICCPS ’21, pages 55–66, New York, NY, USA, May 2021. Association for Computing Machinery. ISBN 978-1-4503-8353-0. 10.1145/3450267.3450534. URL https://doi.org/10.1145/3450267.3450534.
  67. Larry Wasserman. Recent methodological advances in robust Bayesian inference. Bayesian statistics, 4:483–502, 1992.
  68. Bayes’ theorem for Choquet capacities. The Annals of Statistics, 18(3):1328–1339, 1990.
  69. Probabilistic Safety for Bayesian Neural Networks. In Proceedings of the 36th Conference on Uncertainty in Artificial Intelligence (UAI), pages 1198–1207. PMLR, August 2020. URL https://proceedings.mlr.press/v124/wicker20a.html. ISSN: 2640-3498.
  70. Statistical verification of learning-based cyber-physical systems. In Proceedings of the 23rd International Conference on Hybrid Systems: Computation and Control, HSCC ’20, pages 1–7, New York, NY, USA, April 2020. Association for Computing Machinery. ISBN 978-1-4503-7018-9. 10.1145/3365365.3382209. URL https://doi.org/10.1145/3365365.3382209.
  71. FalsifAI: Falsification of AI-Enabled Hybrid Control Systems Guided by Time-Aware Coverage Criteria. IEEE Transactions on Software Engineering, 49(4):1842–1859, April 2023. ISSN 1939-3520. 10.1109/TSE.2022.3194640. Conference Name: IEEE Transactions on Software Engineering.
Citations (4)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com