Not All Errors Are Made Equal: A Regret Metric for Detecting System-level Trajectory Prediction Failures (2403.04745v4)
Abstract: Robot decision-making increasingly relies on data-driven human prediction models when operating around people. While these models are known to mispredict in out-of-distribution interactions, only a subset of prediction errors impacts downstream robot performance. We propose characterizing such "system-level" prediction failures via the mathematical notion of regret: high-regret interactions are precisely those in which mispredictions degraded closed-loop robot performance. We further introduce a probabilistic generalization of regret that calibrates failure detection across disparate deployment contexts and renders regret compatible with reward-based and reward-free (e.g., generative) planners. In simulated autonomous driving interactions and social navigation interactions deployed on hardware, we show that our system-level failure metric can be used offline to automatically extract closed-loop human-robot interactions that state-of-the-art generative human predictors and robot planners previously struggled with. We further find that the very presence of high-regret data during human predictor fine-tuning is highly predictive of robot re-deployment performance improvements. Fine-tuning with the informative but significantly smaller high-regret data (23% of deployment data) is competitive with fine-tuning on the full deployment dataset, indicating a promising avenue for efficiently mitigating system-level human-robot interaction failures. Project website: https://cmu-intentlab.github.io/not-all-errors/
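To make the notion of regret in the abstract concrete, here is a minimal sketch under stated assumptions, not the paper's implementation. It assumes that, for each logged interaction, a score is available for every candidate robot plan evaluated in hindsight against the human behavior that was actually observed. The function names, the toy scores, and the threshold are all hypothetical.

```python
from typing import List, Sequence


def regret(candidate_scores: Sequence[float], executed_index: int) -> float:
    """Hindsight regret: best achievable score minus the executed plan's score.

    candidate_scores are assumed to be evaluated against the observed
    (ground-truth) human behavior, not the predicted one.
    """
    return max(candidate_scores) - candidate_scores[executed_index]


def high_regret_indices(regrets: Sequence[float], threshold: float) -> List[int]:
    """Indices of interactions whose regret exceeds a chosen threshold."""
    return [i for i, r in enumerate(regrets) if r > threshold]


# Hypothetical toy log: (hindsight scores per candidate plan, executed plan index).
scenarios = [
    ([1.0, 0.8, 0.2], 0),    # executed plan was also best in hindsight
    ([0.9, 0.1, 0.5], 1),    # misprediction led to a much worse plan
    ([0.3, 0.4, 0.35], 2),   # executed plan was nearly as good as the best
]

regrets = [regret(scores, k) for scores, k in scenarios]
flagged = high_regret_indices(regrets, threshold=0.5)  # illustrative threshold
print(flagged)
```

In this toy log only the second interaction is flagged: the other two either involved no loss or a negligible one, even if the underlying prediction was wrong. The same functions could be applied to plan likelihoods under a generative planner instead of rewards, which is one way to read the probabilistic generalization mentioned in the abstract; the paper's exact definition may differ.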