Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
88 tokens/sec
GPT-4o
11 tokens/sec
Gemini 2.5 Pro Pro
52 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
10 tokens/sec
DeepSeek R1 via Azure Pro
33 tokens/sec
Gemini 2.5 Flash Deprecated
12 tokens/sec
2000 character limit reached

Not All Errors Are Made Equal: A Regret Metric for Detecting System-level Trajectory Prediction Failures (2403.04745v4)

Published 7 Mar 2024 in cs.RO

Abstract: Robot decision-making increasingly relies on data-driven human prediction models when operating around people. While these models are known to mispredict in out-of-distribution interactions, only a subset of prediction errors impact downstream robot performance. We propose characterizing such "system-level" prediction failures via the mathematical notion of regret: high-regret interactions are precisely those in which mispredictions degraded closed-loop robot performance. We further introduce a probabilistic generalization of regret that calibrates failure detection across disparate deployment contexts and renders regret compatible with reward-based and reward-free (e.g., generative) planners. In simulated autonomous driving interactions and social navigation interactions deployed on hardware, we showcase that our system-level failure metric can be used offline to automatically extract closed-loop human-robot interactions that state-of-the-art generative human predictors and robot planners previously struggled with. We further find that the very presence of high-regret data during human predictor fine-tuning is highly predictive of robot re-deployment performance improvements. Fine-tuning with the informative but significantly smaller high-regret data (23% of deployment data) is competitive with fine-tuning on the full deployment dataset, indicating a promising avenue for efficiently mitigating system-level human-robot interaction failures. Project website: https://cmu-intentlab.github.io/not-all-errors/

Definition Search Book Streamline Icon: https://streamlinehq.com
References (48)
  1. i-sim2real: Reinforcement learning of robotic policies in tight human-robot interaction loops. In Conference on Robot Learning. PMLR, 2023.
  2. Robust online model adaptation by extended kalman filter with exponential moving average and dynamic multi-epoch strategy. In Learning for Dynamics and Control, pages 65–74. PMLR, 2020.
  3. Task-aware risk estimation of perception failures for autonomous vehicles. arXiv preprint arXiv:2305.01870, 2023.
  4. Hamilton-jacobi reachability: A brief overview and recent advances. In 2017 IEEE 56th Annual Conference on Decision and Control (CDC). IEEE, 2017.
  5. nuscenes: A multimodal dataset for autonomous driving. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020.
  6. Discovering closed-loop failures of vision-based controllers via reachability analysis. IEEE Robotics and Automation Letters, 2023.
  7. Tree-structured policy planning with learned behavior models. In 2023 IEEE International Conference on Robotics and Automation (ICRA), 2023.
  8. Interactive joint planning for autonomous vehicles. IEEE Robotics and Automation Letters, 2024.
  9. Human motion prediction using semi-adaptable neural networks. In 2019 American Control Conference (ACC), pages 4884–4890. IEEE, 2019.
  10. L. E. Dubins. On curves of minimal length with a constraint on average curvature, and with prescribed initial and terminal positions and tangents. American Journal of Mathematics, 79(3):497–516, 1957. ISSN 00029327, 10806377. URL http://www.jstor.org/stable/2372560.
  11. Semantic anomaly detection with large language models. Autonomous Robots, 47(8), 2023.
  12. Task-relevant failure detection for trajectory predictors in autonomous vehicles. In Conference on Robot Learning. PMLR, 2023.
  13. Confidence-aware motion prediction for real-time collision avoidance. The International Journal of Robotics Research, 2020.
  14. Rethinking integration of prediction and planning in deep learning-based automated driving systems: a review. arXiv preprint arXiv:2308.05731, 2023.
  15. A baseline for detecting misclassified and out-of-distribution examples in neural networks. arXiv preprint arXiv:1610.02136, 2016.
  16. Interpretable trajectory prediction for autonomous vehicles viacounterfactual responsibility. In IEEE/RSJ Int. Conf. on Intelligent Robots & Systems, 2023.
  17. Deception game: Closing the safety-learning loop in interactive robot autonomy. In 7th Annual Conference on Robot Learning, 2023. URL https://openreview.net/forum?id=0o2JgvlzMUc.
  18. Interpretable self-aware neural networks for robust trajectory prediction. In Conference on Robot Learning. PMLR, 2023.
  19. The trajectron: Probabilistic multi-agent trajectory modeling with dynamic spatiotemporal graphs. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019.
  20. Injecting planning-awareness into prediction and detection evaluation. In 2022 IEEE Intelligent Vehicles Symposium (IV). IEEE, 2022.
  21. Expanding the deployment envelope of behavior prediction via adaptive meta-learning. In 2023 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2023.
  22. Interact: Transformer models for human intent prediction conditioned on robot actions. arXiv preprint arXiv:2311.12943, 2023a.
  23. Manicast: Collaborative manipulation with cost-aware human forecasting. arXiv preprint arXiv:2310.13258, 2023b.
  24. A game-theoretic framework for joint forecasting and planning. International Conference on Intelligent Robots and Systems, 2023c.
  25. Auto-Encoding Variational Bayes. In 2nd International Conference on Learning Representations, ICLR 2014, 2014.
  26. Philip Koopman. The heavy tail safety ceiling. In Automated and Connected Vehicle Systems Testing Symposium. SAE, 2018.
  27. A simple unified framework for detecting out-of-distribution samples and adversarial attacks. Advances in neural information processing systems, 31, 2018.
  28. Pre-training on synthetic driving data for trajectory prediction. arXiv preprint arXiv:2309.10121, 2023.
  29. Towards robust and adaptive motion forecasting: A causal representation perspective. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022.
  30. R Duncan Luce. Individual choice behavior: A theoretical analysis. Courier Corporation, 2005.
  31. Online distribution shift detection via recency prediction, 2023.
  32. Online update of safety assurances using confidence-based predictions. In 2023 IEEE International Conference on Robotics and Automation (ICRA), 2023.
  33. Run-time monitoring of machine learning for robotic perception: A survey of emerging trends. IEEE Access, 9, 2021. ISSN 2169-3536.
  34. Motionlm: Multi-agent motion forecasting as language modeling. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023.
  35. A system-level view on out-of-distribution data in robotics. arXiv preprint arXiv:2212.14020, 2022.
  36. Safeshift: Safety-informed distribution shifts for robust trajectory prediction in autonomous driving, 2023.
  37. On complementing end-to-end human behavior predictors with planning. Robotics: Science and Systems, 2021.
  38. Interventional behavior prediction: Avoiding overly confident anticipation in interactive prediction. In 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2022.
  39. Adaptive human trajectory prediction via latent corridors, 2023.
  40. Safety assurances for human-robot interaction via confidence-aware game-theoretic human models. In 2022 International Conference on Robotics and Automation (ICRA). IEEE, 2022.
  41. Identifying driver interactions via conditional behavior prediction. In 2021 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2021.
  42. Neural discrete representation learning. Advances in neural information processing systems, 30, 2017.
  43. Joint out-of-distribution detection and uncertainty estimation for trajectory prediction. In 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2023.
  44. Bits: Bi-level imitation for traffic simulation. In 2023 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2023.
  45. Visual anomaly detection for images: a systematic survey. Procedia Computer Science, 2022.
  46. Agentformer: Agent-aware transformers for socio-temporal multi-agent forecasting. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021.
  47. Out-of-domain detection for natural language understanding in dialog systems. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2020.
  48. Martin Zinkevich. Online convex programming and generalized infinitesimal gradient ascent. In Proceedings of the 20th international conference on machine learning (icml-03), 2003.
Citations (1)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com