Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Sim-to-Real Causal Transfer: A Metric Learning Approach to Causally-Aware Interaction Representations (2312.04540v1)

Published 7 Dec 2023 in cs.LG, cs.AI, cs.CV, cs.MA, and cs.RO

Abstract: Modeling spatial-temporal interactions among neighboring agents is at the heart of multi-agent problems such as motion forecasting and crowd navigation. Despite notable progress, it remains unclear to which extent modern representations can capture the causal relationships behind agent interactions. In this work, we take an in-depth look at the causal awareness of these representations, from computational formalism to real-world practice. First, we cast doubt on the notion of non-causal robustness studied in the recent CausalAgents benchmark. We show that recent representations are already partially resilient to perturbations of non-causal agents, and yet modeling indirect causal effects involving mediator agents remains challenging. To address this challenge, we introduce a metric learning approach that regularizes latent representations with causal annotations. Our controlled experiments show that this approach not only leads to higher degrees of causal awareness but also yields stronger out-of-distribution robustness. To further operationalize it in practice, we propose a sim-to-real causal transfer method via cross-domain multi-task learning. Experiments on pedestrian datasets show that our method can substantially boost generalization, even in the absence of real-world causal annotations. We hope our work provides a new perspective on the challenges and potential pathways towards causally-aware representations of multi-agent interactions. Our code is available at https://github.com/socialcausality.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (70)
  1. Socially-Aware Large-Scale Crowd Forecasting. In 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp.  2211–2218, June 2014.
  2. Social LSTM: Human Trajectory Prediction in Crowded Spaces. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.  961–971, June 2016.
  3. Invariant Risk Minimization. arXiv:1907.02893 [cs, stat], March 2020.
  4. Generative Causal Representation Learning for Out-of-Distribution Motion Forecasting, February 2023.
  5. AdvDO: Realistic Adversarial Attacks for Trajectory Prediction. In Shai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, and Tal Hassner (eds.), Computer Vision – ECCV 2022, Lecture Notes in Computer Science, pp.  36–52, Cham, 2022. Springer Nature Switzerland. ISBN 978-3-031-20065-6.
  6. Robust Trajectory Prediction against Adversarial Attacks. In Proceedings of The 6th Conference on Robot Learning, pp.  128–137. PMLR, March 2023.
  7. Causal Discovery of Dynamic Models for Predicting Human Spatial Interactions. In Filippo Cavallo, John-John Cabibihan, Laura Fiorini, Alessandra Sorrentino, Hongsheng He, Xiaorui Liu, Yoshio Matsumoto, and Shuzhi Sam Ge (eds.), Social Robotics, Lecture Notes in Computer Science, pp.  154–164, Cham, 2022. Springer Nature Switzerland. ISBN 978-3-031-24667-8.
  8. MultiPath: Multiple Probabilistic Anchor Trajectory Hypotheses for Behavior Prediction. In Conference on Robot Learning, pp.  86–99. PMLR, May 2020.
  9. Crowd-Robot Interaction: Crowd-Aware Robot Navigation With Attention-Based Deep Reinforcement Learning. In International Conference on Robotics and Automation (ICRA), pp.  6015–6022, May 2019.
  10. Human Trajectory Prediction via Counterfactual Analysis. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp.  9824–9833, 2021.
  11. Unsupervised Sampling Promoting for Stochastic Human Trajectory Prediction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp.  17874–17884, 2023.
  12. Revisiting Parameter-Efficient Tuning: Are We Really There Yet? In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pp.  2612–2626, December 2022.
  13. InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets. In D. D. Lee, M. Sugiyama, U. V. Luxburg, I. Guyon, and R. Garnett (eds.), Advances in Neural Information Processing Systems 29, pp. 2172–2180. Curran Associates, Inc., 2016.
  14. DROGON: A Trajectory Prediction Model based on Intention-Conditioned Behavior Reasoning. In Proceedings of the 2020 Conference on Robot Learning, pp.  49–63. PMLR, October 2021.
  15. Convolutional Social Pooling for Vehicle Trajectory Prediction. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp.  1549–15498, June 2018.
  16. On the Transfer of Disentangled Representations in Realistic Settings. In International Conference on Learning Representations, September 2020.
  17. Generalization and Robustness Implications in Object-Centric Learning. In Proceedings of the 39th International Conference on Machine Learning, pp.  5221–5285. PMLR, June 2022.
  18. Large Scale Interactive Motion Forecasting for Autonomous Driving: The Waymo Open Motion Dataset. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp.  9710–9719, 2021.
  19. Robot companion: A social-force based approach with human awareness-navigation in crowded environments. In 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp.  1688–1694, November 2013.
  20. Latent Variable Sequential Set Transformers for Joint Multi-Agent Motion Prediction. In International Conference on Learning Representations, October 2021.
  21. DenseTNT: Waymo Open Dataset Motion Prediction Challenge 1st Place Solution. arXiv:2106.14160 [cs], June 2021.
  22. Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp.  2255–2264, June 2018.
  23. Social Force Model for Pedestrian Dynamics. Physics Review E, May 1998.
  24. AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty. In International Conference on Learning Representations, March 2020.
  25. Beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework. In International Conference on Learning Representations, 2017.
  26. Causal-based Time Series Domain Generalization for Vehicle Intention Prediction. In 2022 International Conference on Robotics and Automation (ICRA), pp.  7806–7813, May 2022.
  27. AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning. In International Conference on Learning Representations, January 2022.
  28. STGAT: Modeling Spatial-Temporal Interactions for Human Trajectory Prediction. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp.  6272–6281, 2019.
  29. Social-BiGAT: Multimodal Trajectory Forecasting using Bicycle-GAN and Graph Attention Networks. Advances in Neural Information Processing Systems, 32:137–146, 2019.
  30. Human Trajectory Forecasting in Crowds: A Deep Learning Perspective. IEEE Transactions on Intelligent Transportation Systems, pp. 1–15, 2021.
  31. Motion Style Transfer: Modular Low-Rank Adaptation for Deep Motion Forecasting. In Conference on Robot Learning (CoRL), November 2022.
  32. Out-of-Distribution Generalization via Risk Extrapolation (REx). In Proceedings of the 38th International Conference on Machine Learning, pp.  5815–5826. PMLR, July 2021.
  33. Surgical Fine-Tuning Improves Adaptation to Distribution Shifts. In International Conference on Learning Representations, 2023.
  34. Crowds by Example. Computer Graphics Forum, 26:655–664, 2007.
  35. EvolveGraph: Multi-Agent Trajectory Prediction with Dynamic Relational Reasoning. In Advances in Neural Information Processing Systems, volume 33, pp.  19783–19794. Curran Associates, Inc., 2020.
  36. SimAug: Learning Robust Representations from Simulation for Trajectory Prediction. In Andrea Vedaldi, Horst Bischof, Thomas Brox, and Jan-Michael Frahm (eds.), Computer Vision – ECCV 2020, Lecture Notes in Computer Science, pp.  275–292, Cham, 2020. Springer International Publishing. ISBN 978-3-030-58601-0.
  37. Social NCE: Contrastive Learning of Socially-Aware Motion Representations. In IEEE/CVF International Conference on Computer Vision (ICCV), pp.  15118–15129, 2021.
  38. Towards Robust and Adaptive Motion Forecasting: A Causal Representation Perspective. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.  17081–17092, 2022.
  39. Causal Triplet: An Open Challenge for Intervention-centric Causal Representation Learning. In Conference on Causal Learning and Reasoning (CLeaR), April 2023.
  40. Challenging Common Assumptions in the Unsupervised Learning of Disentangled Representations. In Proceedings of the 36th International Conference on Machine Learning, pp.  4114–4124. PMLR, May 2019.
  41. Object-Centric Learning with Slot Attention. In Advances in Neural Information Processing Systems, volume 33, pp.  11525–11538. Curran Associates, Inc., 2020.
  42. People tracking with human motion predictions from social forces. In 2010 IEEE International Conference on Robotics and Automation, pp.  464–469, May 2010.
  43. You Mostly Walk Alone: Analyzing Feature Attribution in Trajectory Prediction. In International Conference on Learning Representations, September 2021.
  44. From Goals, Waypoints & Paths to Long Term Human Trajectory Forecasting. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pp.  15233–15242, 2021.
  45. CausalCity: Complex Simulations with Agency for Causal Discovery and Reasoning. In Proceedings of the First Conference on Causal Learning and Reasoning, pp.  559–575. PMLR, June 2022.
  46. Abnormal crowd behavior detection using social force model. In 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp.  935–942, June 2009.
  47. Social-STGCNN: A Social Spatio-Temporal Graph Convolutional Neural Network for Human Trajectory Prediction. In 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.  14412–14420, June 2020.
  48. The role of Disentanglement in Generalisation. In International Conference on Learning Representations, February 2022.
  49. Fast User Adaptation for Human Motion Prediction in Physical Human–Robot Interaction. IEEE Robotics and Automation Letters, 7:120–127, January 2022.
  50. Improving Data Association by Joint Modeling of Pedestrian Trajectories and Groupings. In Computer Vision – ECCV 2010, Lecture Notes in Computer Science, pp.  452–465, Berlin, Heidelberg, 2010. Springer. ISBN 978-3-642-15549-9.
  51. Elements of Causal Inference: Foundations and Learning Algorithms. Adaptive Computation and Machine Learning Series. MIT Press, Cambridge, MA, USA, November 2017. ISBN 978-0-262-03731-0.
  52. Do ImageNet Classifiers Generalize to ImageNet? In Proceedings of the 36th International Conference on Machine Learning, pp.  5389–5400. PMLR, May 2019.
  53. PRECOG: PREdiction Conditioned on Goals in Visual Multi-Agent Settings. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp.  2821–2830, October 2019.
  54. CausalAgents: A Robustness Benchmark for Motion Forecasting using Causal Relationships, October 2022.
  55. Human motion trajectory prediction: A survey. The International Journal of Robotics Research, 39:895–935, July 2020.
  56. Are socially-aware trajectory prediction models really socially-aware? Transportation Research Part C: Emerging Technologies, 141:103705, August 2022.
  57. SoPhie: An Attentive GAN for Predicting Paths Compliant to Social and Physical Constraints. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp.  1349–1358, 2019.
  58. Trajectron++: Dynamically-Feasible Trajectory Forecasting with Heterogeneous Data. In Andrea Vedaldi, Horst Bischof, Thomas Brox, and Jan-Michael Frahm (eds.), Computer Vision – ECCV 2020, Lecture Notes in Computer Science, pp.  683–700, Cham, 2020. Springer International Publishing. ISBN 978-3-030-58523-5.
  59. Bernhard Schölkopf. Causality for Machine Learning. arXiv:1911.10500 [cs, stat], December 2019.
  60. Bernhard Schölkopf and Julius von Kügelgen. From Statistical to Causal Learning. arXiv:2204.00607 [cs, stat], April 2022.
  61. Toward Causal Representation Learning. Proceedings of the IEEE, 109:612–634, May 2021.
  62. Reciprocal Velocity Obstacles for real-time multi-agent navigation. In 2008 IEEE International Conference on Robotics and Automation, pp.  1928–1935, May 2008.
  63. Reciprocal n-body collision avoidance. In Cédric Pradalier, Roland Siegwart, and Gerhard Hirzinger (eds.), Robotics Research, pp.  3–19, Berlin, Heidelberg, 2011. Springer Berlin Heidelberg. ISBN 978-3-642-19457-3.
  64. Are Disentangled Representations Helpful for Abstract Visual Reasoning? In Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc., 2019.
  65. Social Attention: Modeling Attention in Human Crowds. In 2018 IEEE International Conference on Robotics and Automation (ICRA), pp.  1–7, Brisbane, Australia, May 2018. IEEE Press.
  66. PreTraM: Self-supervised Pre-training via Connecting Trajectory and Map. In Shai Avidan, Gabriel Brostow, Moustapha Cissé, Giovanni Maria Farinella, and Tal Hassner (eds.), Computer Vision – ECCV 2022, Lecture Notes in Computer Science, pp.  34–50, Cham, 2022. Springer Nature Switzerland. ISBN 978-3-031-19842-7.
  67. EqMotion: Equivariant Multi-Agent Motion Prediction With Invariant Interaction Reasoning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp.  1410–1420, 2023.
  68. Social force model with explicit collision prediction. EPL (Europhysics Letters), 93:68005, March 2011.
  69. INTERACTION Dataset: An INTERnational, Adversarial and Cooperative moTION Dataset in Interactive Driving Scenarios with Semantic Maps, September 2019.
  70. Train Offline, Test Online: A Real Robot Learning Benchmark, June 2023.
Citations (3)

Summary

We haven't generated a summary for this paper yet.

Github Logo Streamline Icon: https://streamlinehq.com
X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets