Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
162 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Knowledge-aware Graph Transformer for Pedestrian Trajectory Prediction (2401.04872v1)

Published 10 Jan 2024 in cs.CV, cs.LG, and cs.RO

Abstract: Predicting pedestrian motion trajectories is crucial for path planning and motion control of autonomous vehicles. Accurately forecasting crowd trajectories is challenging due to the uncertain nature of human motions in different environments. For training, recent deep learning-based prediction approaches mainly utilize information like trajectory history and interactions between pedestrians, among others. This can limit the prediction performance across various scenarios since the discrepancies between training datasets have not been properly incorporated. To overcome this limitation, this paper proposes a graph transformer structure to improve prediction performance, capturing the differences between the various sites and scenarios contained in the datasets. In particular, a self-attention mechanism and a domain adaption module have been designed to improve the generalization ability of the model. Moreover, an additional metric considering cross-dataset sequences is introduced for training and performance evaluation purposes. The proposed framework is validated and compared against existing methods using popular public datasets, i.e., ETH and UCY. Experimental results demonstrate the improved performance of our proposed scheme.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (28)
  1. K. Li, M. Shan, K. Narula, S. Worrall, and E. Nebot, “Socially aware crowd navigation with multimodal pedestrian trajectory prediction for autonomous vehicles,” in IEEE International Conference on Intelligent Transportation Systems (ITSC), pp. 1–8, 2020.
  2. Y. Cai, L. Dai, H. Wang, L. Chen, Y. Li, M. A. Sotelo, and Z. Li, “Pedestrian motion trajectory prediction in intelligent driving from far shot first-person perspective video,” IEEE Transactions on Intelligent Transportation Systems, vol. 23, no. 6, pp. 5298-5313, 2021.
  3. C. Wang, Y. Wang, M. Xu, and D. J. Crandall, “Stepwise goal-driven networks for trajectory prediction,” IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 2716–2723, Apr. 2022.
  4. S. Eiffert, H. Kong, N. Pirmarzdashti, and S. Sukkarieh, “Path planning in dynamic environments using generative RNNs and Monte Carlo tree search,” in IEEE International Conference on Robotics and Automation (ICRA), 2020.
  5. M. Moussaid, N. Perozo, S. Garnier, D. Helbing, and G. Theraulaz, “The walking behaviour of pedestrian social groups and its impact on crowd dynamics,” PLoS One, vol. 5, no. 4, p. e10047, 2010.
  6. S. Lefevre, C. Laugier, and J. Ibanez-Guzman, “Exploiting map information for driver intention estimation at road intersections,” in Proc. IEEE Intell. Veh. Symp. (IV) , pp. 583–588, 2011.
  7. R. Chai, A. Tsourdos, A. Savvaris, S. Chai, Y. Xia, C.L.P. Chen, “Design and implementation of deep neural network-based control for automatic parking maneuver process,” IEEE Trans. Neural Networks Learn. Syst., vol. 33, no. 4, pp. 1400–1413, Apr. 2022
  8. D. Helbing and P. Molnar, “Social force model for pedestrian dynamics.” Physical Review E, vol. 51, no. 5, pp. 4282–4286, May 1995.
  9. A. Alahi, K. Goel, V. Ramanathan, A. Robicquet, L. Fei-Fei, and S. Savarese, “Social LSTM: Human trajectory prediction in crowded spaces,” in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 961–971, 2016.
  10. A. Gupta, J. Johnson, L. Fei-Fei, S. Savarese, and A. Alahi, “Social GAN: Socially acceptable trajectories with generative adversarial networks,” in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2255–2264, 2018.
  11. S. Eiffert, K. Li, M. Shan, S. Worrall, S. Sukkarieh, and E. Nebot. “Probabilistic crowd GAN: Multimodal pedestrian trajectory prediction using a graph vehicle-pedestrian attention networ,” IEEE Robotics and Automation Letters, vol. 5, no. 4, pp. 5026–5033, Oct. 2020.
  12. A. Sadeghian, V.Kosaraju, A. Sadeghian,N.Hirose, S. H. Rezatofighi, and S. Savarese, “SoPhie: An attentive GAN for predicting paths compliant to social and physical constraints,” in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1349–1358, 2018.
  13. V. Kosaraju, A. Sadeghian, R. Martín-Martín, I. Reid, H. Rezatofighi, S. Savarese, “Social-bigat: Multimodal trajectory forecasting using bicycle-GAN and graph attention networks,” in Advances in Neural Information Processing Systems (NIPS), pp. 137–146, 2019.
  14. A. Vemula, K. Muelling, and J. Oh, “Social attention: Modeling attention in human crowds,” in IEEE International Conference on Robotics and Automation (ICRA), pp. 4601–4607, May 2018.
  15. X. Li, X. Ying, and M. C. Chuah, “GRIP: Graph-based interaction-aware trajectory prediction,” in IEEE International Conference on Intelligent Transportation Systems (ITSC), pp. 3960–3966, Oct. 2019.
  16. A. Mohamed, K. Qian, M. Elhoseiny, and C. Claudel, “Social-STGCNN: A social spatio-temporal graph convolutional neural network for human trajectory prediction,” in IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 14424–14432, 2020.
  17. B. Xie, Y. Deng, Z. Shao, H. Liu, and Y. Li, “VMV-GCN: Volumetric multi-view based graph CNN for event stream classification,” in IEEE Robotics and Automation Letters, vol. 7, no. 2, pp. 1976–1983, Apr. 2022.
  18. J. Bruna, W. Zaremba, A. Szlam, and Y. LeCun, “Spectral networks and locally connected networks on graphs,” in Int. Conf. Learn. Representations, pp. 1–14, 2014.
  19. M. Defferrard, X. Bresson, and P. Van der Gheynst, “Convolutional neural networks on graphs with fast localized spectral filtering,” in Advances in Neural Information Processing Systems (NIPS), pp. 3844–3852, 2016.
  20. T. N. Kipf and M. Welling, “Semi-supervised classification with graph convolutional networks,” in Int. Conf. Learn. Representations, pp. 1–14, 2017.
  21. W. Hamilton, Z. Ying, and J. Leskovec, “Inductive representation learning on large graphs,” in Advances in neural information processing systems (NIPS), pp. 1024–1034, 2017.
  22. P. Velickovic, G. Cucurull, A. Casanova, A. Romero, P. Lio, and Y. Bengio, “Graph attention networks,” Int. Conf. Learn. Representations, 2017, pp. 1–12.
  23. J. Liang, L. Jiang, J. C. Niebles, A. Hauptmann, and L. Fei-Fei, “Peeking into the future: Predicting future person activities and locations in videos,” in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. Workshops (CVPRW), pp. 5718–5727, Jun. 2019.
  24. K. Li, S. Eiffert, M. Shan, F. Gomez-Donoso, S. Worrall, and E. Nebot, “Attentional-GCNN: Adaptive pedestrian trajectory prediction towards generic autonomous vehicle use case,” in IEEE International Conference on Robotics and Automation (ICRA), pp. 14241–14247, May 2021.
  25. A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin, “Attention is all you need,” in Advances in Neural Information Processing Systems (NIPS), pp. 6000–6010, 2017.
  26. S. Pellegrini, A. Ess, K. Schindler, and L. van Gool, “You’ll never walk alone: Modeling social behavior for multi-target tracking,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009.
  27. A. Lerner, Y. Chrysanthou, and D. Lischinski, “Crowds by example,” Computer Graphics Forum, vol. 26, no. 3, pp. 655–664, 2007.
  28. S. Pellegrini, A. Ess, and L. Van Gool, “Improving data association by joint modeling of pedestrian trajectories and groupings,” in Eur. Conf. Comput. Vis.(ECCV), pp. 452–465, 2010.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets