Stochastic Occupancy Grid Map Prediction in Dynamic Scenes (2210.08577v2)
Abstract: This paper presents two variations of a novel stochastic prediction algorithm that enables mobile robots to accurately and robustly predict the future state of complex dynamic scenes. The proposed algorithm uses a variational autoencoder to predict a range of possible future states of the environment. The algorithm takes full advantage of the motion of the robot itself, the motion of dynamic objects, and the geometry of static objects in the scene to improve prediction accuracy. Three simulated and real-world datasets collected by different robot models are used to demonstrate that the proposed algorithm is able to achieve more accurate and robust prediction performance than other prediction algorithms. Furthermore, a predictive uncertainty-aware planner is proposed to demonstrate the effectiveness of the proposed predictor in simulation and real-world navigation experiments. Implementations are open source at https://github.com/TempleRAIL/SOGMP.
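As a rough illustration of what "a range of possible future states" can offer a planner, the sketch below aggregates several sampled occupancy grid predictions into a per-cell mean, standard deviation, and binary entropy. It is a minimal sketch and not the authors' implementation: the function name `summarize_samples`, the `(K, H, W)` array layout, and the random stand-in for the predictor output are all assumptions made for this example.

```python
import numpy as np


def summarize_samples(samples):
    """Aggregate K sampled occupancy grids into per-cell statistics.

    samples: array-like of shape (K, H, W) holding occupancy
    probabilities in [0, 1] (hypothetical layout, assumed here).
    Returns (mean, std, entropy), each of shape (H, W).
    """
    samples = np.asarray(samples, dtype=float)
    mean = samples.mean(axis=0)  # expected occupancy per cell
    std = samples.std(axis=0)    # spread across the sampled futures
    # Binary (Shannon) entropy of the mean occupancy, in bits.
    # Clipping avoids log(0) for cells that are certainly free or occupied.
    p = np.clip(mean, 1e-6, 1.0 - 1e-6)
    entropy = -(p * np.log2(p) + (1.0 - p) * np.log2(1.0 - p))
    return mean, std, entropy


# Stand-in for K = 8 stochastic predictions on a 4 x 4 grid.
# A real system would draw these from the learned predictor instead.
rng = np.random.default_rng(0)
samples = rng.random((8, 4, 4))
samples[:, 0, 0] = 0.0                       # all samples agree: cell is free
samples[:, 1, 1] = [0, 1, 0, 1, 0, 1, 0, 1]  # samples disagree completely

mean, std, entropy = summarize_samples(samples)
```

Cells where the samples agree have entropy close to zero, while cells where they disagree approach one bit. An uncertainty-aware planner could, for example, treat high-entropy cells more conservatively than a single deterministic prediction would suggest.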
- Zhanteng Xie
- Philip Dames