Multimodal Anomaly Detection based on Deep Auto-Encoder for Object Slip Perception of Mobile Manipulation Robots (2403.03563v1)
Abstract: Object slip perception is essential for mobile manipulation robots to perform manipulation tasks reliably in the dynamic real-world. Traditional approaches to robot arms' slip perception use tactile or vision sensors. However, mobile robots still have to deal with noise in their sensor signals caused by the robot's movement in a changing environment. To solve this problem, we present an anomaly detection method that utilizes multisensory data based on a deep autoencoder model. The proposed framework integrates heterogeneous data streams collected from various robot sensors, including RGB and depth cameras, a microphone, and a force-torque sensor. The integrated data is used to train a deep autoencoder to construct latent representations of the multisensory data that indicate the normal status. Anomalies can then be identified by error scores measured by the difference between the trained encoder's latent values and the latent values of reconstructed input data. In order to evaluate the proposed framework, we conducted an experiment that mimics an object slip by a mobile service robot operating in a real-world environment with diverse household objects and different moving patterns. The experimental results verified that the proposed framework reliably detects anomalies in object slip situations despite various object types and robot behaviors, and visual and auditory noise in the environment.
- R. D. Howe and M. R. Cutkosky, “Sensing skin acceleration for slip and texture perception.” in International Conference on Robotics and Automation (ICRA), 1989, pp. 145–150.
- J. W. James, N. Pestell, and N. F. Lepora, “Slip detection with a biomimetic tactile sensor,” IEEE Robotics and Automation Letters (RA-L), vol. 3, no. 4, pp. 3340–3346, 2018.
- B. S. Zapata-Impata, P. Gil, and F. Torres, “Tactile-driven grasp stability and slip prediction,” Robotics, vol. 8, no. 4, p. 85, 2019.
- J. Li, S. Dong, and E. Adelson, “Slip detection with combined tactile and visual information,” in International Conference on Robotics and Automation (ICRA), 2018, pp. 7772–7777.
- V. Chandola, A. Banerjee, and V. Kumar, “Anomaly detection: A survey,” ACM Computing Surveys (CSUR), vol. 41, no. 3, pp. 1–58, 2009.
- W. Lee and D. Xiang, “Information-theoretic measures for anomaly detection,” in 2001 IEEE Symposium on Security and Privacy (S&P), 2000, pp. 130–143.
- A. Patcha and J.-M. Park, “An overview of anomaly detection techniques: Existing solutions and latest technological trends,” Computer Networks, vol. 51, no. 12, pp. 3448–3470, 2007.
- E. Khalastchi, M. Kalech, G. A. Kaminka, and R. Lin, “Online data-driven anomaly detection in autonomous robots,” Knowledge and Information Systems, vol. 43, no. 3, pp. 657–688, 2015.
- J. An and S. Cho, “Variational autoencoder based anomaly detection using reconstruction probability,” Technical Report. SNU Data Mining Center, 2015.
- K. H. Kim, S. Shim, Y. Lim, J. Jeon, J. Choi, B. Kim, and A. S. Yoon, “RaPP: Novelty detection with reconstruction along projection pathway,” in International Conference on Learning Representations (ICLR), 2020.
- A. Vasilev, V. Golkov, M. Meissner, I. Lipp, E. Sgarlata, V. Tomassini, D. K. Jones, and D. Cremers, “q-Space novelty detection with variational autoencoders,” arXiv preprint arXiv:1806.02997, 2018.
- K. Noda, H. Arie, Y. Suga, and T. Ogata, “Multimodal integration learning of robot behavior using deep neural networks,” Robotics and Autonomous Systems, vol. 62, no. 6, pp. 721–736, 2014.
- M. A. Lee, Y. Zhu, K. Srinivasan, P. Shah, S. Savarese, L. Fei-Fei, A. Garg, and J. Bohg, “Making sense of vision and touch: Self-supervised learning of multimodal representations for contact-rich tasks,” in International Conference on Robotics and Automation (ICRA), 2019, pp. 8943–8950.
- D. Park, Z. Erickson, T. Bhattacharjee, and C. C. Kemp, “Multimodal execution monitoring for anomaly detection during robot manipulation,” in International Conference on Robotics and Automation (ICRA), 2016, pp. 407–414.
- D. Park, H. Kim, Y. Hoshi, Z. Erickson, A. Kapusta, and C. C. Kemp, “A multimodal execution monitor with anomaly classification for robot-assisted feeding,” in International Conference on Intelligent Robots and Systems (IROS), 2017, pp. 5406–5413.
- D. Park, Y. Hoshi, and C. C. Kemp, “A multimodal anomaly detector for robot-assisted feeding using an lstm-based variational autoencoder,” IEEE Robotics and Automation Letters (RA-L), vol. 3, no. 3, pp. 1544–1551, 2018.
- S. Davis and P. Mermelstein, “Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 28, no. 4, pp. 357–366, 1980.
- A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan, T. Killeen, Z. Lin, N. Gimelshein, L. Antiga, A. Desmaison, A. Kopf, E. Yang, Z. DeVito, M. Raison, A. Tejani, S. Chilamkurthy, B. Steiner, L. Fang, J. Bai, and S. Chintala, “PyTorch: An imperative style, high-performance deep learning library,” in Advances in Neural Information Processing Systems 32, 2019, pp. 8024–8035.
- M. A. Kramer, “Nonlinear principal component analysis using autoassociative neural networks,” AIChE Journal, vol. 37, no. 2, pp. 233–243, 1991.
- B. Xu, N. Wang, T. Chen, and M. Li, “Empirical evaluation of rectified activations in convolutional network,” arXiv preprint arXiv:1505.00853, 2015.
- S. Ioffe and C. Szegedy, “Batch normalization: Accelerating deep network training by reducing internal covariate shift,” Proceedings of Machine Learning Research, vol. 37, pp. 448–456, 2015.
- G. H. Golub and C. Reinsch, “Singular value decomposition and least squares solutions,” in Linear Algebra. Springer, 1971, pp. 134–151.
- T. Yamamoto, K. Terada, A. Ochiai, F. Saito, Y. Asahara, and K. Murase, “Development of the research platform of a domestic mobile manipulator utilized for international competition and field test,” in International Conference on Intelligent Robots and Systems (IROS), 2018, pp. 7675–7682.
- B. Calli, A. Singh, A. Walsman, S. Srinivasa, P. Abbeel, and A. M. Dollar, “The YCB object and model set: Towards common benchmarks for manipulation research,” in International Conference on Advanced Robotics (ICAR), 2015, pp. 510–517.