Learning to Predict 3D Rotational Dynamics from Images of a Rigid Body with Unknown Mass Distribution (2308.14666v2)
Abstract: In many real-world settings, image observations of freely rotating 3D rigid bodies may be available when low-dimensional measurements are not. However, the high-dimensionality of image data precludes the use of classical estimation techniques to learn the dynamics. The usefulness of standard deep learning methods is also limited, because an image of a rigid body reveals nothing about the distribution of mass inside the body, which, together with initial angular velocity, is what determines how the body will rotate. We present a physics-based neural network model to estimate and predict 3D rotational dynamics from image sequences. We achieve this using a multi-stage prediction pipeline that maps individual images to a latent representation homeomorphic to $\mathbf{SO}(3)$, computes angular velocities from latent pairs, and predicts future latent states using the Hamiltonian equations of motion. We demonstrate the efficacy of our approach on new rotating rigid-body datasets of sequences of synthetic images of rotating objects, including cubes, prisms and satellites, with unknown uniform and non-uniform mass distributions. Our model outperforms competing baselines on our datasets, producing better qualitative predictions and reducing the error observed for the state-of-the-art Hamiltonian Generative Network by a factor of 2.
- OSIRIS-REx Flight Dynamics and Navigation Design. Space Sci. Rev. 2018, 214, 69. https://doi.org/10.1007/s11214-018-0501-x.
- A review of space robotics technologies for on-orbit servicing. Prog. Aerosp. Sci. 2014, 68, 1–26. https://doi.org/10.1016/j.paerosci.2014.03.002.
- Review of active space debris removal methods. Space Policy 2019, 47, 194–206. https://doi.org/10.1016/j.spacepol.2018.12.005.
- Unsupervised Learning of Lagrangian Dynamics from Images for Prediction and Control. In Proceedings of the Conference on Neural Information Processing Systems 2020, Virtual, 6–12 December 2020.
- Hamiltonian Generative Networks. In Proceedings of the International Conference on Learning Representations 2020, Addis Ababa, Ethiopia, 26–30 April 2020.
- LagNetViP: A Lagrangian Neural Network for Video Prediction. arXiv 2020, arXiv:2010.12932.
- SE3-nets: Learning Rigid Body Motion Using Deep Neural Networks. In Proceedings of the International Conference on Robotics and Automation 2017, Singapore, 29 May–3 June 2017.
- A Smooth Representation of Belief over SO(3) for Deep Rotation Learning with Uncertainty. arXiv 2020, arXiv:2006.01031.
- Hamiltonian-based Neural ODE Networks on the SE(3) Manifold For Dynamics Learning and Control. In Proceedings of the Robotics: Science and Systems, Virtual, 12–16 July 2021.
- Hamiltonian Neural Networks. In Proceedings of the Conference on Neural Information Processing Systems, Vancouver, BC, Canada, 8–14 December 2019.
- Symplectic Recurrent Neural Networks. In Proceedings of the International Conference on Learning Representations 2020, Addis Ababa, Ethiopia, 26–30 April 2020.
- Lagrangian Neural Networks. In Proceedings of the International Conference on Learning Representations 2020, Addis Ababa, Ethiopia, 26–30 April 2020.
- Auto-Encoding Variational Bayes. arXiv 2014, arXiv:1312.6114.
- Explorations in Homeomorphic Variational Auto-Encoding. In Proceedings of the International Conference of Machine Learning Workshop on Theoretical Foundations and Application of Deep Generative Models, Stockholm, Sweden, 14–15 July 2018.
- An analysis of svd for deep rotation estimation. Adv. Neural Inf. Process. Syst. 2020, 33, 22554–22565.
- Brégier, R. Deep regression on manifolds: A 3D rotation case study. In Proceedings of the 2021 International Conference on 3D Vision (3DV), London, UK, 1–3 December 2021; pp. 166–174.
- On the Continuity of Rotation Representations in Neural Networks. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 5738–5746.
- Geometric Integration of Quaternions. AIAA J. Guid. Control 2013, 36, 1762–1772. https://doi.org/10.2514/1.58558.
- Classical Mechanics; Addison Wesley: Boston, MA, USA, 2002.
- Symplectic ODE-Net: Learning Hamiltonian Dynamics with Control. In Proceedings of the International Conference on Learning Representations 2020, Addis Ababa, Ethiopia, 26–30 April 2020.
- Dissipative SymODEN: Encoding Hamiltonian Dynamics with Dissipation and Control into Deep Learning. In Proceedings of the ICLR 2020 Workshop on Integration of Deep Neural Models and Differential Equations, Addis Ababa, Ethiopia, 26–30 April 2020.
- Simplifying Hamiltonian and Lagrangian Neural Networks via Explicit Constraints. Conf. Neural Inf. Process. Syst. 2020, 33, 13.
- Global Formulations of Lagrangian and Hamiltonian Dynamics on Manifolds; Springer: Cham, Switzerland, 2018.
- Introduction to Mechanics and Symmetry; Texts in Applied Mathematics; Springer: New York, NY, USA, 1999.
- Lin, Z. Riemannian Geometry of Symmetric Positive Definite Matrices via Cholesky Decomposition. SIAM J. Matrix Anal. Appl. 2019, 40, 1353–1370. https://doi.org/10.1137/18m1221084.
- Atkinson, K.A. An Introduction to Numerical Analysis; John Wiley & Sons: Hoboken, NJ, USA,1989.
- Markley, F.L. Unit quaternion from rotation matrix. AIAA J. Guid. Control 2008, 31, 440–442. https://doi.org/10.2514/1.31730.
- Satellite Pose Estimation Challenge: Dataset, Competition Design, and Results. IEEE Trans. Aerosp. Electron. Syst. 2020, 56, 4083–4098. https://doi.org/10.1109/taes.2020.2989063.
- Pose estimation for non-cooperative spacecraft rendezvous using convolutional neural networks. In Proceedings of the 2018 IEEE Aerospace Conference, Big Sky, MT, USA, 3–10 March 2018; pp. 1–12. https://doi.org/10.1109/AERO.2018.8396425.
- SPEED+: Next-Generation Dataset for Spacecraft Pose Estimation across Domain Gap. In Proceedings of the 2022 IEEE Aerospace Conference (AERO), Big Sky, MT, USA, 5–12 March 2022; pp. 1–15. https://doi.org/10.1109/AERO53065.2022.9843439.
- Deep Learning for Spacecraft Pose Estimation from Photorealistic Rendering. In Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France, 31 May–31 August 2020; pp. 6007–6013. https://doi.org/10.1109/ICRA40945.2020.9197244.
- Community, B.O. Blender—A 3D Modelling and Rendering Package; Blender Foundation, Stichting Blender Foundation: Amsterdam, The Netherlands, 2018.
- Neural ordinary differential equations. Adv. Neural Inf. Process. Syst. 2018, 31, 1–13.
- Long short-term memory. Neural Comput. 1997, 9, 1735–1780.
- Adam: A method for stochastic optimization. arXiv 2014, arXiv:1412.6980.
- Fast and accurate deep network learning by exponential linear units (elus). arXiv 2015, arXiv:1511.07289.
- [Re] Hamiltonian Generative Networks. ReScience C 2021, 7, #18. https://doi.org/10.5281/zenodo.4835278.
- Embed to control: A locally linear latent dynamics model for control from raw images. Adv. Neural Inf. Process. Syst. 2015, 27, 1–9.