2000 character limit reached
Learning with 3D rotations, a hitchhiker's guide to SO(3) (2404.11735v2)
Published 17 Apr 2024 in cs.LG, cs.CV, and cs.RO
Abstract: Many settings in machine learning require the selection of a rotation representation. However, choosing a suitable representation from the many available options is challenging. This paper acts as a survey and guide through rotation representations. We walk through their properties that harm or benefit deep learning with gradient-based optimization. By consolidating insights from rotation-based learning, we provide a comprehensive overview of learning functions with rotation representations. We provide guidance on selecting representations based on whether rotations are in the model's input or output and whether the data primarily comprises small angles.
- Loss it right: Euclidean and riemannian metrics in learning-based visual odometry. In ISR Europe 2023; 56th International Symposium on Robotics, pp. 107–111. VDE, 2023.
- URL https://marctenbosch.com/quaternions/code.htm.
- Bosch, M. T. Let’s remove quaternions from every 3d engine, 2020b. URL https://marctenbosch.com/quaternions/.
- Brégier, R. Deep regression on manifolds: a 3d rotation case study. In 2021 International Conference on 3D Vision (3DV), pp. 166–174. IEEE, 2021.
- Modern koopman theory for dynamical systems. arXiv preprint arXiv:2102.12086, 2021.
- Sharp minima can generalize for deep nets. In International Conference on Machine Learning, pp. 1019–1028. PMLR, 2017.
- Position Information in Transformers: An Overview. Computational Linguistics, 48(3):733–763, 09 2022. ISSN 0891-2017. doi: 10.1162/coli_a_00445. URL https://doi.org/10.1162/coli_a_00445.
- Euler, L. Du mouvement de rotation des corps solides autour d’un axe variable. Mémoires de l’académie des sciences de Berlin, pp. 154–193, 1765.
- A loss curvature perspective on training instability in deep learning. arXiv preprint arXiv:2110.04369, 2021.
- Grassia, F. S. Practical parameterization of rotations using the exponential map. Journal of graphics tools, 3(3):29–48, 1998.
- Rotation averaging. International journal of computer vision, 103:267–305, 2013.
- Huynh, D. Q. Metrics for 3d rotations: Comparison and analysis. Journal of Mathematical Imaging and Vision, 35:155–164, 2009.
- An analysis of svd for deep rotation estimation. Advances in Neural Information Processing Systems, 33:22554–22565, 2020.
- Constructive approximation of discontinuous functions by neural networks. Neural Processing Letters, 27:209–226, 2008.
- LLC., M. Cmu graphics lab motion capture database, 2024. URL http://mocap.cs.cmu.edu/.
- Understanding plasticity in neural networks. In International Conference on Machine Learning, pp. 23190–23211. PMLR, 2023.
- Macdonald, A. Linear and geometric algebra. Alan Macdonald, 2010.
- Mäkinen, J. Rotation manifold so (3) and its tangential vectors. Computational Mechanics, 42:907–919, 2008.
- Learning rotations. Mathematical Methods in the Applied Sciences, 2022.
- Learning 3-d object orientation from images. In 2009 IEEE International conference on robotics and automation, pp. 794–800. IEEE, 2009.
- Stuelpnagel, J. On the parametrization of the three-dimensional rotation group. SIAM review, 6(4):422–430, 1964.
- Modified rodrigues parameters: an efficient representation of orientation in 3d vision and graphics. Journal of Mathematical Imaging and Vision, 60:422–442, 2018.
- Mujoco: A physics engine for model-based control. In 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 5026–5033, 2012. doi: 10.1109/IROS.2012.6386109.
- SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nature Methods, 17:261–272, 2020. doi: 10.1038/s41592-019-0686-2.
- Densefusion: 6d object pose estimation by iterative dense fusion. 2019.
- PoseCNN: A convolutional neural network for 6D object pose estimation in cluttered scenes. In Robotics: Science and Systems (RSS), 2018.
- The essential order of approximation for neural networks. Science in China Series F: Information Sciences, 47:97–112, 2004.
- Simultaneous lp-approximation order for neural networks. Neural Networks, 18(7):914–923, 2005.
- Approximation capabilities of neural odes and invertible residual networks. In International Conference on Machine Learning, pp. 11086–11095. PMLR, 2020.
- On the continuity of rotation representations in neural networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5745–5753, 2019.