RSMT: Real-time Stylized Motion Transition for Characters (2306.11970v1)
Abstract: Styled online in-between motion generation has important application scenarios in computer animation and games. Its core challenge lies in the need to satisfy four critical requirements simultaneously: generation speed, motion quality, style diversity, and synthesis controllability. While the first two requirements demand a delicate balance between simple, fast models and the learning capacity needed for generation quality, the latter two are rarely investigated together in existing methods, which largely focus on either control without style or uncontrolled stylized motions. To this end, we propose a Real-time Stylized Motion Transition method (RSMT) to achieve all the aforementioned goals. Our method consists of two critical, independent components: a general motion manifold model and a style motion sampler. The former acts as a high-quality motion source and the latter synthesizes styled motions on the fly under control signals. Since both components can be trained separately on different datasets, our method provides great flexibility, requires less data, and generalizes well when no or few samples are available for unseen styles. Through exhaustive evaluation, our method proves to be fast, high-quality, versatile, and controllable. The code and data are available at https://github.com/yuyujunjun/RSMT-Realtime-Stylized-Motion-Transition.