MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training (2406.01867v2)

Published 4 Jun 2024 in cs.CV

Abstract: In motion generation, controllability is becoming increasingly important alongside generation quality and speed. Motion editing covers tasks such as in-betweening, upper-body editing, and path-following, but existing methods perform these edits with a data-space diffusion model, which is slower at inference than a latent diffusion model. In this paper, we propose MoLA, which provides fast, high-quality motion generation and handles multiple editing tasks in a single framework. For fast, high-quality generation, we employ a variational autoencoder and a latent diffusion model, and improve performance with adversarial training. In addition, we apply a training-free guided generation framework to perform various editing tasks from motion control inputs. We quantitatively show the effectiveness of adversarial training in text-to-motion generation and demonstrate that our editing framework applies to multiple editing tasks in the motion domain.

Authors (6)

Kengo Uchida
Takashi Shibuya
Yuhta Takida
Naoki Murata
Shusuke Takahashi
Yuki Mitsufuji

Citations (2)

Tweets

https://twitter.com/yahshibu/status/1927815517739286941
