Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Towards Open Domain Text-Driven Synthesis of Multi-Person Motions (2405.18483v2)

Published 28 May 2024 in cs.CV

Abstract: This work aims to generate natural and diverse group motions of multiple humans from textual descriptions. While single-person text-to-motion generation is extensively studied, it remains challenging to synthesize motions for more than one or two subjects from in-the-wild prompts, mainly due to the lack of available datasets. In this work, we curate human pose and motion datasets by estimating pose information from large-scale image and video datasets. Our models use a transformer-based diffusion framework that accommodates multiple datasets with any number of subjects or frames. Experiments explore both generation of multi-person static poses and generation of multi-person motion sequences. To our knowledge, our method is the first to generate multi-subject motion sequences with high diversity and fidelity from a large variety of textual prompts.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Mengyi Shan (10 papers)
  2. Lu Dong (17 papers)
  3. Yutao Han (5 papers)
  4. Yuan Yao (292 papers)
  5. Tao Liu (350 papers)
  6. Ifeoma Nwogu (18 papers)
  7. Guo-Jun Qi (76 papers)
  8. Mitch Hill (9 papers)
Citations (4)

Summary

We haven't generated a summary for this paper yet.