Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Fg-T2M: Fine-Grained Text-Driven Human Motion Generation via Diffusion Model (2309.06284v1)

Published 12 Sep 2023 in cs.CV and cs.MM

Abstract: Text-driven human motion generation in computer vision is both significant and challenging. However, current methods are limited to producing either deterministic or imprecise motion sequences, failing to effectively control the temporal and spatial relationships required to conform to a given text description. In this work, we propose a fine-grained method for generating high-quality, conditional human motion sequences supporting precise text description. Our approach consists of two key components: 1) a linguistics-structure assisted module that constructs accurate and complete language feature to fully utilize text information; and 2) a context-aware progressive reasoning module that learns neighborhood and overall semantic linguistics features from shallow and deep graph neural networks to achieve a multi-step inference. Experiments show that our approach outperforms text-driven motion generation methods on HumanML3D and KIT test sets and generates better visually confirmed motion to the text conditions.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Yin Wang (58 papers)
  2. Zhiying Leng (5 papers)
  3. Frederick W. B. Li (11 papers)
  4. Shun-Cheng Wu (11 papers)
  5. Xiaohui Liang (30 papers)
Citations (34)

Summary

We haven't generated a summary for this paper yet.