Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Procedural Humans for Computer Vision (2301.01161v1)

Published 3 Jan 2023 in cs.CV and cs.GR

Abstract: Recent work has shown the benefits of synthetic data for use in computer vision, with applications ranging from autonomous driving to face landmark detection and reconstruction. There are a number of benefits of using synthetic data from privacy preservation and bias elimination to quality and feasibility of annotation. Generating human-centered synthetic data is a particular challenge in terms of realism and domain-gap, though recent work has shown that effective machine learning models can be trained using synthetic face data alone. We show that this can be extended to include the full body by building on the pipeline of Wood et al. to generate synthetic images of humans in their entirety, with ground-truth annotations for computer vision applications. In this report we describe how we construct a parametric model of the face and body, including articulated hands; our rendering pipeline to generate realistic images of humans based on this body model; an approach for training DNNs to regress a dense set of landmarks covering the entire body; and a method for fitting our body model to dense landmarks predicted from multiple views.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Charlie Hewitt (15 papers)
  2. Erroll Wood (12 papers)
  3. Lohit Petikam (5 papers)
  4. Louis Florentin (3 papers)
  5. Hanz Cuevas Velasquez (1 paper)
  6. Tadas BaltruĊĦaitis (12 papers)
Citations (4)