VGGHeads: 3D Multi Head Alignment with a Large-Scale Synthetic Dataset (2407.18245v2)

Published 25 Jul 2024 in cs.CV and cs.LG

Abstract: Human head detection, keypoint estimation, and 3D head model fitting are essential tasks with many applications. However, traditional real-world datasets often suffer from bias, privacy, and ethical concerns, and they have been recorded in laboratory environments, which makes it difficult for trained models to generalize. Here, we introduce \method -- a large-scale synthetic dataset generated with diffusion models for human head detection and 3D mesh estimation. Our dataset comprises over 1 million high-resolution images, each annotated with detailed 3D head meshes, facial landmarks, and bounding boxes. Using this dataset, we introduce a new model architecture capable of simultaneous head detection and head mesh reconstruction from a single image in a single step. Through extensive experimental evaluations, we demonstrate that models trained on our synthetic data achieve strong performance on real images. Furthermore, the versatility of our dataset makes it applicable across a broad spectrum of tasks, offering a general and comprehensive representation of human heads.

Citations (1)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/arXivGPT/status/1822759705036353834

https://twitter.com/ArxivToday/status/1816917821844029702

https://twitter.com/javaeeeee1/status/1822297958676607274

https://twitter.com/arXivGPT/status/1822409874715693064

https://twitter.com/arXivGPT/status/1823122182085783983

YouTube

Show All Videos

VGGHeads: 3D Multi Head Alignment with a Large-Scale Synthetic Dataset (2407.18245v2)

Summary

Related Papers

Tweets

YouTube