Dress-Me-Up: A Dataset & Method for Self-Supervised 3D Garment Retargeting
Abstract: We propose a novel self-supervised framework for retargeting non-parameterized 3D garments onto 3D human avatars of arbitrary shapes and poses, enabling 3D virtual try-on (VTON). Existing self-supervised 3D retargeting methods support only parametric, canonical garments, which can be draped only over a parametric body model such as SMPL. To handle non-parametric garments and bodies, we introduce Isomap-embedding-based correspondence matching between the garment and the human body to obtain a coarse alignment between the two meshes. We then refine this coarse alignment with a neural network in a self-supervised setting. Further, we use a Laplacian detail integration method to preserve the inherent details of the input garment. To evaluate our 3D non-parametric garment retargeting framework, we propose a dataset of 255 real-world garments with realistic noise and topological deformations. The dataset contains 44 unique garments worn by 15 different subjects in 5 distinctive poses, captured using a multi-view RGBD capture setup. We show superior retargeting quality on non-parametric garments and human avatars compared with existing state-of-the-art methods, and our method serves as the first baseline on the proposed dataset for non-parametric 3D garment retargeting.
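The abstract does not spell out how the Laplacian detail integration works, so the following is only a minimal sketch of the general idea behind Laplacian (differential) coordinates, not the paper's implementation. It assumes uniform neighbor weights and a simple Jacobi-style iterative solve with fixed anchor vertices; the function names `vertex_neighbors`, `laplacian_coords`, and `reintegrate_details` are hypothetical and chosen for illustration.

```python
from typing import List, Sequence, Set, Tuple

Vec3 = Tuple[float, float, float]
Face = Tuple[int, int, int]


def vertex_neighbors(num_vertices: int, faces: Sequence[Face]) -> List[List[int]]:
    """Collect the one-ring neighbors of every vertex from triangle faces."""
    rings: List[Set[int]] = [set() for _ in range(num_vertices)]
    for a, b, c in faces:
        rings[a].update((b, c))
        rings[b].update((a, c))
        rings[c].update((a, b))
    return [sorted(r) for r in rings]


def _ring_mean(vertices: Sequence[Vec3], ring: Sequence[int]) -> Vec3:
    """Average position of the given neighbor vertices."""
    n = len(ring)
    return tuple(sum(vertices[j][k] for j in ring) / n for k in range(3))


def laplacian_coords(vertices: Sequence[Vec3],
                     neighbors: Sequence[Sequence[int]]) -> List[Vec3]:
    """Uniform-weight differential coordinates: delta_i = v_i - mean of neighbors."""
    deltas: List[Vec3] = []
    for v, ring in zip(vertices, neighbors):
        if not ring:  # isolated vertex carries no local detail
            deltas.append((0.0, 0.0, 0.0))
            continue
        mean = _ring_mean(vertices, ring)
        deltas.append(tuple(v[k] - mean[k] for k in range(3)))
    return deltas


def reintegrate_details(coarse: Sequence[Vec3],
                        neighbors: Sequence[Sequence[int]],
                        deltas: Sequence[Vec3],
                        anchors: Set[int],
                        iterations: int = 100) -> List[Vec3]:
    """Jacobi iterations of v_i <- mean(neighbors) + delta_i, anchors held fixed."""
    verts: List[Vec3] = [tuple(v) for v in coarse]
    for _ in range(iterations):
        updated = list(verts)
        for i, ring in enumerate(neighbors):
            if i in anchors or not ring:
                continue
            mean = _ring_mean(verts, ring)
            updated[i] = tuple(mean[k] + deltas[i][k] for k in range(3))
        verts = updated
    return verts


if __name__ == "__main__":
    # Toy "garment": a unit square with a raised center vertex (the detail).
    source = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (1.0, 1.0, 0.0),
              (0.0, 1.0, 0.0), (0.5, 0.5, 1.0)]
    faces = [(0, 1, 4), (1, 2, 4), (2, 3, 4), (3, 0, 4)]
    rings = vertex_neighbors(len(source), faces)
    deltas = laplacian_coords(source, rings)

    # Coarse retargeted result: square moved by +2 in x, center bump flattened.
    coarse = [(2.0, 0.0, 0.0), (3.0, 0.0, 0.0), (3.0, 1.0, 0.0),
              (2.0, 1.0, 0.0), (2.5, 0.5, 0.0)]
    refined = reintegrate_details(coarse, rings, deltas, anchors={0, 1, 2, 3})
    print(refined[4])  # (2.5, 0.5, 1.0): the bump is restored at the new location
```

In this toy case the coarse alignment loses the raised center vertex, and re-solving for positions that reproduce the source's differential coordinates restores it. A practical system would more likely use cotangent weights and a sparse linear solver rather than fixed-count Jacobi iterations.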