Image-Based Virtual Try-On: A Survey (2311.04811v4)
Abstract: Image-based virtual try-on aims to synthesize a naturally dressed person image with a clothing image, which revolutionizes online shopping and inspires related topics within image generation, showing both research significance and commercial potential. However, there is a gap between current research progress and commercial applications and an absence of comprehensive overview of this field to accelerate the development.In this survey, we provide a comprehensive analysis of the state-of-the-art techniques and methodologies in aspects of pipeline architecture, person representation and key modules such as try-on indication, clothing warping and try-on stage. We additionally apply CLIP to assess the semantic alignment of try-on results, and evaluate representative methods with uniformly implemented evaluation metrics on the same dataset.In addition to quantitative and qualitative evaluation of current open-source methods, unresolved issues are highlighted and future research directions are prospected to identify key trends and inspire further exploration. The uniformly implemented evaluation metrics, dataset and collected methods will be made public available at https://github.com/little-misfit/Survey-Of-Virtual-Try-On.
- \bibcommenthead
- Chong Z, Mo L (2022) ST-VTON: self-supervised vision transformer for image-based virtual try-on. Image Vis Comput 127:104568
- Duchon J (1977) Splines minimizing rotation-invariant semi-norms in sobolev spaces. In: Constructive Theory of Functions of Several Variables, pp 85–100
- Honda S (2019) Viton-gan: Virtual try-on image generator trained with adversarial loss. In: Eurographics, pp 9–10
- Park S, Park J (2022) WG-VITON: wearing-guide virtual try-on for top and bottom clothes. arXiv preprint arXiv:220504759
- Ronneberger O (2017) Invited talk: U-net convolutional networks for biomedical image segmentation. In: Proceedings des Workshops vom 12. bis 14. März 2017 in Heidelberg. Springer, p 3
- Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: ICLR