ARtVista: Gateway To Empower Anyone Into Artist (2403.08876v1)
Abstract: Drawing is an art that enables people to express their imagination and emotions. However, individuals often struggle with drawing, especially when translating conceptual ideas into visually coherent representations and bridging the gap between mental visualization and practical execution. In response, we propose ARtVista, a novel system integrating AR and generative AI technologies. ARtVista recommends reference images aligned with users' abstract ideas and generates sketches for users to draw, and it goes further by crafting vibrant paintings in various painting styles. ARtVista also offers users an alternative way to create striking paintings by simulating the paint-by-number concept on reference images, so that users can produce visually stunning artwork without advanced drawing skills. We performed a pilot study, which yielded positive feedback on the system's usability and emphasized its effectiveness in visualizing user ideas and aiding the painting process. The source code will be available at https://github.com/htrvu/ARtVista.
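The abstract does not say how the paint-by-number simulation is implemented. As a rough illustration only, one common way to approximate the idea is to quantize an image's colors into a small palette with k-means (Lloyd's algorithm) and give every pixel the number of its palette color. The sketch below assumes that approach; the function name `paint_by_number`, its parameters, and the deterministic centroid initialization are hypothetical and are not taken from ARtVista.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]
Image = List[List[Pixel]]


def _dist2(a, b) -> float:
    """Squared Euclidean distance between two RGB colors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def _nearest(p, centroids) -> int:
    """Index of the centroid closest to pixel p."""
    return min(range(len(centroids)), key=lambda i: _dist2(p, centroids[i]))


def paint_by_number(image: Image, k: int, iterations: int = 10):
    """Hypothetical helper: quantize `image` to at most k colors.

    Returns (labels, palette), where labels[y][x] is a 1-based region
    number and palette[n - 1] is the RGB color to paint region n with.
    """
    pixels = [p for row in image for p in row]

    # Assumption: start from the first k distinct colors in scan order,
    # which keeps the example deterministic (no random seeding).
    centroids = []
    for p in pixels:
        if p not in centroids:
            centroids.append(p)
        if len(centroids) == k:
            break

    for _ in range(iterations):
        # Assignment step: group pixels by nearest centroid.
        clusters = [[] for _ in centroids]
        for p in pixels:
            clusters[_nearest(p, centroids)].append(p)
        # Update step: move each centroid to the mean of its cluster.
        updated = [
            tuple(sum(ch) / len(c) for ch in zip(*c)) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
        if updated == centroids:
            break
        centroids = updated

    labels = [[1 + _nearest(p, centroids) for p in row] for row in image]
    palette = [tuple(round(v) for v in c) for c in centroids]
    return labels, palette


if __name__ == "__main__":
    tiny = [
        [(250, 10, 10), (255, 0, 0), (0, 0, 250), (10, 10, 255)],
        [(245, 5, 5), (0, 5, 245), (5, 0, 255), (255, 5, 0)],
    ]
    numbers, colors = paint_by_number(tiny, k=2)
    for row in numbers:
        print(row)
    print(colors)
```

A real system would likely also smooth the label map and merge very small regions so that each numbered area is large enough to paint; those steps are omitted here.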