Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
129 tokens/sec
GPT-4o
28 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

HI-GAN: Hierarchical Inpainting GAN with Auxiliary Inputs for Combined RGB and Depth Inpainting (2402.10334v1)

Published 15 Feb 2024 in cs.CV, cs.AI, and cs.LG

Abstract: Inpainting involves filling in missing pixels or areas in an image, a crucial technique employed in Mixed Reality environments for various applications, particularly in Diminished Reality (DR) where content is removed from a user's visual environment. Existing methods rely on digital replacement techniques which necessitate multiple cameras and incur high costs. AR devices and smartphones use ToF depth sensors to capture scene depth maps aligned with RGB images. Despite speed and affordability, ToF cameras create imperfect depth maps with missing pixels. To address the above challenges, we propose Hierarchical Inpainting GAN (HI-GAN), a novel approach comprising three GANs in a hierarchical fashion for RGBD inpainting. EdgeGAN and LabelGAN inpaint masked edge and segmentation label images respectively, while CombinedRGBD-GAN combines their latent representation outputs and performs RGB and Depth inpainting. Edge images and particularly segmentation label images as auxiliary inputs significantly enhance inpainting performance by complementary context and hierarchical optimization. We believe we make the first attempt to incorporate label images into inpainting process.Unlike previous approaches requiring multiple sequential models and separate outputs, our work operates in an end-to-end manner, training all three models simultaneously and hierarchically. Specifically, EdgeGAN and LabelGAN are first optimized separately and further optimized inside CombinedRGBD-GAN to enhance inpainting quality. Experiments demonstrate that HI-GAN works seamlessly and achieves overall superior performance compared with existing approaches.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (25)
  1. RGB-D Image Inpainting Using Generative Adversarial Network with a Late Fusion Approach.
  2. A Neural Algorithm of Artistic Style.
  3. A Neural Algorithm of Artistic Style. arXiv:1508.06576.
  4. Deep Fusion Network for Image Completion. In Proceedings of the 27th ACM International Conference on Multimedia, MM ’19, 2033–2042. New York, NY, USA: Association for Computing Machinery. ISBN 9781450368896.
  5. Indoor depth completion with boundary consistency and self-attention. In Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 0–0.
  6. Iskakov, K. 2023. QD-IMD: Quick Draw Irregular Mask Dataset. https://github.com/karfly/qd-imd.
  7. Image-to-Image Translation with Conditional Adversarial Networks. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 5967–5976.
  8. A Category-Level 3D Object Dataset: Putting the Kinect to Work. 1168–1174. ISBN 978-1-4471-4639-1.
  9. Depth-assisted real-time 3D object detection for augmented reality. In ICAT, volume 11, 126–132.
  10. Image Inpainting for Irregular Holes Using Partial Convolutions. In Ferrari, V.; Hebert, M.; Sminchisescu, C.; and Weiss, Y., eds., Computer Vision – ECCV 2018, 89–105. Cham: Springer International Publishing. ISBN 978-3-030-01252-6.
  11. Image Inpainting for Irregular Holes Using Partial Convolutions. In The European Conference on Computer Vision (ECCV).
  12. Depth Inpainting via Vision Transformer. In 2021 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct), 286–291.
  13. Spectral Normalization for Generative Adversarial Networks. In International Conference on Learning Representations.
  14. 3D PixMix: Image Inpainting in 3D Environments. In 2018 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct), 1–2.
  15. Indoor Segmentation and Support Inference from RGBD Images. In ECCV.
  16. EdgeConnect: Structure Guided Image Inpainting using Edge Prediction. In The IEEE International Conference on Computer Vision (ICCV) Workshops.
  17. Very Deep Convolutional Networks for Large-Scale Image Recognition. In International Conference on Learning Representations.
  18. Synthesizing 3D Shapes via Modeling Multi-View Depth Maps and Silhouettes with Deep Generative Networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1511–1519.
  19. SUN RGB-D: A RGB-D scene understanding benchmark suite. In 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 567–576.
  20. SUN3D: A Database of Big Spaces Reconstructed Using SfM and Object Labels. In 2013 IEEE International Conference on Computer Vision, 1625–1632.
  21. Boundary-aware Image Inpainting with Multiple Auxiliary Cues. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 618–628.
  22. Semantic Image Inpainting with Deep Generative Models. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 6882–6890. Los Alamitos, CA, USA: IEEE Computer Society.
  23. Free-Form Image Inpainting With Gated Convolution. In 2019 IEEE/CVF International Conference on Computer Vision (ICCV), 4470–4479.
  24. Deep Depth Completion of a Single RGB-D Image. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
  25. InDepth: Real-Time Depth Inpainting for Mobile Augmented Reality. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 6(1).

Summary

We haven't generated a summary for this paper yet.