Reconstructing the Image Stitching Pipeline: Integrating Fusion and Rectangling into a Unified Inpainting Model (2404.14951v2)
Abstract: Deep learning-based image stitching pipelines are typically divided into three cascading stages: registration, fusion, and rectangling. Each stage requires its own network training and is tightly coupled to the others, which leads to error propagation and makes parameter tuning and system stability difficult. This paper proposes the Simple and Robust Stitcher (SRStitcher), which revolutionizes the image stitching pipeline by simplifying the fusion and rectangling stages into a unified inpainting model that requires no model training or fine-tuning. We reformulate the problem definitions of the fusion and rectangling stages and demonstrate that they can be effectively integrated into an inpainting task. Furthermore, we design weighted masks to guide the reverse process in a pre-trained large-scale diffusion model, implementing this integrated inpainting task in a single inference. Through extensive experimentation, we verify the interpretability and generalization capabilities of this unified model, demonstrating that SRStitcher outperforms state-of-the-art methods in both performance and stability. Code: https://github.com/yayoyo66/SRStitcher
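The idea of a weighted mask guiding a reverse diffusion pass can be illustrated with a small NumPy sketch. This is not the SRStitcher implementation: the function names (`build_weighted_mask`, `blend_step`), the `seam_weight` value, and the thresholding rule are all hypothetical, chosen to show a generic per-pixel-strength scheme. It assumes the overlap between the two warped images stands in for the fusion region and the uncovered canvas stands in for the rectangling region.

```python
import numpy as np

def build_weighted_mask(valid_a, valid_b, seam_weight=0.5):
    """Build a per-pixel weight map from two coverage masks (hypothetical helper).

    valid_a, valid_b: boolean arrays marking pixels covered by each warped image.
    Returns float32 weights: 0 = keep pixel, 1 = fully regenerate.
    """
    overlap = valid_a & valid_b        # region where the two images must be fused
    missing = ~(valid_a | valid_b)     # region no image covers (rectangling)
    mask = np.zeros(valid_a.shape, dtype=np.float32)
    mask[overlap] = seam_weight        # assumed: partial regeneration in the overlap
    mask[missing] = 1.0                # assumed: full regeneration where content is missing
    return mask

def blend_step(x_generated, x_known_noised, mask, progress):
    """One mask-guided blend inside a reverse pass (simplified illustration).

    progress: fraction of the reverse process already completed, in [0, 1).
    A pixel is left to the model once its weight reaches 1 - progress;
    until then it is pinned to the (noised) known content.
    """
    free = mask >= (1.0 - progress)
    return np.where(free, x_generated, x_known_noised)

# Toy 1x4 canvas: image A covers columns 0-1, image B covers columns 1-2.
valid_a = np.array([[True, True, False, False]])
valid_b = np.array([[False, True, True, False]])
mask = build_weighted_mask(valid_a, valid_b)

generated = np.ones((1, 4), dtype=np.float32)   # stand-in for model output
known = np.zeros((1, 4), dtype=np.float32)      # stand-in for noised known pixels

early = blend_step(generated, known, mask, progress=0.0)
late = blend_step(generated, known, mask, progress=0.5)
```

In this toy setup, only the uncovered column is regenerated at the start of the pass, and the overlap column is released halfway through, so higher-weight regions receive more denoising steps than lower-weight ones.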
- Ziqi Xie
- Weidong Zhao
- Xianhui Liu
- Jian Zhao
- Ning Jia