Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Parallax-Tolerant Unsupervised Deep Image Stitching (2302.08207v2)

Published 16 Feb 2023 in cs.CV

Abstract: Traditional image stitching approaches tend to leverage increasingly complex geometric features (point, line, edge, etc.) for better performance. However, these hand-crafted features are only suitable for specific natural scenes with adequate geometric structures. In contrast, deep stitching schemes overcome the adverse conditions by adaptively learning robust semantic features, but they cannot handle large-parallax cases due to homography-based registration. To solve these issues, we propose UDIS++, a parallax-tolerant unsupervised deep image stitching technique. First, we propose a robust and flexible warp to model the image registration from global homography to local thin-plate spline motion. It provides accurate alignment for overlapping regions and shape preservation for non-overlapping regions by joint optimization concerning alignment and distortion. Subsequently, to improve the generalization capability, we design a simple but effective iterative strategy to enhance the warp adaption in cross-dataset and cross-resolution applications. Finally, to further eliminate the parallax artifacts, we propose to composite the stitched image seamlessly by unsupervised learning for seam-driven composition masks. Compared with existing methods, our solution is parallax-tolerant and free from laborious designs of complicated geometric features for specific scenes. Extensive experiments show our superiority over the SoTA methods, both quantitatively and qualitatively. The code is available at https://github.com/nie-lang/UDIS2.

Overview of the ICCV Document Preparation Guidelines

The "LaTeX Author Guidelines for ICCV Proceedings" document serves as a comprehensive guide for authors intending to submit their work to the International Conference on Computer Vision (ICCV). It is essential to adhere to specified formatting and procedural guidelines to ensure acceptance and proper presentation of submissions.

Abstract and Introduction

The paper mandates a structured format starting with an italicized abstract in 10-point single-spaced text. Following the abstract, the authors present an introduction that outlines the key aspects of manuscript preparation for IEEE Computer Society Press. This section underscores updates in the guidelines and emphasizes reading the current document thoroughly.

Key Content Areas

  1. Language and Dual Submission: The manuscript must be in English, with adherence to ICCV's dual submission policies detailed on their website.
  2. Paper Length and Formatting: Submissions cannot exceed eight pages excluding references, and must follow specific typographical and layout requirements. Extra emphasis is given to ensuring the use of smaller fonts in figure captions and references without altering set page margins and formats.
  3. Technical Aspects: The document specifies detailed instructions on presenting mathematical content, including section and equation numbering. Complex formatting elements such as rulers for review, blind review recommendations, and naming conventions are also discussed to aid anonymous evaluation.
  4. Figures and Illustrations: Authors must ensure all visual content is legible both digitally and in print, emphasizing the use of \verb+\includegraphics+ for inserting figures with an appropriate scale in relation to the column width.
  5. Reference and Citation Styles: A numerical citation style enclosed in square brackets is preferred, and all references must be listed in 9-point Times font.
  6. Miscellaneous Considerations: Thorough guidance on typesetting specifics including font preferences, margin settings, footnote utilization, and color use is provided. Moreover, the guidelines elaborate on the necessary steps for final camera-ready submissions including the IEEE copyright form submission.

Practical Implications

From a practical perspective, these guidelines ensure a unified aesthetic and structural quality across all conference publications, aiding in readability and accessibility. Proper adherence streamlines the review process, allowing focus on the content's technical merit rather than presentation inaccuracies.

Theoretical Implications and Future Outlook

On a theoretical level, the guidelines reflect standard principles of scientific writing and presentation, promoting clarity and precision. As AI-driven document processing evolves, one can anticipate further automation and assistance in enforcing these guidelines, potentially leading to novel developments in document processing AI.

In summary, the ICCV guidelines are pivotal for authors aiming to publish within the conference's proceedings, and their meticulous implementation serves both the academic community and the broader dissemination of research findings. Future developments may further optimize these processes, integrating advanced AI methodologies to enhance compliance and reduce manual workloads for researchers.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Lang Nie (29 papers)
  2. Chunyu Lin (48 papers)
  3. Kang Liao (37 papers)
  4. Shuaicheng Liu (95 papers)
  5. Yao Zhao (272 papers)
Citations (28)
Github Logo Streamline Icon: https://streamlinehq.com