Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Survey of Structure from Motion (1701.08493v2)

Published 30 Jan 2017 in cs.CV

Abstract: The structure from motion (SfM) problem in computer vision is the problem of recovering the three-dimensional ($3$D) structure of a stationary scene from a set of projective measurements, represented as a collection of two-dimensional ($2$D) images, via estimation of motion of the cameras corresponding to these images. In essence, SfM involves the three main stages of (1) extraction of features in images (e.g., points of interest, lines, etc.) and matching these features between images, (2) camera motion estimation (e.g., using relative pairwise camera positions estimated from the extracted features), and (3) recovery of the $3$D structure using the estimated motion and features (e.g., by minimizing the so-called reprojection error). This survey mainly focuses on relatively recent developments in the literature pertaining to stages (2) and (3). More specifically, after touching upon the early factorization-based techniques for motion and structure estimation, we provide a detailed account of some of the recent camera location estimation methods in the literature, followed by discussion of notable techniques for $3$D structure recovery. We also cover the basics of the simultaneous localization and mapping (SLAM) problem, which can be viewed as a specific case of the SfM problem. Further, our survey includes a review of the fundamentals of feature extraction and matching (i.e., stage (1) above), various recent methods for handling ambiguities in $3$D scenes, SfM techniques involving relatively uncommon camera models and image features, and popular sources of data and SfM software.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Onur Ozyesil (6 papers)
  2. Vladislav Voroninski (22 papers)
  3. Ronen Basri (42 papers)
  4. Amit Singer (95 papers)
Citations (359)

Summary

Overview of "A Survey of Structure from Motion"

The paper "A Survey of Structure from Motion" provides a comprehensive examination of recent advancements in the field of Structure from Motion (SfM), a central problem in computer vision aiming to recover three-dimensional (3D) structures from two-dimensional (2D) image projections. Authored by Onur Özyesil, Vladislav Voroninski, Ronen Basri, and Amit Singer, the survey meticulously outlines methodologies, both classical and contemporary, for solving various facets of the SfM problem, including feature extraction and matching, camera motion and location estimation, and 3D structure recovery, with additional discussion on challenges such as ambiguities due to scene symmetries.

Key Aspects:

  1. Stages of SfM:
    • The paper divides the SfM process into three main stages: feature extraction and matching, camera motion estimation, and 3D structure recovery. It details the function of each stage, highlighting the importance of optimizing reprojection error, often by employing bundle adjustment techniques.
  2. Camera Motion and Location Estimation:
    • The camera location estimation is a focal point, with emphasis on solving the problem of estimating camera positions based on relative pairwise measurements. The authors discuss classical least squares methods and novel approaches like semidefinite relaxations, iterative reweighted least squares, and robust strategies against measurement noise and outliers.
  3. 3D Structure Recovery:
    • The survey distinguishes early factorization methods from modern robust techniques capable of handling unordered, large-scale image datasets. It highlights the use of global approaches that singularly compute camera locations and scene structures simultaneously, often using robust optimization formulations to handle outliers.
  4. Handling Ambiguities:
    • Ambiguities arising from symmetries and repetitive patterns in scenes pose a significant challenge in SfM. The paper addresses various approaches designed to resolve these ambiguities, including cycle consistency tests and Bayesian formulisms.
  5. Simultaneous Localization and Mapping (SLAM):
    • Recognizing SLAM as a subset of SfM largely applicable in robotics, the survey reviews the adaptation of SfM techniques for SLAM, with an emphasis on utilizing visual-inertial systems and addressing the computation constraints imposed by real-time processing demands.
  6. Software and Datasets:
    • The survey mentions key software frameworks and datasets that have been foundational in driving progress within the field of SfM. Notably, tools like bundler and multicore bundle adjustment facilitate practical application of SfM algorithms, while extensive datasets permit benchmarking and empirical validation.

Implications and Future Directions

The paper solidifies SfM's importance due to its wide-ranging applications, including augmented reality, autonomous driving, and photogrammetry. It implies that future advancements will likely focus on enhancing computational efficiency to enable real-time processing, increasing robustness to noise and feature mismatches, and integrating richer data from diverse camera settings and non-traditional structure points, such as line features or omnidirectional imagery.

Further, the paper suggests potential areas of exploration, such as leveraging machine learning to improve feature matching or addressing fundamental questions about the theoretical limits of noise tolerance within SfM tasks.

In summary, this survey serves as an extensive resource that not only characterizes the state-of-the-art in SfM but also examines the trajectory of ongoing research and its implications for future technological advancements in computer vision.