Learning Accurate Dense Correspondences and When to Trust Them (2101.01710v2)

Published 5 Jan 2021 in cs.CV

Abstract: Establishing dense correspondences between a pair of images is an important and general problem. However, dense flow estimation is often inaccurate in the case of large displacements or homogeneous regions. For most applications and down-stream tasks, such as pose estimation, image manipulation, or 3D reconstruction, it is crucial to know when and where to trust the estimated matches. In this work, we aim to estimate a dense flow field relating two images, coupled with a robust pixel-wise confidence map indicating the reliability and accuracy of the prediction. We develop a flexible probabilistic approach that jointly learns the flow prediction and its uncertainty. In particular, we parametrize the predictive distribution as a constrained mixture model, ensuring better modelling of both accurate flow predictions and outliers. Moreover, we develop an architecture and training strategy tailored for robust and generalizable uncertainty prediction in the context of self-supervised training. Our approach obtains state-of-the-art results on multiple challenging geometric matching and optical flow datasets. We further validate the usefulness of our probabilistic confidence estimation for the task of pose estimation. Code and models are available at https://github.com/PruneTruong/PDCNet.

Citations (113)

View on Semantic Scholar

Summary

The paper introduces PDC-Net, a model that predicts dense correspondences and confidence maps, significantly improving reliability in computer vision tasks.
Their probabilistic approach with a constrained mixture model handles outliers better than traditional methods, enhancing prediction accuracy in challenging regions.
Self-supervised training and experiments on datasets like KITTI-2015 demonstrate that PDC-Net achieves state-of-the-art performance in dense flow estimation.

Overview of "Learning Accurate Dense Correspondences and When to Trust Them"

In the paper titled "Learning Accurate Dense Correspondences and When to Trust Them," Truong et al. address the significant challenge of establishing reliable dense correspondences between image pairs. This problem is foundational in computer vision applications, including pose estimation, 3D reconstruction, and image manipulation. Dense flow estimation can be inaccurate, especially with large displacements or homogeneous regions, making it crucial to determine when predictions can be trusted.

Core Contributions

The authors introduce the Probabilistic Dense Correspondence Network (PDC-Net), which not only learns to predict dense flow fields between images but also provides robust pixel-wise confidence maps. These maps indicate the reliability of flow predictions, a key advancement for tasks requiring high accuracy and robustness.

The methodology leverages a flexible probabilistic approach and constructs a constrained mixture model. This model more effectively handles both accurate predictions and outliers compared to traditional methods, enhancing the model's flexibility in dealing with varying uncertainties.

Architecture and Training Strategy

The paper details an advanced architecture designed for robust uncertainty prediction, particularly under self-supervised conditions. The architecture integrates features from the correlation volume and uses a decoder that refines uncertainty predictions iteratively. The training does not depend on densely annotated data; instead, it uses a well-designed self-supervision strategy, improving the model's generalization capability.

Experimental Results

The authors provide compelling numerical results, demonstrating that PDC-Net achieves state-of-the-art performance across multiple optical flow and geometric matching datasets. Notably, on the Megadepth geometric matching dataset and KITTI-2015 training set, PDC-Net sets new performance benchmarks. Additionally, for pose estimation, the probabilistic confidence estimation significantly boosts accuracy.

Implications and Future Directions

This work implies a substantial shift in how dense correspondences can be deployed in vision tasks demanding high reliability, such as autonomous driving and medical imaging. By modeling both prediction and uncertainty, PDC-Net offers a practically viable solution for integrating confidence measures into dense correspondence tasks.

Looking towards the future, the framework presents opportunities for further enhancements in real-world applications. Improvements might include integrating more sophisticated models for motion boundaries or extending the methodology to address different types of visual transformations and varied data conditions. Furthermore, the probabilistic approach could be adapted and extended to other vision tasks, enhancing the overall robustness of computer vision systems.

Conclusion

Truong et al.'s paper offers a significant contribution to the field of computer vision by providing a robust framework for not only predicting dense correspondences but also assessing their reliability. The integration of probabilistic modeling into dense flow estimation stands out as a methodological advancement, opening doors for more reliable and interpretable machine vision systems.

PDF Markdown

Related Papers

GitHub

GitHub - PruneTruong/DenseMatching: Dense matching library based on PyTorch (635 stars)

Tweets

https://twitter.com/prunetruong/status/1406316968949473284

YouTube

Show All Videos