MGNet: Learning Correspondences via Multiple Graphs (2401.04984v1)
Abstract: Learning correspondences aims to find correct correspondences (inliers) from the initial correspondence set with an uneven correspondence distribution and a low inlier rate, which can be regarded as graph data. Recent advances usually use graph neural networks (GNNs) to build a single type of graph or simply stack local graphs into the global one to complete the task. But they ignore the complementary relationship between different types of graphs, which can effectively capture potential relationships among sparse correspondences. To address this problem, we propose MGNet to effectively combine multiple complementary graphs. To obtain information integrating implicit and explicit local graphs, we construct local graphs from implicit and explicit aspects and combine them effectively, which is used to build a global graph. Moreover, we propose Graph~Soft~Degree~Attention (GSDA) to make full use of all sparse correspondence information at once in the global graph, which can capture and amplify discriminative features. Extensive experiments demonstrate that MGNet outperforms state-of-the-art methods in different visual tasks. The code is provided in https://github.com/DAILUANYUAN/MGNet-2024AAAI.
- Pointdsc: Robust point cloud registration using deep spatial consistency. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 15859–15869.
- HPatches: A benchmark and evaluation of handcrafted and learned local descriptors. In Proceedings of the IEEE conference on computer vision and pattern recognition, 5173–5182.
- Graph-cut RANSAC. In Proceedings of the IEEE conference on computer vision and pattern recognition, 6733–6741.
- MAGSAC: marginalizing sample consensus. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 10197–10205.
- MAGSAC++, a fast, reliable and accurate robust estimator. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 1304–1312.
- Two-view geometry estimation unaffected by a dominant plane. In 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), volume 1, 772–779. IEEE.
- Enhancing Two-View Correspondence Learning By Local-Global Self-Atention. Neurocomputing.
- MS2DG-Net: Progressive Correspondence Learning via Multiple Sparse Semantics Dynamic Graph. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 8973–8982.
- Superpoint: Self-supervised interest point detection and description. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops, 224–236.
- Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM, 24(6): 381–395.
- Image matching across wide baselines: From paper to practice. International Journal of Computer Vision, 129(2): 517–547.
- Semi-supervised classification with graph convolutional networks. arXiv preprint arXiv:1609.02907.
- U-Match: Two-view Correspondence Learning with Hierarchy-aware Local Context Aggregation. In International Joint Conference on Artificial Intelligence (IJCAI).
- Motion Consistency-Based Correspondence Growing for Remote Sensing Image Matching. IEEE Geoscience and Remote Sensing Letters, 19: 1–5.
- Learnable Motion Coherence for Correspondence Pruning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 3237–3246.
- Lowe, D. G. 2004. Distinctive image features from scale-invariant keypoints. International journal of computer vision, 60(2): 91–110.
- Infrared and visible image fusion methods and applications: A survey. Information Fusion, 45: 153–178.
- Robust feature matching for remote sensing image registration via locally linear transforming. IEEE Transactions on Geoscience and Remote Sensing, 53(12): 6469–6481.
- Learning to find good correspondences. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2666–2674.
- ORB-SLAM: a versatile and accurate monocular SLAM system. IEEE transactions on robotics, 31(5): 1147–1163.
- Automatic differentiation in pytorch.
- Pointnet++: Deep hierarchical feature learning on point sets in a metric space. Advances in neural information processing systems, 30: 5099–5108.
- Geometric transformer for fast and robust point cloud registration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 11143–11152.
- Deep fundamental matrix estimation. In Proceedings of the European Conference on Computer Vision (ECCV), 284–299.
- ORB: An efficient alternative to SIFT or SURF. In 2011 International conference on computer vision, 2564–2571. Ieee.
- From coarse to fine: Robust hierarchical localization at large scale. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 12716–12725.
- SuperGlue: Learning Feature Matching With Graph Neural Networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
- Benchmarking 6dof outdoor visual localization in changing conditions. In Proceedings of the IEEE conference on computer vision and pattern recognition, 8601–8610.
- Structure-from-motion revisited. In Proceedings of the IEEE conference on computer vision and pattern recognition, 4104–4113.
- ACNe: Attentive Context Normalization for Robust Permutation-Equivariant Learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 11286–11295.
- YFCC100M: The new data in multimedia research. Communications of the ACM, 59(2): 64–73.
- Robust computation and parametrization of multiple view relations. In Sixth International Conference on Computer Vision (IEEE Cat. No. 98CH36271), 727–732. IEEE.
- Attention is all you need. In Advances in neural information processing systems, 5998–6008.
- Learning to find reliable correspondences with local neighborhood consensus. Neurocomputing, 406: 150–158.
- Sun3d: A database of big spaces reconstructed using sfm and object labels. In Proceedings of the IEEE international conference on computer vision, 1625–1632.
- Learning Second-Order Attentive Context for Efficient Correspondence Pruning. arXiv preprint arXiv:2303.15761.
- Hierarchical graph representation learning with differentiable pooling. Advances in neural information processing systems, 31.
- Learning two-view correspondences and geometry using order-aware network. In Proceedings of the IEEE International Conference on Computer Vision, 5845–5854.
- Nm-net: Mining reliable neighbors for robust feature correspondences. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 215–224.
- Progressive Correspondence Pruning by Consensus Learning. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 6464–6473.