Stereo-Matching Knowledge Distilled Monocular Depth Estimation Filtered by Multiple Disparity Consistency (2401.12019v2)
Abstract: In stereo-matching knowledge distillation methods of the self-supervised monocular depth estimation, the stereo-matching network's knowledge is distilled into a monocular depth network through pseudo-depth maps. In these methods, the learning-based stereo-confidence network is generally utilized to identify errors in the pseudo-depth maps to prevent transferring the errors. However, the learning-based stereo-confidence networks should be trained with ground truth (GT), which is not feasible in a self-supervised setting. In this paper, we propose a method to identify and filter errors in the pseudo-depth map using multiple disparity maps by checking their consistency without the need for GT and a training process. Experimental results show that the proposed method outperforms the previous methods and works well on various configurations by filtering out erroneous areas where the stereo-matching is vulnerable, especially such as textureless regions, occlusion boundaries, and reflective surfaces.
- “Unsupervised cnn for single view depth estimation: Geometry to the rescue,” in Proc. ECCV. Springer, 2016, pp. 740–756.
- “Unsupervised monocular depth estimation with left-right consistency,” in Proc. CVPR, 2017, pp. 270–279.
- “Digging into self-supervised monocular depth estimation,” in Proc. CVPR, 2019, pp. 3828–3838.
- “Learning monocular depth by distilling cross-domain stereo networks,” in Proc. ECCV, 2018, pp. 484–500.
- “A large rgb-d dataset for semi-supervised monocular depth estimation,” arXiv preprint arXiv:1904.10230, 2019.
- “Unsupervised domain adaptation for depth prediction from images,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 42, no. 10, pp. 2396–2409, 2019.
- “Adaptive confidence thresholding for monocular depth estimation,” in Proc. ICCV, October 2021, pp. 12808–12818.
- Matteo Poggi and S. Mattoccia, “Learning from scratch a confidence measure,” in Proc. BMVC, 2016.
- “Laf-net: Locally adaptive fusion networks for stereo confidence estimation,” in Proc. CVPR, 2019.
- “Reduction of aliasing artifacts by sign function approximation in light field depth estimation based on foreground–background separation,” IEEE Signal Process. Lett., vol. 25, no. 11, pp. 1750–1754, 2018.
- “Complex-valued disparity: Unified depth model of depth from stereo, depth from focus, and depth from defocus based on the light field gradient,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 43, no. 3, pp. 830–841, 2021.
- “Learning stereo from single images,” in Proc. ECCV. Springer, 2020, pp. 722–740.
- “On the uncertainty of self-supervised monocular depth estimation,” in Proc. CVPR, 2020.
- “Learning monocular depth estimation infusing traditional stereo knowledge,” in Proc. CVPR, June 2019.
- “Self-supervised monocular depth hints,” in Proc. ICCV, 2019, pp. 2162–2171.
- “Very deep convolutional networks for large-scale image recognition,” arXiv preprint arXiv:1409.1556, 2014.
- “Deep residual learning for image recognition,” in Proc. CVPR, 2016, pp. 770–778.
- “Learning to adapt for stereo,” in Proc. CVPR, 2019, pp. 9661–9670.
- “Depth map prediction from a single image using a multi-scale deep network,” in Advances in Neural Information Processing Systems, 2014, vol. 27.
- “Vision meets robotics: The kitti dataset,” Int. J. Robot. Res., 2013.
- “The cityscapes dataset for semantic urban scene understanding,” in Proc. CVPR, 2016.
- “Learning from scratch a confidence measure,” in Proceedings of the British Machine Vision Conference (BMVC), Edwin R. Hancock Richard C. Wilson and William A. P. Smith, Eds. September 2016, pp. 46.1–46.13, BMVA Press.
- “Beyond local reasoning for stereo confidence estimation with deep learning,” in Proceedings of the European Conference on Computer Vision (ECCV), September 2018.
- Woonghyun Ka (3 papers)
- Jae Young Lee (9 papers)
- Jaehyun Choi (16 papers)
- Junmo Kim (90 papers)