Continuous Cost Aggregation for Dual-Pixel Disparity Extraction
Abstract: Recent work has shown that depth information can be obtained from Dual-Pixel (DP) sensors. A DP arrangement provides two views in a single shot, thus resembling a stereo image pair with a tiny baseline. However, the different point spread function (PSF) per view, as well as the small disparity range, makes the use of typical stereo matching algorithms problematic. To address the above shortcomings, we propose a Continuous Cost Aggregation (CCA) scheme within a semi-global matching framework that is able to provide accurate continuous disparities from DP images. The proposed algorithm fits parabolas to matching costs and aggregates parabola coefficients along image paths. The aggregation step is performed subject to a quadratic constraint that not only enforces the disparity smoothness but also maintains the quadratic form of the total costs. This gives rise to an inherently efficient disparity propagation scheme with a pixel-wise minimization in closed form. Furthermore, the continuous form allows for a robust multi-scale aggregation that better compensates for the varying PSF. Experiments on DP data from both DSLR and phone cameras show that the proposed scheme attains state-of-the-art performance in DP disparity estimation.
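The idea of fitting parabolas and aggregating their coefficients can be illustrated with a minimal one-dimensional sketch. This is not the paper's implementation: it assumes each pixel's cost is modeled as a(d - m)^2, that the smoothness term along a path is λ(d - d')^2, and that a single scanline is aggregated in two directions. The function names (`fit_parabola`, `aggregate_path`, `solve_row`) and the parameter `lam` are hypothetical. Under these assumptions, minimizing over the previous pixel's disparity d' gives another parabola with curvature Aλ/(A + λ), so the aggregated cost stays quadratic and its minimizer is a curvature-weighted mean.

```python
import numpy as np


def fit_parabola(costs, disparities):
    """Fit a*(d - m)^2 + c through the three samples around the discrete minimum.

    Assumes `disparities` is uniformly spaced. Returns (a, m).
    """
    costs = np.asarray(costs, dtype=float)
    disparities = np.asarray(disparities, dtype=float)
    i = int(np.clip(np.argmin(costs), 1, len(costs) - 2))
    h = disparities[1] - disparities[0]
    c_l, c_0, c_r = costs[i - 1], costs[i], costs[i + 1]
    # The second difference gives the curvature; keep it strictly positive.
    a = max((c_l - 2.0 * c_0 + c_r) / (2.0 * h * h), 1e-6)
    # The first difference gives the offset of the vertex from the center sample.
    m = disparities[i] - (c_r - c_l) / (4.0 * a * h)
    # Keep the vertex inside the fitted interval (a choice made for this sketch).
    m = float(np.clip(m, disparities[i] - h, disparities[i] + h))
    return a, m


def aggregate_path(a, m, lam):
    """One directional pass that propagates parabola coefficients along a path."""
    A = np.empty(len(a), dtype=float)
    M = np.empty(len(a), dtype=float)
    A[0], M[0] = a[0], m[0]
    for p in range(1, len(a)):
        # min over d' of A*(d' - M)^2 + lam*(d - d')^2 is s*(d - M)^2.
        s = A[p - 1] * lam / (A[p - 1] + lam)
        A[p] = a[p] + s
        M[p] = (a[p] * m[p] + s * M[p - 1]) / A[p]
    return A, M


def solve_row(a, m, lam=1.0):
    """Sum forward and backward passes, then minimize in closed form per pixel."""
    a = np.asarray(a, dtype=float)
    m = np.asarray(m, dtype=float)
    A_f, M_f = aggregate_path(a, m, lam)
    A_b, M_b = aggregate_path(a[::-1], m[::-1], lam)
    A_b, M_b = A_b[::-1], M_b[::-1]
    # The sum of two parabolas is a parabola; its vertex is the weighted mean.
    return (A_f * M_f + A_b * M_b) / (A_f + A_b)
```

In this sketch, a pixel with low curvature (a weak, ambiguous match) contributes little to the weighted mean, so its disparity is pulled toward that of confident neighbors without any search over discrete disparity labels.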