Boundary Attention: Learning curves, corners, junctions and grouping (2401.00935v3)
Abstract: We present a lightweight network that infers grouping and boundaries, including curves, corners and junctions. It operates in a bottom-up fashion, analogous to classical methods for sub-pixel edge localization and edge-linking, but with a higher-dimensional representation of local boundary structure, and notions of local scale and spatial consistency that are learned instead of designed. Our network uses a mechanism that we call boundary attention: a geometry-aware local attention operation that, when applied densely and repeatedly, progressively refines a pixel-resolution field of variables that specify the boundary structure in every overlapping patch within an image. Unlike many edge detectors that produce rasterized binary edge maps, our model provides a rich, unrasterized representation of the geometric structure in every local region. We find that its intentional geometric bias allows it to be trained on simple synthetic shapes and then generalize to extracting boundaries from noisy low-light photographs.
- A high-quality denoising dataset for smartphone cameras. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018.
- SLIC superpixels. Technical report, 2010.
- Deep ViT features as dense visual descriptors. In ECCV Workshops on What is Motion For?, page 4, 2021.
- John Canny. A computational approach to edge detection. IEEE Transactions on pattern analysis and machine intelligence, (6):679–698, 1986.
- Emerging properties in self-supervised vision transformers. In Proceedings of the IEEE/CVF international conference on computer vision, pages 9650–9660, 2021.
- Edwin Earl Catmull. A subdivision algorithm for computer display of curved surfaces. The University of Utah, 1974.
- Active contours without edges. IEEE Transactions on image processing, 10(2):266–277, 2001.
- Feature detection in human vision: A phase-dependent energy model. Proc. Royal Soc. B, 235(1280):221–245, 1988.
- Image denoising by sparse 3-D transform-domain collaborative filtering. IEEE Transactions on image processing, 16(8):2080–2095, 2007.
- Fast edge detection using structured forests, 2014.
- Efficient graph-based image segmentation. International Journal of Computer Vision, 59:167–181, 2004.
- Fundamentals of interactive computer graphics. Addison-Wesley Longman Publishing Co., Inc., 1982.
- William T. Freeman. Steerable filters and local analysis of image structure. PhD thesis, Massachusetts Institute of Technology, 1992.
- The design and use of steerable filters. IEEE Transactions on pattern analysis and machine intelligence, 13(9):891–906, 1991.
- A combined corner and edge detector. In Alvey vision conference, pages 10–5244. Citeseer, 1988.
- Paul S. Heckbert. Fundamentals of texture mapping and image warping. Citeseer, 1989.
- Bridging nonlinearities and stochastic regularizers with gaussian error linear units. CoRR, abs/1606.08415, 2016.
- Snakes: Active contour models. International journal of computer vision, 1(4):321–331, 1988.
- Segment anything. arXiv:2304.02643, 2023.
- Efficient inference in fully connected CRFs with Gaussian edge potentials. Advances in neural information processing systems, 2011.
- Using contours to detect and localize junctions in natural images. In IEEE Conference on Computer Vision and Pattern Recognition, pages 1–8. IEEE, 2008.
- Learning to detect natural image boundaries using local brightness, color, and texture cues. IEEE Transactions on pattern analysis and machine intelligence, 26(5):530–549, 2004.
- Fast detection of curved edges at low SNR. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016.
- On detection of faint edges in noisy images. IEEE Transactions on pattern analysis and machine intelligence, 42(4):894–908, 2019.
- Trace inference, curvature consistency, and curve detection. IEEE Transactions on pattern analysis and machine intelligence, 11(8):823–839, 1989.
- Ken Perlin. An image synthesizer. ACM Siggraph Computer Graphics, 19(3):287–296, 1985.
- Juan Pineda. A parallel algorithm for polygon rasterization. In Proceedings of the 15th annual conference on Computer graphics and interactive techniques, pages 17–20, 1988.
- EDTER: Edge detection with transformer. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
- Learning conditional random fields for stereo. In 2007 IEEE Conference on Computer Vision and Pattern Recognition, pages 1–8, 2007.
- High-accuracy stereo depth maps using structured light. In 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings., pages I–I, 2003.
- Pixel difference networks for efficient edge detection. In Proceedings of the IEEE/CVF international conference on computer vision, pages 5117–5127, 2021.
- MLP-Mixer: An all-MLP architecture for vision. Advances in neural information processing systems, 34:24261–24272, 2021.
- Field of junctions: Extracting boundary structure at low SNR. In Proceedings of the IEEE/CVF international conference on computer vision, 2021.
- A physics-based noise formation model for extreme low-light raw denoising. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2758–2767, 2020.
- Holistically-nested edge detection. In Proceedings of the IEEE/CVF international conference on computer vision, pages 1395–1403, 2015.
- Anisotropic-scale junction detection and matching for indoor images. IEEE Transactions on Image Processing, 27(1):78–91, 2017.