Bilateral Reference for High-Resolution Dichotomous Image Segmentation (2401.03407v6)
Abstract: We introduce a novel bilateral reference framework (BiRefNet) for high-resolution dichotomous image segmentation (DIS). It comprises two essential components: the localization module (LM) and the reconstruction module (RM) with our proposed bilateral reference (BiRef). The LM aids in object localization using global semantic information. Within the RM, we utilize BiRef for the reconstruction process, where hierarchical patches of images provide the source reference and gradient maps serve as the target reference. These components collaborate to generate the final predicted maps. We also introduce auxiliary gradient supervision to enhance focus on regions with finer details. Furthermore, we outline practical training strategies tailored for DIS to improve map quality and training process. To validate the general applicability of our approach, we conduct extensive experiments on four tasks to evince that BiRefNet exhibits remarkable performance, outperforming task-specific cutting-edge methods across all benchmarks. Our codes are available at https://github.com/ZhengPeng7/BiRefNet.
- Frequency-tuned salient region detection. In CVPR, 2009.
- Encoder-decoder with atrous separable convolution for semantic image segmentation. In ECCV, 2018.
- Deformable convolutional networks. In ICCV, 2017.
- Enabling trimap-free image matting with a frequency-guided saliency-aware network via joint learning. IEEE TMM, 25:4868–4879, 2022.
- Recurrent multi-scale transformer for high-resolution salient object detection. In ACM MM, 2023.
- Structure-measure: A new way to evaluate foreground maps. In ICCV, 2017.
- Enhanced-alignment measure for binary foreground map evaluation. In IJCAI, 2018.
- Camouflaged object detection. In CVPR, 2020.
- Concealed object detection. IEEE TPAMI, 44(10):6024–6042, 2022.
- Advances in deep concealed scene understanding. VI, 1(1):16, 2023a.
- Salient objects in clutter. IEEE TPAMI, 45(2):2344–2366, 2023b.
- A pyramid-based approach to segmentation applied to region matching. IEEE TPAMI, 8(5):639–650, 1986.
- Deep residual learning for image recognition. In CVPR, 2016.
- High-resolution iterative feedback network for camouflaged object detection. In AAAI, 2023.
- Feature shrinkage pyramid for camouflaged object detection with transformers. In CVPR, 2023.
- Deep gradient learning for efficient camouflaged object detection. MIR, 20(1):92–108, 2023.
- Revisiting image pyramid structure for high resolution salient object detection. In ACCV, 2022.
- Adam: A method for stochastic optimization. In ICLR, 2015.
- Deep laplacian pyramid networks for fast and accurate super-resolution. In CVPR, 2017.
- Fast and accurate image super-resolution with deep laplacian pyramid networks. IEEE TPAMI, 41(11):2599–2613, 2018.
- Bridging composite and real: towards end-to-end deep image matting. IJCV, 130(2):246–266, 2022.
- Locate, refine and restore: A progressive enhancement network for camouflaged object detection. In IJCAI, 2023.
- Microsoft coco: Common objects in context. In ECCV, 2014.
- Feature pyramid networks for object detection. In CVPR, 2017.
- Swin transformer: Hierarchical vision transformer using shifted windows. In ICCV, 2021.
- Zoom in and out: A mixed-scale triplet network for camouflaged object detection. In CVPR, 2022.
- PyTorch: An imperative style, high-performance deep learning library. NeurIPS, 2019.
- Unite-divide-unite: Joint boosting trunk and structure for high-accuracy dichotomous image segmentation. In ACM MM, 2023.
- Basnet: Boundary-aware salient object detection. In CVPR, 2019.
- U2-net: Going deeper with nested u-structure for salient object detection. PR, 106:107404, 2020.
- Highly accurate dichotomous image segmentation. In ECCV, 2022.
- U-net: Convolutional networks for biomedical image segmentation. In MICCAI, 2015.
- High quality segmentation for ultra high-resolution images. In CVPR, 2022.
- Boundary-guided camouflaged object detection. In IJCAI, 2022.
- Look closer to segment better: Boundary patch refinement for instance segmentation. In CVPR, 2021a.
- Disentangled high quality salient object detection. In CVPR, 2021b.
- Deep high-resolution representation learning for visual recognition. IEEE TPAMI, 43(10):3349–3364, 2020.
- Learning to detect salient objects with image-level supervision. In CVPR, 2017.
- Label decoupling framework for salient object detection. In CVPR, 2020.
- Pyramid grafting network for one-stage high resolution saliency detection. In CVPR, 2022.
- Deep image matting. In CVPR, 2017.
- Camoformer: Masked separable attention for camouflaged object detection. arXiv, 2022.
- Mask guided matting via progressive refinement network. In CVPR, 2021.
- Towards high-resolution salient object detection. In CVPR, 2019.
- Pyramid scene parsing network. In CVPR, 2017.
- Icnet for real-time semantic segmentation on high-resolution images. In ECCV, 2018.
- Detecting camouflaged object in frequency domain. In CVPR, 2022.
- Dichotomous image segmentation with frequency priors. In IJCAI, 2023.
- Deepcrack: Learning hierarchical convolutional features for crack detection. IEEE TIP, 28(3):1498–1512, 2018.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.