R-FCN: Object Detection via Region-based Fully Convolutional Networks
Abstract: We present region-based, fully convolutional networks for accurate and efficient object detection. In contrast to previous region-based detectors such as Fast/Faster R-CNN that apply a costly per-region subnetwork hundreds of times, our region-based detector is fully convolutional with almost all computation shared on the entire image. To achieve this goal, we propose position-sensitive score maps to address a dilemma between translation-invariance in image classification and translation-variance in object detection. Our method can thus naturally adopt fully convolutional image classifier backbones, such as the latest Residual Networks (ResNets), for object detection. We show competitive results on the PASCAL VOC datasets (e.g., 83.6% mAP on the 2007 set) with the 101-layer ResNet. Meanwhile, our result is achieved at a test-time speed of 170ms per image, 2.5-20x faster than the Faster R-CNN counterpart. Code is made publicly available at: https://github.com/daijifeng001/r-fcn
- Inside-outside net: Detecting objects in context with skip pooling and recurrent neural networks. In CVPR, 2016.
- Semantic image segmentation with deep convolutional nets and fully connected crfs. In ICLR, 2015.
- Instance-sensitive fully convolutional networks. arXiv:1603.08678, 2016.
- Scalable object detection using deep neural networks. In CVPR, 2014.
- The PASCAL Visual Object Classes (VOC) Challenge. IJCV, 2010.
- R. Girshick. Fast R-CNN. In ICCV, 2015.
- Rich feature hierarchies for accurate object detection and semantic segmentation. In CVPR, 2014.
- Spatial pyramid pooling in deep convolutional networks for visual recognition. In ECCV. 2014.
- Deep residual learning for image recognition. In CVPR, 2016.
- Imagenet classification with deep convolutional neural networks. In NIPS, 2012.
- Backpropagation applied to handwritten zip code recognition. Neural computation, 1989.
- K. Lenc and A. Vedaldi. R-CNN minus R. In BMVC, 2015.
- Microsoft COCO: Common objects in context. In ECCV, 2014.
- SSD: Single shot multibox detector. arXiv:1512.02325v2, 2015.
- Fully convolutional networks for semantic segmentation. In CVPR, 2015.
- S. Mallat. A wavelet tour of signal processing. Academic press, 1999.
- You only look once: Unified, real-time object detection. In CVPR, 2016.
- Faster R-CNN: Towards real-time object detection with region proposal networks. In NIPS, 2015.
- Object detection networks on convolutional feature maps. arXiv:1504.06066, 2015.
- ImageNet Large Scale Visual Recognition Challenge. IJCV, 2015.
- Overfeat: Integrated recognition, localization and detection using convolutional networks. In ICLR, 2014.
- Training region-based object detectors with online hard example mining. In CVPR, 2016.
- K. Simonyan and A. Zisserman. Very deep convolutional networks for large-scale image recognition. In ICLR, 2015.
- Going deeper with convolutions. In CVPR, 2015.
- Deep neural networks for object detection. In NIPS, 2013.
- Rethinking the inception architecture for computer vision. In CVPR, 2016.
- Selective search for object recognition. IJCV, 2013.
- C. L. Zitnick and P. Dollár. Edge boxes: Locating object proposals from edges. In ECCV, 2014.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.