Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Toward Scale-Invariance and Position-Sensitive Region Proposal Networks (1807.09528v1)

Published 25 Jul 2018 in cs.CV

Abstract: Accurately localising object proposals is an important precondition for high detection rate for the state-of-the-art object detection frameworks. The accuracy of an object detection method has been shown highly related to the average recall (AR) of the proposals. In this work, we propose an advanced object proposal network in favour of translation-invariance for objectness classification, translation-variance for bounding box regression, large effective receptive fields for capturing global context and scale-invariance for dealing with a range of object sizes from extremely small to large. The design of the network architecture aims to be simple while being effective and with real time performance. Without bells and whistles the proposed object proposal network significantly improves the AR at 1,000 proposals by $35\%$ and $45\%$ on PASCAL VOC and COCO dataset respectively and has a fast inference time of 44.8 ms for input image size of $640{2}$. Empirical studies have also shown that the proposed method is class-agnostic to be generalised for general object proposal.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Hsueh-Fu Lu (1 paper)
  2. Xiaofei Du (9 papers)
  3. Ping-Lin Chang (2 papers)
Citations (6)

Summary

We haven't generated a summary for this paper yet.