
Non-local RoI for Cross-Object Perception (1811.10002v1)

Published 25 Nov 2018 in cs.CV and cs.LG

Abstract: We present a generic and flexible module that encodes region proposals by both their intrinsic features and their extrinsic correlations to the other proposals. The proposed non-local region of interest (NL-RoI) can be seamlessly adapted into different generalized R-CNN architectures to better address various perception tasks. Existing R-CNN-based techniques treat RoIs independently and make predictions solely from the image features within each region proposal. However, the pairwise relationships between proposals can provide further useful information for detection and segmentation. NL-RoI is thus formulated to enrich each RoI representation with information from all other RoIs, yielding a simple, low-cost, yet effective module for region-based convolutional networks. Our experimental results show that NL-RoI can improve the performance of Faster/Mask R-CNN for object detection and instance segmentation.
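
The idea of enriching each RoI with information from all other RoIs can be illustrated with a minimal sketch. This is not the paper's implementation: it assumes a generic non-local (attention-style) operation with scaled dot-product affinities and a softmax, omits any learned projections, and concatenates the aggregated context onto the original feature. The names `nl_roi_sketch` and `exclude_self` are hypothetical.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def nl_roi_sketch(roi_feats, exclude_self=True):
    """Append to each RoI feature a weighted sum of the other RoIs' features.

    roi_feats: list of N feature vectors, each a list of D floats
               (assumed to be pooled per-RoI features, e.g. after RoI pooling).
    Returns a list of N vectors of length 2 * D: [intrinsic | extrinsic].
    """
    n = len(roi_feats)
    d = len(roi_feats[0])
    scale = math.sqrt(d)
    enriched = []
    for i in range(n):
        # Pairwise affinity between RoI i and every RoI j (scaled dot product).
        scores = [
            sum(a * b for a, b in zip(roi_feats[i], roi_feats[j])) / scale
            for j in range(n)
        ]
        # Assumption: optionally mask RoI i itself so the context comes
        # only from the *other* proposals.
        if exclude_self and n > 1:
            scores[i] = -math.inf
        weights = softmax(scores)
        # Extrinsic context: affinity-weighted average of RoI features.
        context = [
            sum(weights[j] * roi_feats[j][k] for j in range(n))
            for k in range(d)
        ]
        enriched.append(list(roi_feats[i]) + context)
    return enriched


# Toy usage: three RoIs with 2-dimensional features.
feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
enriched = nl_roi_sketch(feats)
```

In this sketch the first half of each output vector is the unchanged per-RoI feature and the second half summarizes the other proposals, so a downstream detection or segmentation head would see both. In a real network the affinities would typically be computed on learned embeddings rather than raw features.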

Authors (4)
  1. Shou-Yao Roy Tseng (4 papers)
  2. Hwann-Tzong Chen (38 papers)
  3. Shao-Heng Tai (2 papers)
  4. Tyng-Luh Liu (21 papers)
Citations (1)
