
PNL: Efficient Long-Range Dependencies Extraction with Pyramid Non-Local Module for Action Recognition (2006.05091v1)

Published 9 Jun 2020 in cs.CV

Abstract: Capturing long-range spatiotemporal dependencies plays an essential role in improving video features for action recognition. The non-local block, inspired by non-local means, is designed to address this challenge and has shown excellent performance. However, the non-local block brings a significant increase in computation cost to the original network. It also lacks the ability to model regional correlation in videos. To address these limitations, we propose the Pyramid Non-Local (PNL) module, which extends the non-local block by incorporating regional correlation at multiple scales through a pyramid-structured module. This extension improves the effectiveness of the non-local operation by attending to the interactions between different regions. Empirical results demonstrate the effectiveness and efficiency of our PNL module, which achieves state-of-the-art performance of 83.09% on the Mini-Kinetics dataset, with decreased computation cost compared to the non-local block.
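
The abstract does not spell out how the pyramid is constructed, so the following is only a rough sketch of the general idea, not the authors' module. It assumes a flattened sequence of feature vectors, dot-product similarity with a softmax, average pooling to form regional features, and a simple mean across scales. The function names and the `regions=(2, 4)` setting are hypothetical choices made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def aggregate(queries, keys):
    """Non-local aggregation: each query becomes a similarity-weighted
    average of all key features (dot-product similarity, softmax weights)."""
    out = []
    for q in queries:
        weights = softmax([dot(q, k) for k in keys])
        out.append([sum(w * k[c] for w, k in zip(weights, keys))
                    for c in range(len(q))])
    return out


def average_pool(features, region):
    """Average each run of `region` consecutive positions into one
    regional feature (assumed pooling scheme, not taken from the paper)."""
    pooled = []
    for start in range(0, len(features), region):
        chunk = features[start:start + region]
        pooled.append([sum(col) / len(chunk) for col in zip(*chunk)])
    return pooled


def non_local(features):
    """Plain non-local operation with a residual connection: every
    position attends to every other position (N x N comparisons)."""
    agg = aggregate(features, features)
    return [[f + a for f, a in zip(feat, row)]
            for feat, row in zip(features, agg)]


def pyramid_non_local(features, regions=(2, 4)):
    """Hypothetical multi-scale variant: every position attends to pooled
    regions at several scales, and the per-scale results are averaged."""
    per_scale = [aggregate(features, average_pool(features, r))
                 for r in regions]
    out = []
    for i, feat in enumerate(features):
        mean = [sum(scale[i][c] for scale in per_scale) / len(per_scale)
                for c in range(len(feat))]
        out.append([f + m for f, m in zip(feat, mean)])
    return out


features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]
plain = non_local(features)
pyramid = pyramid_non_local(features)
```

In this sketch, pooling with region size r cuts the number of pairwise comparisons from N × N to roughly N × N/r per scale, which is one plausible way a regional formulation could lower cost. Whether the PNL module achieves its savings this way is not stated in the abstract.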

Authors (6)
  1. Yuecong Xu (31 papers)
  2. Haozhi Cao (23 papers)
  3. Jianfei Yang (78 papers)
  4. Kezhi Mao (24 papers)
  5. Jianxiong Yin (24 papers)
  6. Simon See (74 papers)
Citations (5)
