Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition (2108.04536v1)

Published 10 Aug 2021 in cs.CV and cs.LG

Abstract: The task of skeleton-based action recognition remains a core challenge in human-centred scene understanding due to the multiple granularities and large variation in human motion. Existing approaches typically employ a single neural representation for different motion patterns, which has difficulty in capturing fine-grained action classes given limited training data. To address the aforementioned problems, we propose a novel multi-granular spatio-temporal graph network for skeleton-based action classification that jointly models the coarse- and fine-grained skeleton motion patterns. To this end, we develop a dual-head graph network consisting of two interleaved branches, which enables us to extract features at two spatio-temporal resolutions in an effective and efficient manner. Moreover, our network utilises a cross-head communication strategy to mutually enhance the representations of both heads. We conducted extensive experiments on three large-scale datasets, namely NTU RGB+D 60, NTU RGB+D 120, and Kinetics-Skeleton, and achieves the state-of-the-art performance on all the benchmarks, which validates the effectiveness of our method.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Tailin Chen (4 papers)
  2. Desen Zhou (10 papers)
  3. Jian Wang (967 papers)
  4. Shidong Wang (23 papers)
  5. Yu Guan (53 papers)
  6. Xuming He (109 papers)
  7. Errui Ding (156 papers)
Citations (67)

Summary

We haven't generated a summary for this paper yet.