Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Explore Human Parsing Modality for Action Recognition (2401.02138v1)

Published 4 Jan 2024 in cs.CV

Abstract: Multimodal-based action recognition methods have achieved high success using pose and RGB modality. However, skeletons sequences lack appearance depiction and RGB images suffer irrelevant noise due to modality limitations. To address this, we introduce human parsing feature map as a novel modality, since it can selectively retain effective semantic features of the body parts, while filtering out most irrelevant noise. We propose a new dual-branch framework called Ensemble Human Parsing and Pose Network (EPP-Net), which is the first to leverage both skeletons and human parsing modalities for action recognition. The first human pose branch feeds robust skeletons in graph convolutional network to model pose features, while the second human parsing branch also leverages depictive parsing feature maps to model parsing festures via convolutional backbones. The two high-level features will be effectively combined through a late fusion strategy for better action recognition. Extensive experiments on NTU RGB+D and NTU RGB+D 120 benchmarks consistently verify the effectiveness of our proposed EPP-Net, which outperforms the existing action recognition methods. Our code is available at: https://github.com/liujf69/EPP-Net-Action.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (59)
  1. IEEE Transactions on Multimedia 24, 1488–1502 (2022)
  2. IEEE Transactions on Pattern Analysis and Machine Intelligence 38(1), 14–29 (2016)
  3. IEEE Transactions on Image Processing 26(5), 2149–2162 (2017)
  4. IEEE Transactions on Image Processing (2024)
  5. IEEE Transactions on Multimedia (2023)
  6. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2021
  7. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020
  8. In: Proceedings of the 28th ACM International Conference on Multimedia (ACM MM), 2020
  9. IEEE Transactions on Circuits and Systems for Video Technology 32(12), 8646–8659 (2022)
  10. In: Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2023
  11. Cyborg and Bionic Systems 2022, 0002 (2022)
  12. In: Proceedings of the IEEE International Conference on Intelligent Robots and Systems (IROS), 2023
  13. arXiv:2201.02849 (2022)
  14. In: Proceedings of the European Conference on Computer Vision (ECCV), 2020
  15. IEEE Transactions on Circuits and Systems for Video Technology 32(8), 5281–5292 (2022)
  16. CAAI Transactions on Intelligence Technology 6(1), 80–92 (2021)
  17. IEEE Transactions on Multimedia (2022)
  18. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022
  19. CAAI Transactions on Intelligence Technology 7(1), 46–55 (2022)
  20. IEEE Transactions on Image Processing 28(6), 2799–2812 (2019)
  21. IEEE Transactions on Pattern Analysis and Machine Intelligence 41(4), 871–885 (2019)
  22. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2015
  23. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012
  24. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, 2012
  25. Pattern Recognition 68, 346–362 (2017)
  26. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017
  27. In: Proceedings of the CAAI International Conference on Artificial Intelligence (CICAI), 2022
  28. IEEE Transactions on Multimedia pp. 1–13 (2023)
  29. In: Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2018
  30. Proceedings of the AAAI Conference on Artificial Intelligence (AAAI) (2021)
  31. In: Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV), 2023
  32. In: Proceedings of the International Conference on Learning Representations (ICLR), 2021
  33. IEEE Transactions on Circuits and Systems for Video Technology 32(3), 1250–1261 (2022)
  34. In: Proceedings of the Asian Conference on Computer Vision (ACCV), 2020
  35. In: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2023
  36. arXiv:2201.04676 (2022)
  37. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021
  38. CAAI Transactions on Intelligence Technology 3(4), 219–227 (2018)
  39. CAAI Transactions on Intelligence Technology 1(1), 14–29 (2016)
  40. IEEE Transactions on Pattern Analysis and Machine Intelligence 41(2), 423–443 (2019)
  41. CAAI Transactions on Intelligence Technology 8(1), 247–259 (2023)
  42. CAAI Transactions on Intelligence Technology 8(2), 390–400 (2023)
  43. CAAI Transactions on Intelligence Technology 7(4), 744–757 (2022)
  44. IEEE Transactions on Image Processing (2023)
  45. arXiv:2208.05318 (2022)
  46. arXiv:2305.12398 (2023)
  47. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014
  48. IEEE Transactions on Pattern Analysis and Machine Intelligence 37(12), 2402–2414 (2015)
  49. In: Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2019
  50. IEEE Transactions on Pattern Analysis and Machine Intelligence (2020)
  51. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019
  52. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2023
  53. Ultralytics: ultralytics/yolov5: v7.0 - YOLOv5 SOTA Realtime Instance Segmentation (2022)
  54. In: Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2021
  55. Neurocomputing (2023)
  56. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016
  57. IEEE Transactions on Pattern Analysis and Machine Intelligence 42(10), 2684–2701 (2020)
  58. arXiv:1409.1556 (2015)
  59. arXiv:2303.11331 (2023)
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Jinfu Liu (9 papers)
  2. Runwei Ding (16 papers)
  3. Yuhang Wen (7 papers)
  4. Nan Dai (2 papers)
  5. Fanyang Meng (14 papers)
  6. Shen Zhao (37 papers)
  7. Mengyuan Liu (72 papers)
Citations (4)

Summary

We haven't generated a summary for this paper yet.