2000 character limit reached
Relation-Aware Pyramid Network (RapNet) for temporal action proposal (1908.03448v1)
Published 9 Aug 2019 in cs.CV
Abstract: In this technical report, we describe our solution to temporal action proposal (task 1) in ActivityNet Challenge 2019. First, we fine-tune a ResNet-50-C3D CNN on ActivityNet v1.3 based on Kinetics pretrained model to extract snippet-level video representations and then we design a Relation-Aware Pyramid Network (RapNet) to generate temporal multiscale proposals with confidence score. After that, we employ a two-stage snippet-level boundary adjustment scheme to re-rank the order of generated proposals. Ensemble methods are also been used to improve the performance of our solution, which helps us achieve 2nd place.
- Jialin Gao (18 papers)
- Zhixiang Shi (49 papers)
- Jiani Li (11 papers)
- Yufeng Yuan (15 papers)
- Jiwei Li (137 papers)
- Xi Zhou (43 papers)