Rethinking Learning Approaches for Long-Term Action Anticipation (2210.11566v1)

Published 20 Oct 2022 in cs.CV and cs.LG

Abstract: Action anticipation involves predicting future actions having observed the initial portion of a video. Typically, the observed video is processed as a whole to obtain a video-level representation of the ongoing activity in the video, which is then used for future prediction. We introduce ANTICIPATR which performs long-term action anticipation leveraging segment-level representations learned using individual segments from different activities, in addition to a video-level representation. We propose a two-stage learning approach to train a novel transformer-based model that uses these two types of representations to directly predict a set of future action instances over any given anticipation duration. Results on Breakfast, 50Salads, Epic-Kitchens-55, and EGTEA Gaze+ datasets demonstrate the effectiveness of our approach.
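The abstract describes a model that fuses segment-level and video-level representations and directly predicts a set of future action instances for an arbitrary anticipation duration. A minimal, illustrative sketch of that interface is below; every name, shape, and the averaging-based fusion are assumptions standing in for the paper's transformer, not the authors' code.

```python
from dataclasses import dataclass
from typing import List, Sequence

@dataclass
class ActionInstance:
    label: str
    start: float  # seconds after the observed portion of the video ends
    end: float

def anticipate(segment_reprs: Sequence[Sequence[float]],
               video_repr: Sequence[float],
               anticipation_duration: float) -> List[ActionInstance]:
    """Toy stand-in for the transformer-based predictor: combine
    segment-level and video-level representations, then emit a set of
    future action instances covering the requested anticipation window."""
    # Placeholder fusion: average the segment representations and add the
    # video-level representation (the paper uses a learned transformer).
    fused = [sum(col) / len(segment_reprs) for col in zip(*segment_reprs)]
    fused = [f + v for f, v in zip(fused, video_repr)]
    # Emit dummy instances tiling the anticipation duration; a trained
    # model would instead decode labels and boundaries from `fused`.
    n = max(1, len(fused) // 2)
    step = anticipation_duration / n
    return [ActionInstance(f"action_{i}", i * step, (i + 1) * step)
            for i in range(n)]
```

Note that the anticipation duration is an input rather than a fixed horizon, matching the abstract's claim that the model predicts over "any given anticipation duration."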

Authors (3)
  1. Megha Nawhal
  2. Akash Abdu Jyothi
  3. Greg Mori
Citations (21)
