
IMUTube: Automatic Extraction of Virtual on-body Accelerometry from Video for Human Activity Recognition (2006.05675v2)

Published 29 May 2020 in cs.CV and eess.IV

Abstract: The lack of large-scale, labeled data sets impedes progress in developing robust and generalized predictive models for on-body sensor-based human activity recognition (HAR). Labeled data in human activity recognition is scarce and hard to come by, as sensor data collection is expensive, and the annotation is time-consuming and error-prone. To address this problem, we introduce IMUTube, an automated processing pipeline that integrates existing computer vision and signal processing techniques to convert videos of human activity into virtual streams of IMU data. These virtual IMU streams represent accelerometry at a wide variety of locations on the human body. We show how the virtually-generated IMU data improves the performance of a variety of models on known HAR datasets. Our initial results are very promising, but the greater promise of this work lies in a collective approach by the computer vision, signal processing, and activity recognition communities to extend this work in ways that we outline. This should lead to on-body, sensor-based HAR becoming yet another success story in large-dataset breakthroughs in recognition.
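To make the idea of "virtual accelerometry" concrete, here is a minimal sketch of one step such a pipeline might include: estimating linear acceleration from the 3D trajectory of a tracked body location by taking a second-order finite difference. This is an illustration under stated assumptions, not the authors' implementation. The function name `virtual_acceleration`, the input layout (a `(T, 3)` array of positions in metres), and the frame rate are all hypothetical. The sketch also ignores gravity and sensor orientation, both of which affect what a physical accelerometer reports.

```python
import numpy as np


def virtual_acceleration(positions, fps):
    """Estimate linear acceleration from a tracked 3D position trajectory.

    Hypothetical helper, not part of IMUTube's published code.
    positions: array of shape (T, 3), in metres, one row per video frame.
    fps: video frame rate in frames per second.
    Returns an array of shape (T - 2, 3) in m/s^2 (the first and last
    frames are dropped because the central difference needs neighbours).
    """
    positions = np.asarray(positions, dtype=float)
    if positions.ndim != 2 or positions.shape[0] < 3:
        raise ValueError("need a (T, 3) trajectory with at least 3 frames")
    dt = 1.0 / fps
    # Central second difference: (p[i+1] - 2 p[i] + p[i-1]) / dt^2
    return (positions[2:] - 2.0 * positions[1:-1] + positions[:-2]) / dt**2


# Synthetic check: a point accelerating at 2 m/s^2 along x.
fps = 30.0
t = np.arange(60) / fps
track = np.stack([0.5 * 2.0 * t**2, np.zeros_like(t), np.zeros_like(t)], axis=1)
accel = virtual_acceleration(track, fps)
```

On real pose estimates, differentiating twice amplifies frame-to-frame jitter, so some smoothing of the trajectory before this step would likely be needed.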

Authors (7)
  1. Hyeokhyen Kwon (12 papers)
  2. Catherine Tong (6 papers)
  3. Harish Haresamudram (12 papers)
  4. Yan Gao (157 papers)
  5. Gregory D. Abowd (4 papers)
  6. Nicholas D. Lane (97 papers)
  7. Thomas Ploetz (28 papers)
Citations (78)