Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Multi-modality action recognition based on dual feature shift in vehicle cabin monitoring (2401.14838v1)

Published 26 Jan 2024 in cs.CV

Abstract: Driver Action Recognition (DAR) is crucial in vehicle cabin monitoring systems. In real-world applications, it is common for vehicle cabins to be equipped with cameras featuring different modalities. However, multi-modality fusion strategies for the DAR task within car cabins have rarely been studied. In this paper, we propose a novel yet efficient multi-modality driver action recognition method based on dual feature shift, named DFS. DFS first integrates complementary features across modalities by performing modality feature interaction. Meanwhile, DFS achieves the neighbour feature propagation within single modalities, by feature shifting among temporal frames. To learn common patterns and improve model efficiency, DFS shares feature extracting stages among multiple modalities. Extensive experiments have been carried out to verify the effectiveness of the proposed DFS model on the Drive&Act dataset. The results demonstrate that DFS achieves good performance and improves the efficiency of multi-modality driver action recognition.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Dan Lin (31 papers)
  2. Philip Hann Yung Lee (1 paper)
  3. Yiming Li (199 papers)
  4. Ruoyu Wang (95 papers)
  5. Kim-Hui Yap (28 papers)
  6. Bingbing Li (24 papers)
  7. You Shing Ngim (1 paper)
Citations (3)

Summary

We haven't generated a summary for this paper yet.