DensePose From WiFi (2301.00250v1)

Published 31 Dec 2022 in cs.CV

Abstract: Advances in computer vision and machine learning techniques have led to significant development in 2D and 3D human pose estimation from RGB cameras, LiDAR, and radars. However, human pose estimation from images is adversely affected by occlusion and lighting, which are common in many scenarios of interest. Radar and LiDAR technologies, on the other hand, need specialized hardware that is expensive and power-intensive. Furthermore, placing these sensors in non-public areas raises significant privacy concerns. To address these limitations, recent research has explored the use of WiFi antennas (1D sensors) for body segmentation and key-point body detection. This paper further expands on the use of the WiFi signal in combination with deep learning architectures, commonly used in computer vision, to estimate dense human pose correspondence. We developed a deep neural network that maps the phase and amplitude of WiFi signals to UV coordinates within 24 human regions. The results of the study reveal that our model can estimate the dense pose of multiple subjects, with comparable performance to image-based approaches, by utilizing WiFi signals as the only input. This paves the way for low-cost, broadly accessible, and privacy-preserving algorithms for human sensing.

Authors (3)

Jiaqi Geng (4 papers)
Dong Huang (102 papers)
Fernando de la Torre (49 papers)

Citations (17)

View on Semantic Scholar

Summary

Overview of "DensePose From WiFi"

The paper "DensePose From WiFi" presents a paper that leverages WiFi antennas in combination with deep learning structures parallel to those used in computer vision to estimate dense human pose correspondences. This work offers an alternative to traditional 2D and 3D human pose estimation methods which utilize cameras, LiDAR, and radar technologies. These conventional methods, while effective, are hindered by challenges including lighting conditions and occlusion in the case of cameras, and the need for expensive, specialized hardware for LiDAR and radar systems along with privacy concerns. The proposed use of WiFi signals, however, provides a low-cost, widely accessible solution that circumvents the privacy issues associated with video acquisition.

Methodology

The methodology involves a novel neural network that transforms the phase and amplitude of WiFi signals into UV coordinates across 24 segmented human regions. The inputs to the network are Channel State Information (CSI), which encapsulates the relative change of WiFi signal in terms of amplitude and phase. The network features a modality translation mechanism to move from the WiFi domain to an image domain suitable for DensePose estimation. This translation improves training efficiency via transfer learning, using a pre-trained image-based DensePose network as a mentor model.

A series of sanitization techniques are applied to the CSI data to resolve phase irregularities, making complete use of both phase and amplitude information. The DensePose-RCNN architecture is modified to process these signals, incorporating branches to jointly predict the keypoints and the dense human body pose. The report details an efficient approach that enhances accuracy in capturing body detail by supervising feature extraction processes with a robust image-based model. Key insight is gained into how phase can, when effectively unraveled into continuous vectors, contribute significantly to accuracy.

Results

In terms of numerical performance, the model shows comparable results to image-based approaches when estimating dense poses, notable by achieving an Average Precision (AP) at an IOU threshold of 0.50 of 87.2 for human detection in the "same layout" protocol. This suggests that the model's efficacy in detecting and localizing the collective silhouette of human bodies in varied spatial environments is substantially promising. However, when breaking into unseen "different layout" data, performance had a noticeable decline – a reflection of a common domain adaptation challenge across similar multi-modal tasks.

An ablation paper reaffirms the critical importance of phase information, keypoint supervision, and transfer learning, with each proving to be substantial enhancements to model efficiency and accuracy. The experiments demonstrated a performance improvement in dense pose detection with carefully optimized use of CSI phase sanitization and learning transfer processes.

Implications and Future Work

This research presages an intriguing direction for human pose estimation, establishing WiFi as a legitimate sensory domain for complex human monitoring tasks. Given the heightened privacy concerns realizable in current technological epochs, this WiFi-based approach sharply converges with the demand for less intrusive, cost-effective monitoring ecosystems that suit domestic and human-centric environments.

Nonetheless, the paper acknowledges challenges, such as the variability in WiFi signal propagation due to environmental changes and the restricted availability of rich multi-layout training datasets. The potential extension to 3D body reconstruction was identified, a progression that if conquered, will critical augment the utility of such WiFi-driven systems by enabling even richer human pose modeling capabilities pertinent for robotics, smart environments, and health monitoring systems. Further research can also investigate strategies to overcome domain adaptation challenges and refining model reliability in diverse spatial layouts. This paper thus cultivates a substantive groundwork that beckons further exploration and adoption in AI-centric human monitoring.

PDF Markdown

Related Papers

Tweets

https://twitter.com/str4ngeMD/status/1804937361492418825

https://twitter.com/YouHitYourHead/status/1887575661327405549

https://twitter.com/steffanwatkins/status/1804917038676079007

https://twitter.com/cloudbring/status/1939530546532753750

https://twitter.com/edsgerdykestra/status/1752457356729000321

https://twitter.com/vincentmcasey/status/1863771294594113879

YouTube

Show All Videos

Reddit

DensePose From WiFi - AKA heat mapping persons via wifi inside a building (1 point, 0 comments)