Enhancing Robotic Arm Activity Recognition with Vision Transformers and Wavelet-Transformed Channel State Information (2407.06154v1)

Published 8 Jul 2024 in cs.RO

Abstract: Vision-based methods are commonly used in robotic arm activity recognition. These approaches typically rely on line-of-sight (LoS) and raise privacy concerns, particularly in smart home applications. Passive Wi-Fi sensing is a new paradigm for recognizing human and robotic arm activities that uses channel state information (CSI) measurements to identify activities in indoor environments. This paper proposes a novel machine learning approach, based on the discrete wavelet transform and vision transformers, for robotic arm activity recognition from CSI measurements in indoor settings. The method outperforms convolutional neural network (CNN) and long short-term memory (LSTM) models in robotic arm activity recognition, particularly when LoS is obstructed by barriers, and does not rely on external or internal sensors or visual aids. Experiments are conducted using four different data collection scenarios and four different robotic arm activities. The results demonstrate that the wavelet transform can significantly enhance the accuracy of vision transformer networks in robotic arm activity recognition.
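
As a rough illustration of the wavelet step mentioned in the abstract, the sketch below applies one level of a Haar discrete wavelet transform to a short amplitude series, soft-thresholds the detail coefficients, and reconstructs the signal. The abstract does not state the wavelet family, decomposition depth, or threshold, so the Haar wavelet, the single level, the threshold value, and the sample values in `csi_amplitude` are all assumptions made for illustration and are not the paper's pipeline.

```python
import math


def haar_dwt(signal):
    """One level of the Haar DWT: returns (approximation, detail) coefficients."""
    if len(signal) % 2 != 0:
        raise ValueError("signal length must be even")
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail


def haar_idwt(approx, detail):
    """Inverse of haar_dwt: rebuilds the signal from both coefficient sets."""
    s = math.sqrt(2.0)
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) / s)
        out.append((a - d) / s)
    return out


def soft_threshold(coeffs, t):
    """Shrink each coefficient toward zero by t; magnitudes below t become 0."""
    return [math.copysign(max(abs(c) - t, 0.0), c) for c in coeffs]


def denoise(signal, t):
    """Threshold only the detail (high-frequency) band, then reconstruct."""
    approx, detail = haar_dwt(signal)
    return haar_idwt(approx, soft_threshold(detail, t))


# Hypothetical amplitude samples for a single CSI subcarrier (not from the paper).
csi_amplitude = [1.0, 1.2, 0.9, 1.1, 3.0, 3.1, 2.9, 3.2]
smoothed = denoise(csi_amplitude, t=0.5)
print([round(v, 3) for v in smoothed])
```

With this threshold every detail coefficient in the sample falls below 0.5 in magnitude, so each pair of neighbouring samples collapses to its mean. A real pipeline would more likely use a multi-level decomposition from a wavelet library and then arrange the cleaned subcarrier series into an image-like array for the vision transformer, but those steps are not described in the abstract.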