
HARGPT: Are LLMs Zero-Shot Human Activity Recognizers? (2403.02727v1)

Published 5 Mar 2024 in cs.CL, cs.AI, and cs.HC

Abstract: There is an ongoing debate regarding the potential of LLMs as foundational models seamlessly integrated with Cyber-Physical Systems (CPS) for interpreting the physical world. In this paper, we carry out a case study to answer the following question: Are LLMs capable of zero-shot human activity recognition (HAR)? Our study, HARGPT, presents an affirmative answer by demonstrating that LLMs can comprehend raw IMU data and perform HAR tasks in a zero-shot manner, given only appropriate prompts. HARGPT feeds raw IMU data into LLMs and uses the role-play and think-step-by-step strategies for prompting. We benchmark HARGPT on GPT4 using two public datasets with different inter-class similarities and compare it against baselines based on both traditional machine learning and state-of-the-art deep classification models. Remarkably, LLMs successfully recognize human activities from raw IMU data and consistently outperform all the baselines on both datasets. Our findings indicate that, with effective prompting, LLMs can interpret raw IMU data based on their knowledge base, and that they show promising potential for analyzing raw sensor data of the physical world.
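
The abstract names three ingredients: raw IMU readings as input, a role-play instruction, and a think-step-by-step instruction. The sketch below shows one way such a prompt might be assembled in Python. It is an illustration only: the prompt wording, the function name `build_har_prompt`, the sampling rate, and the candidate activity labels are all assumptions and are not taken from the paper. No LLM is called; the function only returns the prompt text.

```python
from typing import Sequence, Tuple

Sample = Tuple[float, float, float]  # one 3-axis reading: (x, y, z)


def format_axis(values: Sequence[float], precision: int = 2) -> str:
    """Render one axis of readings as a comma-separated string."""
    return ", ".join(f"{v:.{precision}f}" for v in values)


def build_har_prompt(
    accel: Sequence[Sample],
    gyro: Sequence[Sample],
    sample_rate_hz: int,
    candidate_activities: Sequence[str],
) -> str:
    """Assemble a zero-shot HAR prompt (hypothetical wording, not the paper's)."""
    if len(accel) != len(gyro):
        raise ValueError("accel and gyro must contain the same number of samples")

    lines = [
        # Role-play instruction (assumed phrasing).
        "You are an expert in analyzing IMU signals for human activity recognition.",
        f"The data below was sampled at {sample_rate_hz} Hz.",
        "Accelerometer readings:",
    ]
    # Raw values are written per axis, without any feature extraction.
    for name, idx in (("x", 0), ("y", 1), ("z", 2)):
        lines.append(f"{name}-axis: {format_axis([s[idx] for s in accel])}")
    lines.append("Gyroscope readings:")
    for name, idx in (("x", 0), ("y", 1), ("z", 2)):
        lines.append(f"{name}-axis: {format_axis([s[idx] for s in gyro])}")

    lines.append("Candidate activities: " + ", ".join(candidate_activities) + ".")
    # Think-step-by-step instruction (assumed phrasing).
    lines.append("Analyze the signal step by step, then name the single most likely activity.")
    return "\n".join(lines)


if __name__ == "__main__":
    # Tiny made-up window; the labels are placeholders, not the paper's classes.
    demo = build_har_prompt(
        accel=[(0.1, 0.2, 9.8), (0.0, 0.3, 9.7)],
        gyro=[(0.01, 0.02, 0.03), (0.0, 0.0, 0.0)],
        sample_rate_hz=10,
        candidate_activities=["walking", "sitting"],
    )
    print(demo)
```

The returned string would then be sent to whichever chat model is being evaluated. Keeping prompt construction separate from the model call makes it easy to vary the role-play or step-by-step wording independently.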

Authors (3)
  1. Sijie Ji (15 papers)
  2. Xinzhe Zheng (8 papers)
  3. Chenshu Wu (19 papers)
Citations (15)