
Expanding Horizons in HCI Research Through LLM-Driven Qualitative Analysis (2401.04138v1)

Published 7 Jan 2024 in cs.HC and cs.AI

Abstract: What would research be like if we still needed to "send" papers typed on a typewriter? Our lives and research environment have continually evolved, often accompanied by controversy over new methodologies. In this paper, we embrace this change by introducing a new approach to qualitative analysis in HCI using LLMs. We detail a method that uses LLMs for qualitative data analysis and present a quantitative framework using SBERT cosine similarity for performance evaluation. Our findings indicate that LLMs not only match the efficacy of traditional analysis methods but also offer unique insights. Through a novel dataset and benchmark, we explore LLMs' characteristics in HCI research, suggesting potential avenues for further exploration and application in the field.
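
The abstract names cosine similarity over SBERT sentence embeddings as the evaluation measure but does not describe the computation. The following is a minimal sketch of the cosine similarity step only, not the authors' implementation. The function name `cosine_similarity` and the 4-dimensional vectors are hypothetical stand-ins; in practice the vectors would be embeddings produced by a Sentence-BERT model for an LLM-generated theme and a human-coded theme.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


# Hypothetical toy vectors standing in for sentence embeddings (assumption:
# real SBERT embeddings have hundreds of dimensions, not four).
llm_theme = [0.2, 0.7, 0.1, 0.4]        # e.g. a theme proposed by an LLM
human_theme = [0.25, 0.65, 0.05, 0.45]  # e.g. a theme coded by a researcher
unrelated = [0.9, -0.1, 0.6, -0.3]      # e.g. an unrelated statement

sim_close = cosine_similarity(llm_theme, human_theme)
sim_far = cosine_similarity(llm_theme, unrelated)

print(f"LLM vs. human theme: {sim_close:.3f}")
print(f"LLM vs. unrelated:   {sim_far:.3f}")
```

A score near 1 indicates that two texts point in nearly the same direction in embedding space, while a score near 0 indicates little semantic overlap. How the paper aggregates such scores across themes is not stated in the abstract.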

