Expanding Horizons in HCI Research Through LLM-Driven Qualitative Analysis (2401.04138v1)
Abstract: What would research be like if we still had to "send" papers typed on a typewriter? Our lives and research environments have continually evolved, often amid controversy over new methodologies. In this paper, we embrace this change by introducing a new approach to qualitative analysis in HCI using LLMs. We detail a method that uses LLMs for qualitative data analysis and present a quantitative framework that uses SBERT cosine similarity to evaluate performance. Our findings indicate that LLMs not only match the efficacy of traditional analysis methods but also offer unique insights. Through a novel dataset and benchmark, we explore the characteristics of LLMs in HCI research and suggest avenues for further exploration and application in the field.
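The evaluation idea named in the abstract, scoring output by the cosine similarity of sentence embeddings, can be illustrated with a minimal sketch. It assumes the embeddings have already been computed by a sentence-embedding model. The vectors and the names `human_code`, `llm_code`, and `unrelated` are hypothetical toy values, not data or identifiers from the paper, and the function below is a plain-Python stand-in rather than the authors' implementation.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


# Hypothetical 3-dimensional "embeddings" (assumption for illustration only;
# real sentence embeddings typically have hundreds of dimensions).
human_code = [1.0, 2.0, 0.0]   # embedding of a human-written code or theme
llm_code = [2.0, 4.0, 0.0]     # same direction, different magnitude
unrelated = [0.0, 0.0, 3.0]    # orthogonal to both vectors above

same_direction = cosine_similarity(human_code, llm_code)
orthogonal = cosine_similarity(human_code, unrelated)
```

Because cosine similarity depends only on direction, `human_code` and `llm_code` score as maximally similar despite their different lengths, while the orthogonal `unrelated` vector scores zero. A score near 1 would therefore suggest that an LLM-generated theme is semantically close to a human-generated one, under the assumption that the embedding model captures that semantics.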