Revealing Patient-Reported Experiences in Healthcare from Social Media using the DAPMAV Framework (2210.04232v2)
Abstract: Understanding patient experience in healthcare is increasingly important and desired by medical professionals in a patient-centered care approach. Healthcare discourse on social media presents an opportunity to gain a unique perspective on patient-reported experiences, complementing traditional survey data. These social media reports often appear as first-hand accounts of patients' journeys through the healthcare system, whose details extend beyond the confines of structured surveys and at a far larger scale than focus groups. However, in contrast with the vast presence of patient-experience data on social media and the potential benefits the data offers, it attracts comparatively little research attention due to the technical proficiency required for text analysis. In this paper, we introduce the Design-Acquire-Process-Model-Analyse-Visualise (DAPMAV) framework to provide an overview of techniques and an approach to capture patient-reported experiences from social media data. We apply this framework in a case study on prostate cancer data from /r/ProstateCancer, demonstrate the framework's value in capturing specific aspects of patient concern (such as sexual dysfunction), provide an overview of the discourse, and show narrative and emotional progression through these stories. We anticipate this framework to apply to a wide variety of areas in healthcare, including capturing and differentiating experiences across minority groups, geographic boundaries, and types of illnesses.
- Australian Institute of Health and Welfare. Health Expenditure Australia 2019-20, 2021. Accessed 2022-05-24.
- Conceptual frameworks for health systems performance: a quest for effectiveness, quality, and improvement. International Journal for Quality in Health Care, 15(5):377–398, 2003.
- A framework for assessing the performance of health systems. Bulletin of the World Health Organization, 78:717–731, 2000.
- Examining the role of patient experience surveys in measuring health care quality. Medical Care Research and Review, 71(5):522–554, 2014.
- The patient experience and health outcomes. The New England Journal of Medicine, 2013.
- Lynn McDonald. Florence Nightingale and the early origins of evidence-based nursing. Evidence-Based Nursing, 4(3):68–69, 2001.
- Using hospital mortality rates to judge hospital performance: a bad idea that just won’t go away. BMJ, 340, 2010.
- What is the value of hospital mortality indicators, and are there ways to do better? Australian Health Review, 36(4):374–377, 2012.
- The National Health Information and Performance Principal Comittee. The Australian Health Performance Framework, 2017. Accessed: 2022-06-29.
- Gunnar Németh. Health related quality of life outcome instruments. European Spine Journal, 15(1):S44–S51, 2006.
- World Health Organization et al. Technical series on safer primary care. World Health Organization, 2017.
- Article Commentary: Patient-Reported Outcomes (PROs) and Patient-Reported Outcome Measures (PROMs). Health Services Insights, 6:HSI.S11093, 2013. PMID: 25114561.
- Commonwealth of Australia. Royal Commission into Aged Care Quality and Safety. Commonwealth of Australia, 2019. Accessed: 2022-05-18.
- Australian Institute of Health and Welfare. Patient experience of health care, 2017. Accessed: 2022-05-18.
- Mousa A Masadeh. Focus group: Reviews and practices. International Journal of Applied Science and Technology, 2(10), 2012.
- Lilla Vicsek. Issues in the Analysis of Focus Groups: Generalisability, Quantifiability, Treatment of Context and Quotations. Qualitative Report, 15(1):122–141, 2010.
- Australian Commission on Safety and Quality in Health Care. Australian Patient Experience Question Set, 2019. Accessed: 2022-05-18.
- Factors affecting response rates of the web survey: A systematic review. Computers in human behavior, 26(2):132–139, 2010.
- Bau Teresa Fernández-Luque Luis. Health and Social Media: Perfect Storm of Information. hir, 21(2):67–73, 2015.
- Social media and political engagement. Pew Internet & American Life Project, 19(1):2–13, 2012.
- Advocacy through social media: Exploring student engagement in addressing social issues. Bowen, GA, Gordon, NS, & Chojnacki, MK (2017). Advocacy Through Social Media: Exploring Student Engagement in Addressing Social Issues. Journal of Higher Education Outreach and Engagement, 21(3):5–30, 2017.
- Harnessing social media for health information management. Electronic Commerce Research and Applications, 27:139–151, 2018.
- Harnessing social media data for pharmacovigilance: a review of current state of the art, challenges and future directions. International Journal of Data Science and Analytics, 8(2):113–135, 2019.
- Harnessing the cloud of patient experience: using social media to detect poor quality healthcare. BMJ quality & safety, 22(3):251–255, 2013.
- Social media analysis and public opinion: The 2010 UK general election. Journal of computer-mediated communication, 20(2):204–220, 2015.
- Temporal patterns of happiness and information in a global social network: Hedonometrics and Twitter, 2011.
- Symptom extraction from the narratives of personal experiences with COVID-19 on Reddit. arXiv preprint arXiv:2005.10454, 2020.
- Empowering patients through social media: the benefits and challenges. Health Informatics Journal, 20(1):50–58, 2014.
- Who talks? the social psychology of illness support groups. American Psychologist, 55(2):205, 2000.
- Self-disclosure, privacy and the Internet. The Oxford handbook of Internet psychology, 2374252, 2007.
- Assessing patient-perceived hospital service quality and sentiment in malaysian public hospitals using machine learning and facebook reviews. International Journal of Environmental Research and Public Health, 18(18):9912, 2021.
- Natural language processing of Reddit data to evaluate dermatology patient experiences and therapeutics. Journal of the American Academy of Dermatology, 83(3):803–808, 2020.
- I leaked, then I Reddit: experiences and insight shared on urinary incontinence by Reddit users. International Urogynecology Journal, 31(2):243–248, 2020.
- Measuring patient-perceived quality of care in US hospitals using Twitter. BMJ quality & safety, 25(6):404–413, 2016.
- Natural language processing of patient-experience comments after primary Total knee arthroplasty. The Journal of Arthroplasty, 36(3):927–934, 2021.
- A sentiment analysis of breast cancer treatment experiences and healthcare perceptions across twitter. arXiv preprint arXiv:1805.09959, 2018.
- Inc. Meta Platforms. Meta reports first quarter 2022 results, 2022. Accessed: 2022-06-29.
- Inc. Reddit. Reddit by the Numbers, 2021. Accessed: 2022-06-29.
- Inc. Twitter. Twitter Announces First Quarter 2022 Results, 2022. Accessed: 2022-06-29.
- Twitter and research: a systematic literature review through text mining. IEEE Access, 8:67698–67717, 2020.
- The anatomy of Reddit: An overview of academic research. In Dynamics on and of Complex Networks, pages 183–204. Springer, 2017.
- Facebook shuts the gate after the horse has bolted, and hurts real research in the process. Internet Policy Review, 25, 2018.
- Mike Schroepfer. An update on our plans to restrict data access on Facebook. Facebook Newsroom, 4, 2018.
- The disinformation landscape and the lockdown of social platforms. Information, Communication & Society, 22(11):1531–1543, 2019.
- Twitter for sparking a movement, reddit for sharing the moment:# metoo through the lens of social media. arXiv preprint arXiv:1803.08022, 2018.
- Unravelling unstructured data: A wealth of information in big data. In 2015 4th International Conference on Reliability, Infocom Technologies and Optimization (ICRITO)(Trends and Future Directions), pages 1–6. IEEE, 2015.
- Latent Dirichlet Allocation. Journal of Machine Learning Research, 3(Jan):993–1022, 2003.
- Australian Institute of Health and Welfare. Prostate cancer in Australia. cancer series No. 79, 2013.
- The pushshift Reddit dataset. In Proceedings of the International AAAI Conference on Web and Social Media, volume 14, pages 830–839, 2020.
- Data preprocessing in data mining, volume 72. Springer, 2015.
- Text mining with R: A tidy approach. " O’Reilly Media, Inc.", 2017.
- A network approach to topic models. Science Advances, 4(7):eaaq1360, 2018.
- R: a language for data analysis and graphics. Journal of computational and graphical statistics, 5(3):299–314, 1996.
- Welcome to the tidyverse. Journal of open source software, 4(43):1686, 2019.
- tidytext: Text mining and analysis using tidy data principles in r. Journal of Open Source Software, 1(3):37, 2016.
- Curtis Murray. DAPMAV Online Supplementary Materials, 2022. Accessed 01 Oct 2022.
- Topic analysis of online reviews for two competitive products using latent Dirichlet allocation. Electronic Commerce Research and Applications, 29:142–156, 2018.
- Improving topic models with latent feature word representations. Transactions of the Association for Computational Linguistics, 3:299–313, 2015.
- Steven T Piantadosi. Zipf’s word frequency law in natural language: A critical review and future directions. Psychonomic bulletin & review, 21(5):1112–1130, 2014.
- Ah-Hwee Tan et al. Text mining: The state of the art and the challenges. In Proceedings of the pakdd 1999 workshop on knowledge disocovery from advanced databases, volume 8, pages 65–70, 1999.
- Finn Årup Nielsen. A new anew: Evaluation of a word list for sentiment analysis in microblogs. arXiv preprint arXiv:1103.2903, 2011.
- Mining and summarizing customer reviews. In Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 168–177, 2004.
- Crowdsourcing a word–emotion association lexicon. Computational intelligence, 29(3):436–465, 2013.
- Use of sentiment analysis for capturing patient experience from free-text comments posted online. Journal of medical Internet research, 15(11):e2721, 2013.
- Discovery of grounded theory: Strategies for qualitative research. Routledge, 2017.
- The emotional arcs of stories are dominated by six basic shapes. EPJ Data Science, 5(1):1–12, 2016.
- Tiago P Peixoto. Hierarchical block structures and high-resolution model selection in large networks. Physical Review X, 4(1):011047, 2014.
- Tiago P Peixoto. Nonparametric Bayesian inference of the microcanonical stochastic block model. Physical Review E, 95(1):012317, 2017.
- Principal component analysis. Chemometrics and intelligent laboratory systems, 2(1-3):37–52, 1987.
- Multidimensional scaling. Number 11. Sage, 1978.
- Laurens Van der Maaten and Geoffrey Hinton. Visualizing data using t-SNE. Journal of Machine Learning Research, 9(11), 2008.
- UMAP: Uniform manifold approximation and projection for dimension reduction. arXiv preprint arXiv:1802.03426, 2018.
- Multidimensional scaling. In Handbook of data visualization, pages 315–347. Springer, 2008.
- Curtis Murray. Prostate discourse topic network, 2022. Accessed 18 May 2022.
- Mark S Litwin. A review of the development and validation of the National Institutes of Health Chronic Prostatitis Symptom Index. Urology, 60(6):14–18, 2002.
- Sexual dysfunction and the preservation of manhood: Experiences of men with prostate cancer. Journal of Health Psychology, 7(3):303–316, 2002.
- “Prostate cancer is far more hidden…”: Perceptions of stigma, social isolation and help-seeking among men with prostate cancer. European Journal of Cancer Care, 27(2):e12790, 2018.
- Depression, anxiety, and suicidality in patients with prostate cancer: A systematic review and meta-analysis of observational studies. Prostate cancer and prostatic diseases, 24(2):281–289, 2021.
- Curtis Murray (3 papers)
- Lewis Mitchell (56 papers)
- Jonathan Tuke (15 papers)
- Mark Mackay (3 papers)