Memoro: Using Large Language Models to Realize a Concise Interface for Real-Time Memory Augmentation (2403.02135v1)
Abstract: People have to remember an ever-expanding volume of information. Wearables that use information capture and retrieval for memory augmentation can help, but they can be disruptive and cumbersome in real-world tasks, such as in social settings. To address this, we developed Memoro, a wearable audio-based memory assistant with a concise user interface. Memoro uses an LLM to infer the user's memory needs in a conversational context, semantically search memories, and present minimal suggestions. The assistant has two interaction modes: Query Mode for voicing queries and Queryless Mode for on-demand predictive assistance without an explicit query. Our study with N=20 participants engaged in a real-time conversation demonstrated that using Memoro reduced device interaction time and increased recall confidence while preserving conversational quality. We report quantitative results and discuss the preferences and experiences of users. This work contributes towards utilizing LLMs to design wearable memory augmentation systems that are minimally disruptive.