Exploring Community-Driven Descriptions for Making Livestreams Accessible (2310.07057v1)
Abstract: People watch livestreams to connect with others and learn about their hobbies. Livestreams feature multiple visual streams including the main video, webcams, on-screen overlays, and chat, all of which are inaccessible to livestream viewers with visual impairments. While prior work explores creating audio descriptions for recorded videos, live videos present new challenges: authoring descriptions in real time, describing domain-specific content, and prioritizing which complex visual information to describe. We explore inviting livestream community members who are domain experts to provide live descriptions. We first conducted a study with 18 sighted livestream community members authoring descriptions for livestreams using three different description methods: live descriptions using text, live descriptions using speech, and asynchronous descriptions using text. We then conducted a study with 9 livestream community members with visual impairments, who shared their current strategies and challenges for watching livestreams and provided feedback on the community-written descriptions. We conclude with implications for improving the accessibility of livestreams.