Critical-Reflective Human-AI Collaboration: Exploring Computational Tools for Art Historical Image Retrieval (2306.12843v1)
Abstract: Just as other disciplines, the humanities explore how computational research approaches and tools can meaningfully contribute to scholarly knowledge production. We approach the design of computational tools through the analytical lens of 'human-AI collaboration.' However, there is no generalizable concept of what constitutes 'meaningful' human-AI collaboration. In terms of genuinely human competencies, we consider criticality and reflection as guiding principles of scholarly knowledge production. Although (designing for) reflection is a recurring topic in CSCW and HCI discourses, it has not been centered in work on human-AI collaboration. We posit that integrating both concepts is a viable approach to supporting 'meaningful' human-AI collaboration in the humanities. Our research, thus, is guided by the question of how critical reflection can be enabled in human-AI collaboration. We address this question with a use case that centers on computer vision (CV) tools for art historical image retrieval. Specifically, we conducted a qualitative interview study with art historians and extended the interviews with a think-aloud software exploration. We observed and recorded our participants' interaction with a ready-to-use CV tool in a possible research scenario. We found that critical reflection, indeed, constitutes a core prerequisite for 'meaningful' human-AI collaboration in humanities research contexts. However, we observed that critical reflection was not fully realized during interaction with the CV tool. We interpret this divergence as supporting our hypothesis that computational tools need to be intentionally designed in such a way that they actively scaffold and support critical reflection during interaction. Based on our findings, we suggest four empirically grounded design implications for 'critical-reflective human-AI collaboration'.
- Philip E. Agre. 1997. Toward a critical technical practice: Lessons learned in trying to reform AI. (1997).
- Rafael C Alvarado. 2019. Digital Humanities and the Great Project. Why We Should Operationalize Everything–and Study Those Who Are Doing So Now. Debates in the Digital Humanities (2019), 75–82.
- Guidelines for Human-AI Interaction. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (Glasgow, Scotland Uk) (CHI ’19). Association for Computing Machinery, New York, NY, USA, 1–13. https://doi.org/10.1145/3290605.3300233
- Critical Digital Humanities and Machine-Learning. In DH. Alliance of Digital Humanities Organizations.
- Eric PS Baumer. 2015. Reflective informatics: conceptual dimensions for designing technologies of reflection. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems. 585–594.
- Eric PS Baumer. 2017. Toward human-centered algorithm design:. Big Data & Society 4, 2 (Jul 2017), 205395171771885. https://doi.org/10.1177/2053951717718854
- Reviewing Reflection: On the Use of Reflection in Interactive System Design. In Proceedings of the 2014 Conference on Designing Interactive Systems (Vancouver, BC, Canada) (DIS ’14). Association for Computing Machinery, New York, NY, USA, 93–102. https://doi.org/10.1145/2598510.2598598
- Topicalizer: reframing core concepts in machine learning visualization by co-designing for interpretivist scholarship. Human–Computer Interaction 35, 5–6 (Apr 2020), 452–480. https://doi.org/10.1080/07370024.2020.1734460
- Explanation Strategies as an Empirical-Analytical Lens for Socio-Technical Contextualization of Machine Learning Interpretability. Proc. ACM Hum.-Comput. Interact. 6, GROUP, Article 39 (jan 2022), 25 pages. https://doi.org/10.1145/3492858
- Virginia Braun and Victoria Clarke. 2006. Using thematic analysis in psychology. Qualitative Research in Psychology 3, 2 (Jan 2006), 77–101. https://doi.org/10.1191/1478088706qp063oa
- Virginia Braun and Victoria Clarke. 2016. (Mis)conceptualising themes, thematic analysis, and other problems with Fugard and Potts’ (2015) sample-size tool for thematic analysis. International Journal of Social Research Methodology 19, 6 (2016), 739–743. https://doi.org/10.1080/13645579.2016.1195588 arXiv:https://doi.org/10.1080/13645579.2016.1195588
- Virginia Braun and Victoria Clarke. 2019. Reflecting on reflexive thematic analysis. Qualitative Research in Sport, Exercise and Health 11, 4 (2019), 589–597. https://doi.org/10.1080/2159676X.2019.1628806
- Alexander Brey. [n. d.]. Digital art history in 2021. History Compass n/a, n/a ([n. d.]), 1 – 14. https://doi.org/10.1111/hic3.12678 arXiv:https://onlinelibrary.wiley.com/doi/pdf/10.1111/hic3.12678
- Slow Digital Art History in Action: Project Cornelia’s Computational Approach to Seventeenth-century Flemish Creative Communities. Visual Resources 35, 1-2 (2019), 105–124. https://doi.org/10.1080/01973762.2019.1553444
- To Trust or to Think: Cognitive Forcing Functions Can Reduce Overreliance on AI in AI-Assisted Decision-Making. Proc. ACM Hum.-Comput. Interact. 5, CSCW1, Article 188 (apr 2021), 21 pages. https://doi.org/10.1145/3449287
- Taina Bucher. 2017. The algorithmic imaginary: exploring the ordinary affects of Facebook algorithms. Information, Communication & Society 20, 1 (2017), 30–44. https://doi.org/10.1080/1369118X.2016.1154086 arXiv:https://doi.org/10.1080/1369118X.2016.1154086
- Nicola Carboni and Livio de Luca. 2019. An Ontological Approach to the Description of Visual and Iconographical Representations. Heritage 2, 2 (2019), 1191–1210. https://doi.org/10.3390/heritage2020078
- Using Machine Learning to Support Qualitative Coding in Social Science: Shifting the Focus to Ambiguity. ACM Trans. Interact. Intell. Syst. 8, 2, Article 9 (June 2018), 20 pages. https://doi.org/10.1145/3185515
- Robert G. Chenhall and Peter Homulos. [n. d.]. Propositions for the Future: Museum Data Standards. 30, 3-4 ([n. d.]), 205–212. https://doi.org/10.1111/j.1755-5825.1978.tb02059.x
- Michael Chromik and Andreas Butz. 2021. Human-XAI Interaction: A Review and Design Principles for Explanation User Interfaces. In Human-Computer Interaction – INTERACT 2021 (Lecture Notes in Computer Science), Carmelo Ardito, Rosa Lanzilotti, Alessio Malizia, Helen Petrie, Antonio Piccinno, Giuseppe Desolda, and Kori Inkpen (Eds.). Springer International Publishing, 619–640. https://doi.org/10.1007/978-3-030-85616-8_36
- Leendert D Couprie. 1978. Iconclass, a device for the iconographical analysis of art objects. Museum International 30, 3-4 (1978), 194–198.
- Design Frictions for Mindful Interactions: The Case for Microboundaries. In Proceedings of the 2016 CHI Conference Extended Abstracts on Human Factors in Computing Systems (CHI EA ’16). Association for Computing Machinery, 1389–1397. https://doi.org/10.1145/2851581.2892410
- Nan Z Da. 2019. The computational case against computational literary studies. Critical inquiry 45, 3 (2019), 601–639.
- Anna Näslund Dahlgren and Amanda Wasielewski. 2021. The Digital U-Turn in Art History. Konsthistorisk tidskrift/Journal of Art History 90, 4 (2021), 249–266. https://doi.org/10.1080/00233609.2021.2006774 arXiv:https://doi.org/10.1080/00233609.2021.2006774
- John Dewey. 1997. How we think. Dover Publications.
- James E Dobson. 2019. Critical digital humanities: the search for a methodology. University of Illinois Press.
- Martin Doerr. 2003. The CIDOC Conceptual Reference Module: An Ontological Approach to Semantic Interoperability of Metadata. AI Magazine 24, 3 (Sep. 2003), 75. https://doi.org/10.1609/aimag.v24i3.1720
- Johanna Drucker. 2013. Is There a “Digital” Art History? Visual Resources 29, 1-2 (2013), 5–13. https://doi.org/10.1080/01973762.2013.761106 arXiv:https://doi.org/10.1080/01973762.2013.761106
- Johanna Drucker. 2014. Graphesis: Visual forms of knowledge production. Harvard University Press Cambridge, MA.
- Johanna Drucker and Bethany Nowviskie. 2004. Speculative computing: Aesthetic provocations in humanities computing. A companion to digital humanities (2004), 431–447.
- Catherine D’Ignazio and Lauren F. Klein. 2020. Data Feminism. MIT Press.
- Human-Centered Explainable AI (HCXAI): Beyond Opening the Black-Box of AI. In Extended Abstracts of the 2022 CHI Conference on Human Factors in Computing Systems (New Orleans, LA, USA) (CHI EA ’22). Association for Computing Machinery, New York, NY, USA, Article 109, 7 pages. https://doi.org/10.1145/3491101.3503727
- Peter GB Enser. 1995. Progress in documentation pictorial information retrieval. Journal of documentation (1995).
- Human-AI Collaboration for UX Evaluation: Effects of Explanation and Synchronization. arXiv:2112.12387 (Dec 2021). https://doi.org/10.48550/arXiv.2112.12387
- Umer Farooq and Jonathan Grudin. 2016. Human-computer integration. interactions 23, 6 (Oct 2016), 26–32. https://doi.org/10.1145/3001896
- Always Somewhere, Never There: Using Critical Design to Understand Database Interactions. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (Toronto, Ontario, Canada) (CHI ’14). Association for Computing Machinery, New York, NY, USA, 1941–1950. https://doi.org/10.1145/2556288.2557055
- Jessica L. Feuston and Jed R. Brubaker. 2021. Putting Tools in Their Place: The Role of Time and Perspective in Human-AI Collaboration for Qualitative Analysis. Proc. ACM Hum.-Comput. Interact. 5, CSCW2, Article 469 (Oct. 2021), 25 pages. https://doi.org/10.1145/3479856
- Kath Fisher. 2003. Demystifying Critical Reflection: Defining criteria for assessment. Higher Education Research & Development 22, 3 (2003), 313–325. https://doi.org/10.1080/0729436032000145167
- Vilém Flusser. 2005. Thought and reflection. Flusser Studies 1, 3 (2005).
- Bj Fogg. 2009. A behavior model for persuasive design. In Proceedings of the 4th International Conference on Persuasive Technology - Persuasive ’09. Association for Computing Machinery, Claremont, California, 1. https://doi.org/10.1145/1541948.1541999
- Christopher Frauenberger. 2019. Entanglement HCI The Next Wave? ACM Trans. Comput.-Hum. Interact. 27, 1, Article 2 (nov 2019), 27 pages. https://doi.org/10.1145/3364998
- Datasheets for Datasets. Commun. ACM 64, 12 (nov 2021), 86–92. https://doi.org/10.1145/3458723
- Lars Hallnäs and Johan Redström. 2001. Slow Technology - Designing for Reflection. Personal and Ubiquitous Computing 5, 3 (Jan 2001), 201–212. https://doi.org/10.1007/PL00000019
- The Dataset Nutrition Label - A Framework To Drive Higher Data Quality Standards. CoRR cs.DB (Jan 2018). http://arxiv.org/abs/1805.03677v1
- Scholastic: Graphical Human-AI Collaboration for Inductive and Interpretive Text Analysis. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology (Bend, OR, USA) (UIST ’22). Association for Computing Machinery, New York, NY, USA, Article 30, 12 pages. https://doi.org/10.1145/3526113.3545681
- Eric Horvitz. 1999. Principles of mixed-initiative user interfaces. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. Association for Computing Machinery, 159–166. https://doi.org/10.1145/302979.303030
- Hieke Huistra and Bram Mellink. 2016. Phrasing history: Selecting sources in digital repositories. Historical Methods: A Journal of Quantitative and Interdisciplinary History 49, 4 (2016), 220–229. https://doi.org/10.1080/01615440.2016.1205964
- Leonardo Impett. 2020. Analyzing gesture in digital art history. In The Routledge Companion to Digital Humanities and Art History. Routledge, 386–407.
- When and why defaults influence decisions: a meta-analysis of default effects. Behavioural Public Policy 3, 2 (2019), 159–186. https://doi.org/10.1017/bpp.2018.43
- Anthony Jameson and John Riedl. 2011. Introduction to the Transactions on Interactive Intelligent Systems. ACM Transactions on Interactive Intelligent Systems 1, 1 (Oct 2011), 1:1–1:6. https://doi.org/10.1145/2030365.2030366
- Supporting Serendipity: Opportunities and Challenges for Human-AI Collaboration in Qualitative Analysis. Proc. ACM Hum.-Comput. Interact. 5, CSCW1, Article 94 (April 2021), 23 pages. https://doi.org/10.1145/3449168
- Matthew G Kirschenbaum. 2016. What is digital humanities and what’s it doing in English departments? In Defining Digital Humanities. Routledge, 211–220.
- Lukas Klic. 2023. Linked Open Images: Visual similarity for the Semantic Web. Semantic Web 14 (2023), 197–208.
- Harald Klinke. 2016. Big Image Data within the Big Picture of Art History. International Journal for Digital Art History 2 (Oct. 2016). https://doi.org/10.11588/dah.2016.2.33527
- Toward a model for digital tool criticism: Reflection as integrative practice. 34, 2 ([n. d.]), 368–385. https://doi.org/10.1093/llc/fqy048
- Sabine Lang and Björn Ommer. 2018. Attesting Similarity: Supporting the Organization and Study of Art Image Collections with Computer Vision. Digital Scholarship in the Humanities, Oxford University Press 33 (2018), 845–856.
- Clayton Lewis. 1982. Using the” thinking-aloud” method in cognitive interface design. Number RC9265.
- A stage-based model of personal informatics systems. Association for Computing Machinery. https://doi.org/10.1145/1753326.1753409
- Documenting Computer Vision Datasets: An Invitation to Reflexive Data Practices. In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency (FAccT ’21). Association for Computing Machinery, 161–172. https://doi.org/10.1145/3442188.3445880
- Accept or Address? Researchers’ Perspectives on Response Bias in Accessibility Research. In The 23rd International ACM SIGACCESS Conference on Computers and Accessibility (Virtual Event, USA) (ASSETS ’21). Association for Computing Machinery, New York, NY, USA, Article 20, 13 pages. https://doi.org/10.1145/3441852.3471216
- Luigina Mortari. 2015. Reflectivity in Research Practice: An Overview of Different Perspectives. International Journal of Qualitative Methods 14, 5 (Dec 2015). https://doi.org/10.1177/1609406915618045
- Philipp Müller. 2020. Understanding history: hermeneutics and source criticism in historical scholarship. In Reading Primary Sources. Routledge, 23–40.
- Fabian Offert and Peter Bell. 2023 (forthcoming). imgs.ai. A Deep Visual Search Engine for Digital Art History. International Journal for Digital Art History (2023 (forthcoming)).
- Standardizing Reporting of Participant Compensation in HCI: A Systematic Literature Review and Recommendations for the Field. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (Yokohama, Japan) (CHI ’21). Association for Computing Machinery, New York, NY, USA, Article 141, 16 pages. https://doi.org/10.1145/3411764.3445734
- Katie Rawson and Trevor Muñoz. 2019. Against Cleaning. University of Minnesota Press, 279–292. http://www.jstor.org/stable/10.5749/j.ctvg251hk.26
- Karen Ruhleder. 1995. Reconstructing Artifacts, Reconstructing Work: From Textual Edition to On-Line Databank. Science, Technology, & Human Values 20, 1 (1995), 39–64. https://doi.org/10.1177/016224399502000103 arXiv:https://doi.org/10.1177/016224399502000103
- Do Datasets Have Politics? Disciplinary Values in Computer Vision Dataset Development. Proceedings of the ACM on Human-Computer Interaction 5, CSCW2 (Oct 2021), 317:1–317:37. https://doi.org/10.1145/3476058
- Donald A. Schön. 1983. The reflective practitioner: how professionals think in action. Basic Books.
- Benoit Seguin. 2018. The Replica Project: Building a visual search engine for art historians. XRDS: Crossroads, The ACM Magazine for Students 24, 3 (2018), 24–29.
- Ben Shneiderman and Pattie Maes. 1997. Direct manipulation vs. interface agents. Interactions 4, 6 (Nov 1997), 42–61. https://doi.org/10.1145/267505.267514
- Reflective Practicum: A Framework of Sensitising Concepts to Design for Transformative Reflection. In Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems (Denver, Colorado, USA) (CHI ’17). Association for Computing Machinery, New York, NY, USA, 2696–2707. https://doi.org/10.1145/3025453.3025516
- Linda C. Smith. 1981. Representation Issues in Information Retrieval System Design. SIGIR Forum 16, 1 (may 1981), 100–105. https://doi.org/10.1145/1013228.511770
- IART: A Search Engine for Art-Historical Images to Support Research in the Humanities. In Proceedings of the 29th ACM International Conference on Multimedia (Virtual Event, China) (MM ’21). Association for Computing Machinery, New York, NY, USA, 2801–2803. https://doi.org/10.1145/3474085.3478564
- Loren G. Terveen. 1995. Overview of human-computer collaboration. Knowledge-Based Systems 8, 2 (Apr 1995), 67–81. https://doi.org/10.1016/0950-7051(95)98369-H
- Large-scale interactive retrieval in art collections using multi-style feature aggregation. PLoS ONE 16, 11 (2021).
- Ted Underwood. 2014. Theorizing Research Practices We Forgot to Theorize Twenty Years Ago. Representations 127, 1 (2014), 64–72.
- Tool Criticism: From Digital Methods to Digital Methodology. In Proceedings of the 2nd International Conference on Web Studies (Paris, France) (WS.2 2018). Association for Computing Machinery, New York, NY, USA, 24–27. https://doi.org/10.1145/3240431.3240436
- Peter-Paul Verbeek. 2015. Beyond Interaction: A Short Introduction to Mediation Theory. Interactions 22, 3 (apr 2015), 26–31. https://doi.org/10.1145/2751314
- Elena Villaespesa and Oonagh Murphy. 2021. This is not an apple! Benefits and challenges of applying computer vision to museum collections. Museum Management and Curatorship 36, 4 (2021), 362–383. https://doi.org/10.1080/09647775.2021.1873827
- From Human-Human Collaboration to Human-AI Collaboration: Designing AI Systems That Can Work Together with People. In Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems (Honolulu, HI, USA) (CHI EA ’20). Association for Computing Machinery, New York, NY, USA, 1–6. https://doi.org/10.1145/3334480.3381069
- Claire Waterton. 2010. Experimenting with the Archive: STS-ers As Analysts and Co-constructors of Databases and Other Archival Forms. Science, Technology, & Human Values 35, 5 (2010), 645–676. https://doi.org/10.1177/0162243909340265 arXiv:https://doi.org/10.1177/0162243909340265
- Diane M Zorich. 2013. Digital art history: a community assessment. Visual Resources 29, 1-2 (2013), 14–21.
- Katrin Glinka (5 papers)
- Claudia Müller-Birn (16 papers)