Automatically Detecting Confusion and Conflict During Collaborative Learning Using Linguistic, Prosodic, and Facial Cues (2401.15201v1)
Abstract: During collaborative learning, confusion and conflict emerge naturally. However, persistent confusion or conflict have the potential to generate frustration and significantly impede learners' performance. Early automatic detection of confusion and conflict would allow us to support early interventions which can in turn improve students' experience with and outcomes from collaborative learning. Despite the extensive studies modeling confusion during solo learning, there is a need for further work in collaborative learning. This paper presents a multimodal machine-learning framework that automatically detects confusion and conflict during collaborative learning. We used data from 38 elementary school learners who collaborated on a series of programming tasks in classrooms. We trained deep multimodal learning models to detect confusion and conflict using features that were automatically extracted from learners' collaborative dialogues, including (1) language-derived features including TF-IDF, lexical semantics, and sentiment, (2) audio-derived features including acoustic-prosodic features, and (3) video-derived features including eye gaze, head pose, and facial expressions. Our results show that multimodal models that combine semantics, pitch, and facial expressions detected confusion and conflict with the highest accuracy, outperforming all unimodal models. We also found that prosodic cues are more predictive of conflict, and facial cues are more predictive of confusion. This study contributes to the automated modeling of collaborative learning processes and the development of real-time adaptive support to enhance learners' collaborative learning experience in classroom contexts.
- Emily R Lai. Collaboration: A literature review. Pearson Publisher. Retrieved November, 11:2016, 2011.
- Analyzing student interactions and meaning construction in computer bulletin board discussions. Computers & Education, 42(3):243–265, 2004.
- Social and cognitive factors driving teamwork in collaborative learning environments: Team learning beliefs and behaviors. Small group research, 37(5):490–521, 2006.
- Yuan-Hsuan Lee. Facilitating critical thinking using the c-qrac collaboration script: Enhancing science reading literacy in a computer-supported collaborative learning environment. Computers & Education, 88:182–191, 2015.
- Collaborative learning and critical thinking: Testing the link. The Journal of Higher Education, 88(5):726–753, 2017.
- Comparison between individual and collaborative learning: Determining a strategy for promoting social skills and self-esteem among undergraduate students. Journal of Educational Research, 15(2):35, 2012.
- Effectiveness of peer collaborative learning strategy on self-esteem of pupils with behaviour problems in nsukka education authority. Journal of Critical Reviews, 8(1):1055–1069, 2021.
- Social effects of collaborative learning in primary schools. Learning and instruction, 20(3):177–191, 2010.
- Benefits of collaborative learning. Procedia-social and behavioral sciences, 31:486–490, 2012.
- Amy Soller. Supporting social interaction in an intelligent collaborative learning system. International journal of artificial intelligence in education, 12(1):40–62, 2001.
- Facilitating peer knowledge modeling: Effects of a knowledge awareness tool on collaborative learning outcomes and processes. Computers in human behavior, 27(3):1059–1067, 2011.
- Van Dat Tran. Theoretical perspectives underlying the application of cooperative learning in classrooms. International Journal of Higher Education, 2(4):101–115, 2013.
- Knowledge co-construction activities and task-related monitoring in scripted collaborative learning. Learning, culture and social interaction, 21:234–249, 2019.
- Group regulation and social-emotional interactions observed in computer supported collaborative learning: Comparison between good vs. poor collaborators. Computers & Education, 78:185–200, 2014.
- Confusion and complex learning during interactions with computer learning environments. The Internet and Higher Education, 15(3):184–194, 2012.
- The influence of team emotional intelligence climate on conflict and team members’ reactions to conflict. Small Group Research, 39(2):121–149, 2008.
- Confusion can be beneficial for learning. Learning and Instruction, 29:153–170, 2014.
- Collaboration, intragroup conflict, and social skills in project-based learning. Instructional science, 43(5):561–590, 2015.
- Emotions during the learning of difficult material. In Psychology of learning and motivation, volume 57, pages 183–225. Elsevier, 2012.
- Understanding difficulties and resulting confusion in learning: an integrative review. In Frontiers in Education, volume 3, page 49. Frontiers Media SA, 2018.
- Socio-emotional conflict in collaborative learning—a process-oriented case study in a higher education context. International Journal of Educational Research, 68:1–14, 2014.
- Collaborative dialogue and types of conflict: An analysis of pair programming interactions between upper elementary students. In Proceedings of the 52nd ACM Technical Symposium on Computer Science Education, pages 1184–1190, 2021.
- Diana Laurillard. The pedagogical challenges to collaborative technologies. International Journal of Computer-supported collaborative learning, 4(1):5–20, 2009.
- Adaptive and intelligent systems for collaborative learning support: A review of the field. IEEE transactions on Learning Technologies, 4(1):5–20, 2011.
- Teaching computer architecture using a collaborative approach: the siena tool tutorial sessions and problem solving. learning, 2:10, 2013.
- Adaptive intelligent support to improve peer tutoring in algebra. International Journal of Artificial Intelligence in Education, 24(1):33–61, 2014.
- Automatic detection of learner’s affect from conversational cues. User modeling and user-adapted interaction, 18(1):45–80, 2008.
- Affective learning: improving engagement and enhancing learning with affect-aware feedback. User Modeling and User-Adapted Interaction, 27(1):119–158, 2017.
- Detecting and understanding the impact of cognitive and interpersonal conflict in computer supported collaborative learning environments. International Working Group on Educational Data Mining, 2009.
- Detecting conflicts in collaborative learning through the valence change of atomic interactions. Expert Systems with Applications, 183:115291, 2021.
- Multimodal, multiparty modeling of collaborative problem solving performance. In Proceedings of the 2020 International Conference on Multimodal Interaction, pages 423–432, 2020.
- Gesture and gaze: Multimodal data in dyadic interactions. International handbook of computer-supported collaborative learning, pages 625–641, 2021.
- Alejandro Andrade. Understanding student learning trajectories using multimodal learning analytics within an embodied-interaction learning environment. In Proceedings of the seventh international learning analytics & knowledge conference, pages 70–79, 2017.
- Supervised machine learning in multimodal learning analytics for estimating success in project-based learning. Journal of Computer Assisted Learning, 34(4):366–377, 2018.
- Predicting learners’ effortful behaviour in adaptive assessment using multimodal data. In Proceedings of the tenth international conference on learning analytics & knowledge, pages 480–489, 2020.
- From word embeddings to document distances. In International conference on machine learning, pages 957–966. PMLR, 2015.
- Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692, 2019.
- A method to analyze computer science students’ teamwork in online collaborative learning environments. ACM Transactions on Computing Education (TOCE), 16(2):1–28, 2016.
- Opensmile: the munich versatile and fast open-source audio feature extractor. In Proceedings of the 2010 International Conference on Multimedia, pages 1459–1462, 2010.
- Openface 2.0: Facial behavior analysis toolkit. In Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition, pages 59–66. IEEE, 2018.
- Affect and learning: an exploratory look into the role of affect in learning with autotutor. Journal of educational media, 29(3):241–250, 2004.
- Dynamics of affective states during complex learning. Learning and Instruction, 22(2):145–157, 2012.
- Why do only some events cause learning during human tutoring? Cognition and Instruction, 21(3):209–249, 2003.
- Better to be frustrated than bored: The incidence, persistence, and impact of learners’ cognitive–affective states during interactions with three different computer-based learning environments. International Journal of Human-Computer Studies, 68(4):223–241, 2010.
- The affective experience of novice computer programmers. International journal of artificial intelligence in education, 27(1):181–206, 2017.
- Affect and engagement in game-basedlearning environments. IEEE transactions on affective computing, 5(1):45–56, 2013.
- Confused, now what? a cognitive-emotional strategy training (cest) intervention for elementary students during mathematics problem solving. Contemporary Educational Psychology, 62:101879, 2020.
- Conflict resolution in student teams: an exploration in the context of design education. In Collaboration and Student Engagement in Design Education, pages 105–124. IGI Global, 2017.
- Collaborative reasoning: Expanding ways for children to talk and think in school. Educational Psychology Review, 15(2):181–198, 2003.
- Intergroup emotions in workgroups: Some emotional antecedents and consequences of belonging. In Affect and groups. Emerald Group Publishing Limited, 2007.
- Identifying productive conflict during upper elementary students’ collaborative programming. In Proceedings of the 14th International Conference on Computer-Supported Collaborative Learning-CSCL 2021. International Society of the Learning Sciences, 2021.
- Emotion regulation as a boundary condition of the relationship between team conflict and performance: A multi-level examination. Journal of Organizational Behavior, 34(5):714–734, 2013.
- Task versus relationship conflict, team performance, and team member satisfaction: a meta-analysis. Journal of applied Psychology, 88(4):741, 2003.
- MP Li and Bick Har Lam. Cooperative learning. The Hong Kong Institute of Education, 1:33, 2013.
- Guided, cooperative learning and individual knowledge acquisition. In Knowing, learning, and instruction, pages 393–451. Routledge, 2018.
- Modeling multiple temporal scales of full-body movements for emotion classification. IEEE Transactions on Affective Computing, 2021.
- Eeg-based emotion recognition via neural architecture search. IEEE Transactions on Affective Computing, 2021.
- Emotions matter: Towards personalizing human-system interactions using a two-layer multimodal approach. In Proceedings of the 2022 International Conference on Multimodal Interaction, pages 63–72, 2022.
- A systematic review on affective computing: Emotion models, databases, and recent advances. Information Fusion, 83:19–52, 2022.
- Multimodal affective states recognition based on multiscale cnns and biologically inspired decision fusion model. IEEE Transactions on Affective Computing, 2021.
- Attention is all you need. Advances in neural information processing systems, 30, 2017.
- Facial expression recognition with visual transformers and attentional selective fusion. IEEE Transactions on Affective Computing, 2021a.
- Affect detection: An interdisciplinary review of models, methods, and their applications. IEEE Transactions on affective computing, 1(1):18–37, 2010.
- A review of recent advances in learner and skill modeling in intelligent learning environments. User Modeling and User-Adapted Interaction, 22(1):9–38, 2012.
- Trends in the use of affective computing in e-learning environments. Education and Information Technologies, pages 1–23, 2022.
- Toward an affect-sensitive autotutor. IEEE Intelligent Systems, 22(4):53–61, 2007.
- Managing learner’s affective states in intelligent tutoring systems. In Advances in intelligent tutoring systems, pages 339–358. Springer, 2010.
- Javatutor: an intelligent tutoring system that adapts to cognitive and affective states during computer programming. In Proceedings of the 46th acm technical symposium on computer science education, pages 599–599, 2015.
- Generalizing models of student affect in game-based learning environments. In International Conference on Affective Computing and Intelligent Interaction, pages 588–597. Springer, 2011.
- Automatic detection of learning-centered affective states in the wild. In Proceedings of the 20th international conference on intelligent user interfaces, pages 379–388, 2015.
- Foundations of game-based learning. Educational psychologist, 50(4):258–283, 2015.
- Challenging games help students learn: An empirical study on engagement, flow and immersion in game-based learning. Computers in human behavior, 54:170–179, 2016.
- Student emotion, co-occurrence, and dropout in a mooc context. International Educational Data Mining Society, 2016.
- Beyond positive and negative emotions: Looking into the role of achievement emotions in discussion forums of moocs. The Internet and Higher Education, 43:100690, 2019.
- Re-defining, analyzing and predicting persistence using student events in online learning. Applied Sciences, 10(5):1722, 2020.
- Automated detection of emotional and cognitive engagement in mooc discussions to predict learning achievement. Computers & Education, 181:104461, 2022.
- Multimethod assessment of affective experience and expression during deep learning. International Journal of Learning Technology, 4(3-4):165–187, 2009.
- Predicting facial indicators of confusion with hidden markov models. In International Conference on Affective computing and intelligent interaction, pages 97–106. Springer, 2011.
- Automatically recognizing facial expression: Predicting engagement and frustration. In Educational data mining 2013, 2013.
- Identification of action units related to affective states in a tutoring system for mathematics. Journal of Educational Technology & Society, 19(2):77–86, 2016.
- Detecting impasse during collaborative problem solving with multimodal learning analytics. In LAK22: 12th International Learning Analytics and Knowledge Conference, pages 45–55, 2022.
- Emotion sensors go to school. In Artificial intelligence in education, pages 17–24. Ios Press, 2009.
- Beyond facial expressions: Learning human emotion from body gestures. In BMVC, pages 1–10, 2007.
- Body movements for affective expression: A survey of automatic recognition and generation. IEEE Transactions on Affective Computing, 4(4):341–359, 2013.
- Prompting collaborative and exploratory discourse: An epistemic network analysis study. International Journal of Computer-Supported Collaborative Learning, 16(3):339–366, 2021.
- “i remember how to do it”: exploring upper elementary students’ collaborative regulation while pair programming using epistemic network analysis. Computer Science Education, pages 1–29, 2022.
- A review on data fusion in multimodal learning analytics and educational data mining. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 12(4):e1458, 2022.
- Tensor fusion network for multimodal sentiment analysis. arXiv preprint arXiv:1707.07250, 2017.
- Fusatnet: Dual attention based spectrospatial multimodal fusion network for hyperspectral and lidar classification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pages 92–93, 2020.
- In support of pair programming in the introductory computer science course. Computer Science Education, 12(3):197–212, 2002.
- The importance of producing shared code through pair programming. In Proceedings of the 49th ACM Technical Symposium on Computer Science Education, pages 765–770, 2018a.
- A visual programming environment for learning distributed programming. In Proceedings of the 2017 ACM SIGCSE technical symposium on computer science education, pages 81–86, 2017.
- Using dialogue features to predict trouble during collaborative learning. User Modeling and User-Adapted Interaction, 15:85–134, 2005.
- Expressing and addressing uncertainty: A study of collaborative problem-solving dialogues. Philadelphia, PA: International Society of the Learning Sciences., 2017.
- Neil Mercer. Words and Minds: How We Use Language to Think Together. Routledge, 2002.
- Two-computer pair programming: Exploring a feedback intervention to improve collaborative talk in elementary students. Computer Science Education, 32(1):3–29, 2022.
- The measurement of observer agreement for categorical data. Biometrics, pages 159–174, 1977.
- The challenge of noisy classrooms: Speaker detection during elementary students’ collaborative dialogue. In International Conference on Artificial Intelligence in Education, pages 268–281. Springer, 2021b.
- BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 4171–4186, Minneapolis, Minnesota, June 2019. Association for Computational Linguistics. doi: 10.18653/v1/N19-1423. URL https://aclanthology.org/N19-1423.
- Tweeteval: Unified benchmark and comparative evaluation for tweet classification. arXiv preprint arXiv:2010.12421, 2020.
- Acoustic-prosodic entrainment and rapport in collaborative learning dialogues. In Proceedings of the 2014 ACM workshop on Multimodal Learning Analytics Workshop and Grand Challenge, pages 5–12, 2014.
- The geneva minimalistic acoustic parameter set (gemaps) for voice research and affective computing. IEEE Transactions on Affective Computing, 7(2):190–202, 2015.
- Frustration recognition from speech during game interaction using wide residual networks. Virtual Reality & Intelligent Hardware, 3(1):76–86, 2021.
- Shemo: a large-scale validated database for persian speech emotion detection. Language Resources and Evaluation, 53:1–16, 2019.
- Recent developments in opensmile, the munich open-source multimedia feature extractor. In Proceedings of the 21st ACM international conference on Multimedia, pages 835–838, 2013.
- Leveraging mobile eye-trackers to capture joint visual attention in co-located collaborative learning groups. International Journal of Computer-Supported Collaborative Learning, 13(3):241–261, 2018.
- Identifying collaborative learning states using unsupervised machine learning on eye-tracking, physiological and motion sensor data. Proceedings of The 12th International Conference on Educational Data Mining, pages 318–323, 2019.
- Predicting student performance based on eye gaze during collaborative problem solving. In Proceedings of the Group Interaction Frontiers in Technology, GIFT’18, New York, NY, USA, 2018b. Association for Computing Machinery. ISBN 9781450360777.
- Predicting learning by analyzing eye-gaze data of reading behavior. Proceedings of the 11th International Conference on Educational Data Mining, pages 455–461, 2018.
- Modelling collaborative problem-solving competence with transparent learning analytics: is video data enough? In Proceedings of the 10th International Conference on Learning Analytics and Knowledge, pages 270–275, 2020.
- Going beyond what is visible: What multichannel data can reveal about interaction in the context of collaborative learning? Computers in Human Behavior, 96:235–245, 2019.
- Relationships between body postures and collaborative learning states in an augmented reality study. In Proceedings of the 21st International Conference on Artificial Intelligence in Education, pages 257–262. Springer, 2020.
- Automatically predicting peer satisfaction during collaborative learning with linguistic, acoustic, and visual features. Journal of Educational Data Mining, 15(2):86–122, 2023.
- Emonets: Multimodal deep learning approaches for emotion recognition in video. Journal on Multimodal User Interfaces, 10:99–111, 2016.
- A transformer-based framework for multivariate time series representation learning. In Proceedings of the 27th ACM SIGKDD conference on knowledge discovery & data mining, pages 2114–2124, 2021.
- pyts: A python package for time series classification. The Journal of Machine Learning Research, 21(1):1720–1725, 2020.
- Informer: Beyond efficient transformer for long sequence time-series forecasting. In Proceedings of the AAAI conference on artificial intelligence, volume 35, pages 11106–11115, 2021.
- Are transformers effective for time series forecasting? In Proceedings of the AAAI conference on artificial intelligence, volume 37, pages 11121–11128, 2023.
- Longformer: The long-document transformer. arXiv preprint arXiv:2004.05150, 2020.
- Smote: synthetic minority over-sampling technique. Journal of artificial intelligence research, 16:321–357, 2002.
- Kernel-based smote for svm classification of imbalanced datasets. In IECON 2015-41st Annual Conference of the IEEE Industrial Electronics Society, pages 001127–001132. IEEE, 2015.
- Deep neural networks for human activity recognition with wearable sensors: Leave-one-subject-out cross-validation for model selection. Ieee Access, 8:133982–133994, 2020.
- Subject cross validation in human activity recognition. arXiv preprint arXiv:1904.02666, 2019.
- Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
- Multimodal modeling of collaborative problem-solving facets in triads. User Modeling and User-Adapted Interaction, pages 1–39, 2021.
- Advances in multi-sensor data fusion: Algorithms and applications. Sensors, 9(10):7771–7784, 2009.
- Multimodal transformer for unaligned multimodal language sequences. In Proceedings of the conference. Association for Computational Linguistics. Meeting, volume 2019, page 6558. NIH Public Access, 2019.
- Detecting disruptive talk in student chat-based discussion within collaborative game-based learning environments. In LAK21: 11th International Learning Analytics and Knowledge Conference, pages 405–415, 2021.