Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
125 tokens/sec
GPT-4o
47 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Toolbox for Modelling Engagement with Educational Videos (2401.05424v1)

Published 30 Dec 2023 in cs.CY, cs.IR, cs.LG, and stat.AP

Abstract: With the advancement and utility of AI, personalising education to a global population could be a cornerstone of new educational systems in the future. This work presents the PEEKC dataset and the TrueLearn Python library, which contains a dataset and a series of online learner state models that are essential to facilitate research on learner engagement modelling.TrueLearn family of models was designed following the "open learner" concept, using humanly-intuitive user representations. This family of scalable, online models also help end-users visualise the learner models, which may in the future facilitate user interaction with their models/recommenders. The extensive documentation and coding examples make the library highly accessible to both machine learning developers and educational data mining and learning analytics practitioners. The experiments show the utility of both the dataset and the library with predictive performance significantly exceeding comparative baseline models. The dataset contains a large amount of AI-related educational videos, which are of interest for building and validating AI-specific educational recommenders.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (49)
  1. Online Courses Recommendation based on LDA. In Symposium on Information Management and Big Data.
  2. Model-Based Machine Learning. Early access version (http://www.mbmlbook.com/). Accessed: 2019-05-23.
  3. Bloom, B. S. 1984. The 2 Sigma Problem: The Search for Methods of Group Instruction as Effective as One-to-One Tutoring. Educational Researcher, 13(6): 4–16.
  4. Annotating Documents with Relevant Wikipedia Concepts. In Proc. of Slovenian KDD Conf. on Data Mining and Data Warehouses (SiKDD).
  5. The anatomy of a large-scale hypertextual Web search engine. In Proc. of Int. Conf. on World Wide Web.
  6. API design for machine learning software: experiences from the scikit-learn project. CoRR, abs/1309.0238.
  7. What’s in It for Me? Augmenting Recommended Learning Resources with Navigable Annotations. In Proceedings of the 25th International Conference on Intelligent User Interfaces Companion, IUI 20.
  8. Scalable Educational Question Generation with Pre-trained Language Models. In International Conference on Artificial Intelligence in Education, 327–339. Springer.
  9. Could AI Democratise Education? Socio-Technical Imaginaries of an EdTech Revolution. In In Proc. of the NeurIPS Workshop on Machine Learning for the Developing World (ML4D).
  10. TrueLearn: A Family of Bayesian Algorithms to Match Lifelong Learners to Open Educational Resources. In AAAI Conference on Artificial Intelligence, AAAI 20.
  11. Power to the Learner: Towards Human-Intuitive and Integrative Recommendations with Open Educational Resources. Sustainability, 14(18).
  12. Which Learning Visualisations to Offer Students? In Pammer-Schindler, V.; Pérez-Sanagustín, M.; Drachsler, H.; Elferink, R.; and Scheffel, M., eds., Lifelong Technology-Enhanced Learning, 524–530. Cham: Springer International Publishing. ISBN 978-3-319-98572-5.
  13. Individual and Peer Comparison Open Learner Model Visualisations to Identify What to Work On Next. In Late-breaking Results, Posters, Demos, Doctoral Consortium and Workshops Proceedings of the 24th ACM Conference on User Modeling, Adaptation and Personalisation (UMAP 2016).
  14. Metacognition and open learner models. In The 3rd workshop on meta-cognition and self-regulated learning in educational technologies, at ITS2008, 7–20.
  15. Open Learner Models, 301–322. Springer Berlin Heidelberg. ISBN 9783642143632.
  16. An Introduction to Bayesian Knowledge Tracing with pyBKT. Psych, 5(3): 770–786.
  17. Temporally grounding natural sentence in video. In Proceedings of the 2018 conference on empirical methods in natural language processing, 162–171.
  18. Ednet: A large-scale hierarchical dataset in education. In International Conference on Artificial Intelligence in Education, 69–73. Springer.
  19. Knowledge tracing: Modeling the acquisition of procedural knowledge. User modeling and user-adapted interaction, 4(4): 253–278.
  20. Deep Neural Networks for YouTube Recommendations. In Proc. of ACM Conf. on Recommender Systems.
  21. ORCAS: 20 Million Clicked Query-Document Pairs for Analyzing Search. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management, CIKM ’20, 2983–2989. New York, NY, USA: Association for Computing Machinery.
  22. Knowledge-aware sequence modelling with deep learning for online course recommendation. Information Processing & Management, 60(4): 103377.
  23. Small Generative Language Models for Educational Question Generation. In In Proc. of the NeurIPS Workshop on Generative AI for Education (GAIED).
  24. Design patterns: elements of reusable object-oriented software. Pearson Deutschland GmbH.
  25. Interactive Visualizations of Transparent User Models for Self-Actualization: A Human-Centered Design Approach. Multimodal Technologies and Interaction, 6(6).
  26. How Video Production Affects Student Engagement: An Empirical Study of MOOC Videos. In Proc. of the First ACM Conf. on Learning @ Scale.
  27. Intervention-BKT: Incorporating Instructional Interventions into Bayesian Knowledge Tracing. In Micarelli, A.; Stamper, J.; and Panourgia, K., eds., Proc. of Int. Conf. on Intelligent Tutoring Systems.
  28. VideoKen: Automatic Video Summarization and Course Curation to Support Learning. In Companion Proceedings of the The Web Conference 2018, WWW ’18, 239–242. Republic and Canton of Geneva, CHE: International World Wide Web Conferences Steering Committee. ISBN 9781450356404.
  29. Martin, R. C. 2000. Design principles and design patterns. Object Mentor, 1(34): 597.
  30. Selecting third-party libraries: the data scientist’s perspective. Empirical Software Engineering, 28(1): 15.
  31. Panichella, A. 2021. A Systematic Comparison of search-Based approaches for LDA hyperparameter tuning. Information and Software Technology, 130: 106411.
  32. The Sum is Greater than the Parts: Ensembling Models of Student Knowledge in Educational Software. SIGKDD Explor. Newsl., 13(2): 37–44.
  33. Designing for Serendipity in a University Course Recommendation System. In Proceedings of the Tenth International Conference on Learning Analytics & Knowledge, LAK ’20, 350–359. New York, NY, USA: Association for Computing Machinery. ISBN 9781450377126.
  34. Scikit-learn: Machine learning in Python. the Journal of machine Learning research, 12: 2825–2830.
  35. X5Learn: A Personalised Learning Companion at the Intersection of AI and HCI. In 26th International Conference on Intelligent User Interfaces - Companion, IUI ’21 Companion, 70–74. New York, NY, USA: Association for Computing Machinery. ISBN 9781450380188.
  36. An empirical study of API usability. In 2013 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement, 5–14. IEEE.
  37. Deep Knowledge Tracing. In Advances in Neural Information Processing Systems.
  38. Rasch, G. 1960. Probabilistic Models for Some Intelligence and Attainment Tests, volume 1.
  39. Efficient text proximity search. In International Symposium on String Processing and Information Retrieval, 287–299. Springer.
  40. Assessing the Performance of Online Students - New Data, New Approaches, Improved Accuracy. Journal of Educational Data Mining, 14(1): 1–45.
  41. ASSISTments Dataset from Multiple Randomized Controlled Experiments. In Proc. of the Third (2016) ACM Conf. on Learning @ Scale, L@S ’16. New York, NY, USA: Association for Computing Machinery. ISBN 9781450337267.
  42. Style Guide for Python Code. PEP 8.
  43. Non-Linear Consumption of Videos Using a Sequence of Personalized Multimodal Fragments. In 26th International Conference on Intelligent User Interfaces, IUI ’21, 249–259. New York, NY, USA: Association for Computing Machinery. ISBN 9781450380171.
  44. Diagnostic Questions: The NeurIPS 2020 Education Challenge.
  45. Educational Question Mining At Scale: Prediction, Analysis and Personalization. In Symposium on Educational Advances in Artificial Intelligence (AAAI-EAAI).
  46. Beyond Views: Measuring and Predicting Engagement in Online Videos. In Proc. of the Twelfth Int. Conf. on Web and Social Media.
  47. MOOCCube: a large-scale data repository for NLP applications in MOOCs. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 3135–3142.
  48. Spotify at the TREC 2020 Podcasts Track: Segment Retrieval. In Proceedings of the Twenty-Ninth Text REtrieval Conference (TREC 2020).
  49. Extracting, mining and predicting users’ interests from social media. Hanover, MD: Now Foundations and Trends.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets