Papers
Topics
Authors
Recent
Search
2000 character limit reached

A review on the use of large language models as virtual tutors

Published 20 May 2024 in cs.CL and cs.AI | (2405.11983v2)

Abstract: Transformer architectures contribute to managing long-term dependencies for Natural Language Processing, representing one of the most recent changes in the field. These architectures are the basis of the innovative, cutting-edge LLMs that have produced a huge buzz in several fields and industrial sectors, among the ones education stands out. Accordingly, these generative Artificial Intelligence-based solutions have directed the change in techniques and the evolution in educational methods and contents, along with network infrastructure, towards high-quality learning. Given the popularity of LLMs, this review seeks to provide a comprehensive overview of those solutions designed specifically to generate and evaluate educational materials and which involve students and teachers in their design or experimental plan. To the best of our knowledge, this is the first review of educational applications (e.g., student assessment) of LLMs. As expected, the most common role of these systems is as virtual tutors for automatic question generation. Moreover, the most popular models are GTP-3 and BERT. However, due to the continuous launch of new generative models, new works are expected to be published shortly.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (19)
  1. \bibcommenthead
  2. Alshater M (2022) Exploring the Role of Artificial Intelligence in Enhancing Academic Performance: A Case Study of ChatGPT. SSRN Electronic Journal pp 1–22. 10.2139/ssrn.4312358
  3. Baidoo-Anu D, Ansah LO (2023) Education in the Era of Generative Artificial Intelligence (AI): Understanding the Potential Benefits of ChatGPT in Promoting Teaching and Learning. Journal of AI 7:52–62. 10.61969/jai.1337500
  4. Christ P (2023) Generation of Meaningful SQL-Query Exercises Using Large Language Models and Knowledge Graphs
  5. Cooper G (2023) Examining Science Education in ChatGPT: An Exploratory Study of Generative Artificial Intelligence. Journal of Science Education and Technology 32:444–452. 10.1007/s10956-023-10039-y
  6. Haluza D, Jungwirth D (2023) Artificial Intelligence and Ten Societal Megatrends: An Exploratory Study Using GPT-3. Systems 11:1–18. 10.3390/systems11030120
  7. Hardy M (2021) Toward Educator-focused Automated Scoring Systems for Reading and Writing
  8. Huang X, Qiao C (2022) Enhancing Computational Thinking Skills Through Artificial Intelligence Education at a STEAM High School. Science & Education 33:383–403. 10.1007/s11191-022-00392-6
  9. Humphry T, Fuller AL (2023) Potential ChatGPT Use in Undergraduate Chemistry Laboratories. Journal of Chemical Education pp 1434–1436. 10.1021/acs.jchemed.3c00006
  10. Nasution NEA (2023) Using artificial intelligence to create biology multiple choice questions for higher education. Agricultural and Environmental Education 2:1–11. 10.29333/agrenvedu/13071
  11. O’Connor S, ChatGPT (2023) Open artificial intelligence platforms in nursing education: Tools for academic progress or abuse? Nurse Education in Practice 66:103,537–103,538. 10.1016/j.nepr.2022.103537
  12. Pavlik JV (2023) Collaborating With ChatGPT: Considering the Implications of Generative Artificial Intelligence for Journalism and Media Education. Journalism & Mass Communication Educator 78:84–93. 10.1177/10776958221149577
  13. Perkins M (2023) Academic integrity considerations of AI Large Language Models in the post-pandemic era: ChatGPT and beyond. Journal of University Teaching and Learning Practice 20:2–26. 10.53761/1.20.02.07
  14. Raina V, Gales M (2022) Multiple-Choice Question Generation: Towards an Automated Assessment Framework
  15. Roll I, Wylie R (2016) Evolution and Revolution in Artificial Intelligence in Education. International Journal of Artificial Intelligence in Education 26:582–599. 10.1007/s40593-016-0110-3
  16. Sok S, Heng K (2023) ChatGPT for Education and Research: A Review of Benefits and Risks. SSRN Electronic Journal 3:110–121. 10.2139/ssrn.4378735
  17. Taber KS (2018) The Use of Cronbach’s Alpha When Developing and Reporting Research Instruments in Science Education. Research in Science Education 48:1273–1296. 10.1007/s11165-016-9602-2
  18. Talan T (2021) Artificial Intelligence in Education: A Bibliometric Study. International Journal of Research in Education and Science 7(3):822–837. 10.46328/ijres.2409
  19. Tobore TO (2019) On Energy Efficiency and the Brain’s Resistance to Change: The Neurological Evolution of Dogmatism and Close-Mindedness. Psychological Reports 122:2406–2416. 10.1177/0033294118792670
Citations (6)

Summary

  • The paper reviews LLMs as virtual tutors, highlighting their roles in question generation, answer grading, and explanation with insights drawn from 29 selected records.
  • It details a systematic review methodology using targeted Google Scholar queries post-2020, emphasizing rigorous screening and performance evaluations.
  • The study discusses practical challenges such as ethical concerns, fairness, and integration issues, urging further research with emerging models like GPT-4.

Review of LLMs as Virtual Tutors

Introduction

LLMs, grounded in Transformer architectures, have become increasingly significant within the field of NLP. These models, notably including GPT-3 and BERT, have permeated various sectors, prominently education, where they serve as virtual tutors. This review paper provides a comprehensive overview of LLMs tailored for educational purposes, focusing on their role in generating and evaluating educational materials, and highlights their involvement in both student and teacher design or experimental plans.

Methodology

The authors conducted a systematic review via Google Scholar using specific search queries to identify scholarly works related to the educational application of LLMs. The search queries focused on documents published since 2020, emphasizing works that explicitly employed LLMs for educational purposes, excluding those merely adaptable to educational contexts. Figure 1

Figure 1: Review pipeline.

A comprehensive screening process yielded records based on English language and applicability within the educational domain, culminating in 29 records applicable to this review.

Applications of LLMs in Education

Virtual Tutoring Applications

LLMs have found success as virtual tutors, predominantly utilizing models like GPT-3 and BERT, for various tasks including automatic question generation, answer grading, explanation generation, and text summarization. Figure 2

Figure 2: Distribution of the LLMs in the records selected.

These models offer functionalities such as:

  • Question Generation: Solutions leveraging GPT-3 and T5 have shown a notable percentage of generated questions being deemed useful, with manual assessments corroborating these findings.
  • Answer Grading: BERT's employment and modifications for answer evaluations demonstrate an adequate Pearson correlation coefficient, signifying its efficacy in grading student responses accurately.
  • Explanations: The potential of GPT-3.5 has been explored for generating qualitative conclusion statements in educational settings, underscoring features like readability and orthographic correctness.

Other Educational Applications

Beyond tutoring, LLMs have been employed for tasks like code explanation and correction. For instance, models like Codex have demonstrated capability in addressing syntax errors and offering explanations in programming assignments. Additionally, virtual assistants powered by variations of BERT and GPT models have shown effectiveness in aiding both academic and administrative tasks. Figure 3

Figure 3: Distribution of the tasks in the records selected.

Discussion and Analysis

The widespread application of LLMs in education showcases their adaptability and potential in transforming traditional educational methodologies. However, challenges persist regarding their integration and the requirement for transparency, especially in scenarios where difficulty adjustment and fairness are pivotal. Despite the advantages, the ethical considerations, and risks such as academic integrity and the models' opaqueness, highlight the need for rigorous scrutiny.

Conclusion

LLMs are undeniably at the forefront of revolutionizing educational practices. Their generative capabilities allow for enhanced pedagogical approaches and individualized assessments. This review serves as a foundational exploration into the myriad applications of LLMs within education and anticipates further developments with the advent of newer models like GPT-4. Future research should aim to address ethical implications and practical integration into educational ecosystems, ensuring these technologies are harnessed effectively and responsibly.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 1 tweet with 1 like about this paper.