CodeAid: Evaluating a Classroom Deployment of an LLM-based Programming Assistant that Balances Student and Educator Needs (2401.11314v2)

Published 20 Jan 2024 in cs.HC and cs.AI

Abstract: Timely, personalized feedback is essential for students learning programming. LLM-powered tools like ChatGPT offer instant support, but reveal direct answers with code, which may hinder deep conceptual engagement. We developed CodeAid, an LLM-powered programming assistant delivering helpful, technically correct responses, without revealing code solutions. CodeAid answers conceptual questions, generates pseudo-code with line-by-line explanations, and annotates students' incorrect code with fix suggestions. We deployed CodeAid in a programming class of 700 students for a 12-week semester. A thematic analysis of 8,000 usages of CodeAid was performed, further enriched by weekly surveys and 22 student interviews. We then interviewed eight programming educators to gain further insights. Our findings reveal four design considerations for future educational AI assistants: D1) exploiting AI's unique benefits; D2) simplifying query formulation while promoting cognitive engagement; D3) avoiding direct responses while encouraging motivated learning; and D4) maintaining transparency and control for students to assess and steer AI responses.

Evaluation and Deployment of CodeAid: An LLM-Based Programming Assistant

The paper "CodeAid: Evaluating a Classroom Deployment of an LLM-based Programming Assistant that Balances Student and Educator Needs" presents a detailed examination of CodeAid—a programming assistant powered by LLMs. The research outlines both the pedagogical approach and the semester-long deployment of this AI tool in a large university-level programming course, targeting its efficacy in providing technical support while mitigating academic integrity concerns.

CodeAid was introduced to address the growing demand for scalable educational solutions that provide timely and personalized feedback to students learning programming, particularly in C. While LLMs, including prominent tools like ChatGPT, have demonstrated potential in educational domains, directly providing code solutions raises concerns about academic integrity and cognitive engagement. CodeAid differentiates itself by structuring its interactions to offer conceptual insights, pseudo-code explanations, and fix suggestions without revealing direct code solutions. The tool serves as a supplementary resource that promotes motivated learning and cognitive engagement among students and that educators can integrate into broader teaching frameworks.

Core Features and Deployment Insights

CodeAid was deployed in a second-year Systems Programming course at the University of Toronto involving approximately 700 students. Throughout a 12-week semester, students could interact with five primary features: asking general programming questions, asking questions about specific code, requesting code explanations, getting help fixing incorrect code, and seeking guidance on writing code. These features were designed through iterative prompt engineering to ensure responses were both technically accurate and educationally valuable, without bypassing essential learning processes.
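
The paper's actual prompts are not reproduced here, so the following is only a minimal sketch of the kind of guardrailed prompting described above, assuming the OpenAI chat completions API; the system prompt wording, the `help_write_code` helper, and the model name are illustrative assumptions rather than CodeAid's implementation.

```python
# Minimal sketch of a guardrailed "help write code" request.
# Assumptions: OpenAI chat completions API, illustrative prompt text and model name.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

SYSTEM_PROMPT = (
    "You are a teaching assistant for a C programming course. "
    "Never provide compilable C code. Respond with numbered pseudo-code steps, "
    "each followed by a one-sentence explanation of the underlying concept."
)

def help_write_code(task_description: str) -> str:
    """Ask the model for pseudo-code guidance on a student's task."""
    response = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[
            {"role": "system", "content": SYSTEM_PROMPT},
            {"role": "user", "content": task_description},
        ],
        temperature=0.2,  # keep answers focused and reproducible
    )
    return response.choices[0].message.content

if __name__ == "__main__":
    print(help_write_code("Read integers from a file and print their average."))
```

Constraining the system prompt to pseudo-code plus explanations mirrors the paper's stated goal: responses that remain helpful and technically correct without handing students a complete solution.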

The thematic analysis of over 8,000 student interactions provided insights into usage patterns and helped refine functionality mid-course. Notably, the General Question feature was the most utilized, highlighting students' frequent need for conceptual clarifications. Feedback-driven improvements included real-time streaming of answers for faster response times, the introduction of interactive pseudo-code to visualize concepts without coding directly, and integrating static function documentation to supplement learning contextually.
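
The streaming improvement is reported only qualitatively in the paper; as a rough illustration, a minimal sketch of token-by-token streaming (again assuming the OpenAI chat completions API and an illustrative model name) might look like this:

```python
# Minimal sketch of streaming an answer as it is generated.
# Assumptions: OpenAI chat completions API, illustrative model name and question.
from openai import OpenAI

client = OpenAI()

def stream_answer(question: str) -> None:
    """Print the model's answer incrementally instead of waiting for the full reply."""
    stream = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": question}],
        stream=True,  # yields incremental chunks instead of one final message
    )
    for chunk in stream:
        delta = chunk.choices[0].delta.content
        if delta:
            print(delta, end="", flush=True)
    print()

if __name__ == "__main__":
    stream_answer("What does the 'static' keyword mean in C?")
```

Printing chunks as they arrive reduces perceived latency for students even though the total generation time is unchanged.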

Implications and Future Directions

The insights derived from CodeAid's deployment pinpoint critical considerations for the design and development of future educational AI tools. Among them is the necessary balance between providing direct solutions and promoting cognitive engagement—a challenge that requires careful handling through innovative interaction designs and pedagogical alignment. Furthermore, giving educators control over the assistant’s design, scope, and included functionalities would ensure these tools better align with their curricula and help gauge student engagement.

This work also opens pathways for future research to assess the longitudinal impacts of deploying such AI-assisted learning technologies across varied educational contexts, focusing on how they influence student competency, autonomy in learning, and critical thinking skills.

Conclusion

This paper advances the understanding of LLM-powered tools in educational settings, offering a robust methodology for evaluating their deployment and reception in large-scale educational environments. CodeAid signifies an important step toward integrating AI tools in programming education, ensuring both student and educator needs are met. This balance is crucial to harness the potential of AI while maintaining educational integrity and fostering genuine learning experiences. As AI continues to evolve, such research will play a pivotal role in shaping its effective and ethical use in education.

Authors (7)
  1. Majeed Kazemitabaar
  2. Runlong Ye
  3. Xiaoning Wang
  4. Austin Z. Henley
  5. Paul Denny
  6. Michelle Craig
  7. Tovi Grossman