Determine GPT-4o performance on complex handwritten math and visual statistics tasks
Determine how OpenAI’s GPT-4o responds to image-based inputs involving systems of hand-written equations, complicated multiple integrals, and identification of differences in medians from hand-drawn boxplots, and assess whether it produces correct and useful solutions for these tasks in the context of mathematics and statistics education.
References
It is unclear how ChatGPT4o would respond to a system of hand-written equations, or a complicated multiple integral, or in determining differences in medians among a set of hand-drawn boxplots.
— Equity in the Use of ChatGPT for the Classroom: A Comparison of the Accuracy and Precision of ChatGPT 3.5 vs. ChatGPT4 with Respect to Statistics and Data Science Exams
(2412.13116 - McGee et al., 17 Dec 2024) in Extensions and Future Work