Text to Automata Diagrams: Comparing TikZ Code Generation with Direct Image Synthesis

Published 9 Mar 2026 in cs.CV | (2603.07936v1)

Abstract: Diagrams are widely used in teaching computer science, particularly in subjects such as automata and formal languages and data structures. These diagrams, often drawn by students during exams or assignments, vary in structure, layout, and correctness. This study examines whether current vision-language models and LLMs can process such diagrams and produce accurate textual and digital representations. Scanned student-drawn diagrams serve as input, and a vision-LLM generates a textual description of each image. Human reviewers then check and revise the descriptions for accuracy. Both the generated and the revised descriptions are fed to an LLM to generate TikZ code, and the compiled diagrams are evaluated against the original scanned diagrams. We found that descriptions generated directly from images by vision-LLMs are often incorrect, and that human correction can substantially improve their quality. This research can support computer science education by paving the way for automated grading and feedback and for more accessible instructional materials.
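
To make the target representation concrete, the following is a minimal sketch of what TikZ code for a small automaton can look like, produced here by a hand-written Python helper rather than by an LLM. The function name `automaton_to_tikz`, its parameters, and the left-to-right layout are assumptions for illustration and are not taken from the paper. The output relies on the TikZ `automata` and `positioning` libraries.

```python
def automaton_to_tikz(states, initial, accepting, transitions):
    """Render a finite automaton as TikZ source (hypothetical helper).

    states: ordered list of state names, laid out left to right
    initial: name of the start state
    accepting: set of accepting state names
    transitions: list of (source, label, target) triples
    """
    lines = [
        r"\begin{tikzpicture}[shorten >=1pt, node distance=2cm, on grid, auto]",
    ]
    previous = None
    for name in states:
        options = ["state"]
        if name == initial:
            options.append("initial")
        if name in accepting:
            options.append("accepting")
        if previous is not None:
            # Place each state to the right of the one before it.
            options.append(f"right=of {previous}")
        lines.append(rf"  \node[{', '.join(options)}] ({name}) {{${name}$}};")
        previous = name
    lines.append(r"  \path[->]")
    for source, label, target in transitions:
        # Self-loops are drawn above the state; other edges are bent so that
        # a pair of opposite edges does not overlap.
        edge = "edge[loop above]" if source == target else "edge[bend left]"
        lines.append(rf"    ({source}) {edge} node {{{label}}} ({target})")
    lines.append(r"  ;")
    lines.append(r"\end{tikzpicture}")
    return "\n".join(lines)


if __name__ == "__main__":
    # A two-state automaton: q0 --a--> q1, with a self-loop on q1 labelled b.
    print(automaton_to_tikz(
        states=["q0", "q1"],
        initial="q0",
        accepting={"q1"},
        transitions=[("q0", "a", "q1"), ("q1", "b", "q1")],
    ))
```

To compile the printed picture, the LaTeX preamble needs `\usepackage{tikz}` and `\usetikzlibrary{automata, positioning}`. A structured description like the one passed in here is only one possible intermediate form; the study itself works from free-text descriptions.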
