Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
144 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Augmented Physics: Creating Interactive and Embedded Physics Simulations from Static Textbook Diagrams (2405.18614v2)

Published 28 May 2024 in cs.HC, cs.CV, and cs.LG

Abstract: We introduce Augmented Physics, a machine learning-integrated authoring tool designed for creating embedded interactive physics simulations from static textbook diagrams. Leveraging recent advancements in computer vision, such as Segment Anything and Multi-modal LLMs, our web-based system enables users to semi-automatically extract diagrams from physics textbooks and generate interactive simulations based on the extracted content. These interactive diagrams are seamlessly integrated into scanned textbook pages, facilitating interactive and personalized learning experiences across various physics concepts, such as optics, circuits, and kinematics. Drawing from an elicitation study with seven physics instructors, we explore four key augmentation strategies: 1) augmented experiments, 2) animated diagrams, 3) bi-directional binding, and 4) parameter visualization. We evaluate our system through technical evaluation, a usability study (N=12), and expert interviews (N=12). Study findings suggest that our system can facilitate more engaging and personalized learning experiences in physics education.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (76)
  1. Year (if available). myPhysicslab. https://www.myphysicslab.com Accessed on Date (e.g., September 6, 2023).
  2. Elastic documents: Coupling text and tables through contextual visualizations for enhanced document reading. IEEE transactions on visualization and computer graphics 25, 1 (2018), 661–671.
  3. Video puppetry: a performative interface for cutout animation. In ACM SIGGRAPH Asia 2008 papers. 1–9.
  4. Mark Billinghurst and Andreas Duenser. 2012. Augmented reality in the classroom. Computer 45, 7 (2012), 56–63.
  5. Teaching and learning physics using 3D virtual learning environment: A case study of combined virtual reality and virtual laboratory in secondary school. Journal of Computers in Mathematics and Science Teaching 39, 1 (2020), 5–18.
  6. John Brooke. 1996. Sus: a “quick and dirty’usability. Usability evaluation in industry 189, 3 (1996), 189–194.
  7. Using the augmented reality 3D technique for a convex imaging experiment in a physics course. International Journal of Engineering Education 29, 4 (2013), 856–865.
  8. Design and application of interactive simulations in problem-solving in university-level physics education. Journal of Science Education and Technology 25 (2016), 590–609.
  9. Salman Cheema and Joseph LaViola. 2012. PhysicsBook: a sketch-based interface for animating physics diagrams. In Proceedings of the 2012 ACM international conference on Intelligent User Interfaces. 51–60.
  10. Marvista: A Human-AI Collaborative Reading Tool. arXiv preprint arXiv:2207.08401 (2022).
  11. Zhutian Chen and Haijun Xia. 2022. Crossdata: Leveraging text-data connections for authoring data documents. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. 1–15.
  12. ScentHighlights: highlighting conceptually-related sentences during reading. In Proceedings of the 10th international conference on Intelligent user interfaces. 272–274.
  13. Augmented Math: Authoring AR-Based Explorable Explanations by Augmenting Static Math Textbooks. In Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology. 1–16.
  14. Idyll studio: A structured editor for authoring interactive & data-driven articles. In The 34th Annual ACM Symposium on User Interface Software and Technology. 1–12.
  15. K-sketch: a’kinetic’sketch pad for novice animators. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. 413–422.
  16. Increasing the transparency of research papers with explorable multiverse analyses. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. 1–15.
  17. Scim: Intelligent Skimming Support for Scientific Papers. In Proceedings of the 28th International Conference on Intelligent User Interfaces. 476–490.
  18. Hobit: Hybrid optical bench for innovative teaching. In Proceedings of the 2017 chi conference on human factors in computing systems. 949–959.
  19. Juan Garzón. 2021. An overview of twenty-five years of augmented reality in education. Multimodal Technologies and Interaction 5, 7 (2021), 37.
  20. GeoGebra. [n.d.]. GeoGebra - the world’s favorite, free math tools used by over 100 million students and teachers. https://www.geogebra.org/?lang=en. (Accessed on 03/21/2023).
  21. Drawing Transforms: A Unifying Interaction Primitive to Procedurally Manipulate Graphics across Style, Space, and Time. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. 1–15.
  22. Math Augmentation: How Authors Enhance the Readability of Formulas using Novel Visual Design Practices. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. 1–18.
  23. Communicating with interactive articles. Distill 5, 9 (2020), e28.
  24. Contextifier: automatic generation of annotated stock visualizations. In Proceedings of the SIGCHI Conference on human factors in computing systems. 2707–2716.
  25. Improving comprehension of measurements using concrete re-expression strategies. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. 1–12.
  26. Amir Jahanlou and Parmit K Chilana. 2022. Katika: An End-to-End System for Authoring Amateur Explainer Motion Graphics Videos. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. 1–14.
  27. Sketched Reality: Sketching Bi-Directional Interactions Between Virtual and Physical Worlds with AR and Actuated Tangible UI. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology. 1–12.
  28. Threddy: An Interactive System for Personalized Thread-based Exploration and Organization of Scientific Literature. In Proceedings of the 35th Annual ACM Symposium on User Interface Software and Technology. 1–15.
  29. Hannes Kaufmann and Bernd Meyer. 2008. Simulating educational physical experiments in augmented reality. In ACM SIGGRAPH Asia 2008 Educators Programme. 1–8.
  30. Kitty: sketching dynamic and interactive illustrations. In Proceedings of the 27th annual ACM symposium on User interface software and technology. 395–405.
  31. Draco: bringing life to illustrations with kinetic textures. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. 351–360.
  32. Motion amplifiers: sketching dynamic illustrations using the principles of 2D animation. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems. 4599–4609.
  33. Facilitating document reading by linking text and tables. In Proceedings of the 31st Annual ACM Symposium on User Interface Software and Technology. 423–434.
  34. Simulation in education and training. Applied System Simulation: Methodologies and Applications (2003), 437–456.
  35. Segment anything. arXiv preprint arXiv:2304.02643 (2023).
  36. Randall Dewey Knight. 2017. Physics for scientists and engineers: a strategic approach with modern physics. Pearson New York, NY.
  37. Extracting references between text and charts via crowdsourcing. In Proceedings of the SIGCHI conference on Human Factors in Computing Systems. 31–40.
  38. Kori: Interactive synthesis of text and charts in data documents. IEEE Transactions on Visualization and Computer Graphics 28, 1 (2021), 184–194.
  39. Sam Lau and Philip J Guo. 2020. Data Theater: A live programming environment for prototyping data-driven explorable explanations. In Workshop on Live Programming (LIVE).
  40. Joseph J. LaViola and Robert C. Zeleznik. 2007. MathPad2: A System for the Creation and Exploration of Mathematical Sketches. In ACM SIGGRAPH 2007 Courses (San Diego, California) (SIGGRAPH ’07). Association for Computing Machinery, New York, NY, USA, 46–es. https://doi.org/10.1145/1281500.1281557
  41. Explore effective use of computer simulations for physics education. Journal of Computers in Mathematics and Science Teaching 27, 4 (2008), 443–466.
  42. Physica: Interactive Tangible Physics Simulation based on Tabletop Mobile Robots Towards Explorable Physics Education. In Proceedings of the 2023 ACM Designing Interactive Systems Conference. 1485–1499.
  43. HoloDoc: Enabling Mixed Reality Workspaces That Harness Physical and Digital Content. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems (Glasgow, Scotland Uk) (CHI ’19). Association for Computing Machinery, New York, NY, USA, 1–14. https://doi.org/10.1145/3290605.3300917
  44. A layered authoring tool for stylized 3d animations. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. 1–14.
  45. Charagraph: Interactive Generation of Charts for Realtime Annotation of Data-Rich Paragraphs. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems. 1–18.
  46. ConductAR: an augmented reality based tool for iterative design of conductive ink circuits. In Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing. 791–800.
  47. PhET: Interactive simulations for teaching and learning physics. The physics teacher 44, 1 (2006), 18–23.
  48. Chalktalk : A Visualization and Communication Language – As a Tool in the Domain of Computer Science Education. arXiv:1809.07166 [cs.HC]
  49. Illuminating clay: a 3-D tangible interface for landscape analysis. In Proceedings of the SIGCHI conference on Human factors in computing systems. 355–362.
  50. Dually noted: layout-aware annotations with smartphone augmented reality. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. 1–15.
  51. Iulian Radu and Bertrand Schneider. 2019. What can we learn from augmented reality (AR)? Benefits and drawbacks of AR for inquiry-based learning of physics. In Proceedings of the 2019 CHI conference on human factors in computing systems. 1–12.
  52. Shwetha Rajaram and Michael Nebeling. 2022. Paper trail: An immersive authoring system for augmented reality instructional experiences. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems. 1–16.
  53. Constructing Embodied Algebra by Sketching5. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems. 1–16.
  54. Effects of the 5E teaching model using interactive simulation on achievement and attitude in physics education. International Journal of Innovation in Science and Mathematics Education 25, 3 (2017).
  55. User-guided synthesis of interactive diagrams. In Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems. 195–207.
  56. Revision: Automated classification, analysis and redesign of chart images. In Proceedings of the 24th annual ACM symposium on User interface software and technology. 393–402.
  57. Jeremy Scott and Randall Davis. 2013. Physink: sketching physical behavior. In Adjunct Proceedings of the 26th Annual ACM Symposium on User Interface Software and Technology. 9–10.
  58. Texsketch: Active diagramming through pen-and-ink annotations. In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. 1–13.
  59. Realitysketch: Embedding responsive graphics and visualizations in AR through dynamic sketching. In Proceedings of the 33rd Annual ACM Symposium on User Interface Software and Technology. 166–181.
  60. Effects of augmented reality on learning and cognitive load in university physics laboratory courses. Computers in Human Behavior 108 (2020), 106316.
  61. John Underkoffler and Hiroshi Ishii. 1999. Urp: a luminous-tangible workbench for urban planning and design. In Proceedings of the SIGCHI conference on Human Factors in Computing Systems. 386–393.
  62. Bret Victor. 2005. Magic Ink: Information software and the graphical interface, 2005. URL http://worrydream. com/MagicInk (2005), 30.
  63. Bret Victor. 2011a. Explorable explanations. Online. http://worrydream. com/ExplorableExplanations 17 (2011).
  64. Bret Victor. 2011b. KillMath.
  65. Bret Victor. 2013a. Media for thinking the unthinkable. MIT Media Lab on April 4 (2013), 2013.
  66. Bret Victor. 2013b. Tangle: explorable explanations made easy.
  67. Tom Walsh. [n.d.]. oPhysics. https://ophysics.com/. Accessed: 2023-07-30.
  68. Website. [n.d.]a. CircuitTUI. Retrieved Sept 11th, 2023 from https://tangible.media.mit.edu/project/circuittui/
  69. Website. [n.d.]b. PhET: Interactive Simulations for science and math. Retrieved Sept 11th, 2023 from https://phet.colorado.edu/
  70. Website. [n.d.]c. Physion : Interactive Physics Simulations. Retrieved Sept 11th, 2023 from https://physion.net/
  71. Website. [n.d.]d. Simphy : Physics simplified. Retrieved Sept 11th, 2023 from https://simphy.com/
  72. Website. [n.d.]e. the Physics Classroom : Physics Interactives. Retrieved Sept 11th, 2023 from https://www.physicsclassroom.com/Physics-Interactives
  73. A mixed-initiative interface for animating static pictures. In Proceedings of the 31st Annual ACM Symposium on User Interface Software and Technology. 649–661.
  74. RealityCanvas: Augmented Reality Sketching for Embedded and Responsive Scribble Animation Effects. In Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology. 1–14.
  75. Energy-brushes: Interactive tools for illustrating stylized elemental dynamics. In Proceedings of the 29th Annual Symposium on User Interface Software and Technology. 755–766.
  76. Sketch-based dynamic illustration of fluid systems. In Proceedings of the 2011 SIGGRAPH Asia Conference. 1–8.

Summary

  • The paper presents a system that converts static textbook diagrams into interactive simulations using advanced computer vision and multi-modal LLM techniques.
  • Its methodology involves semi-automatic segmentation and specialized simulation modes for kinematics, optics, circuits, and animations with various success rates.
  • User studies report high usability (SUS 92.73) and significant pedagogical potential, promoting active engagement with complex physics concepts.

Analysis of Augmented Physics: Transforming Static Diagrams into Interactive Simulations

The paper "Augmented Physics: Creating Interactive and Embedded Physics Simulations from Static Textbook Diagrams" introduces a tool that leverages advances in machine learning, particularly computer vision and multi-modal LLMs, to convert static textbook diagrams into interactive simulations. This initiative aims to bridge the gap between traditional learning resources and the potential benefits of dynamic educational tools.

Core Contributions

The researchers have developed a web-based system capable of extracting diagrammatic content from scanned textbook pages to produce interactive simulations. This approach is categorized into several distinctive simulation experiences: kinematics, optics, circuits, and animations. The tool facilitates these transformations with a semi-automatic segmentation process using advanced computer vision techniques.

Methodological Insights

The system’s workflow consists of importing diagrams, choosing simulation types, segmenting images, defining object roles, running simulations, and interacting with results. By integrating an intuitive authoring interface, educators and students can engage with textbook content more interactively without requiring deep programming skills. This process is supported by robust backend integrations employing Segment-Anything, OpenCV, and multi-modal LLMs for accurate content extraction and classification.

Evaluation and Findings

A comprehensive technical evaluation was conducted, indicating varying success rates across different types of diagrams: kinematics (64%), optics (44%), circuits (40% with minor corrections), and animations (66%). These evaluations involved processing 200 diagrams across six textbooks and demonstrated the system’s potential in rendering diagrams into simulatable forms albeit with some limitations, especially in complex illustrations.

In addition to the technical review, user-centered assessments included a usability paper and expert interviews. The usability paper gathered positive feedback, with a System Usability Scale (SUS) score of 92.73, highlighting the tool's potential to aid independent, interest-driven learning. Meanwhile, expert interviews emphasized the tool’s complementary role to traditional resources and its potential in fostering deeper engagement with complex physical concepts.

Pedagogical Implications

The integration of interactive simulations into physics education through Augmented Physics has substantial pedagogical implications. By enabling learners to manipulate and visualize abstract concepts dynamically, the tool facilitates a more profound learning experience that static diagrams alone cannot provide. This embodiment of learner-centered exploration aligns well with contemporary educational paradigms that emphasize active learning and critical thinking.

Challenges and Future Directions

The current limitations predominantly lie in complexity handling and result accuracy, necessitating teacher oversight to mitigate potential misconceptions. The paper suggests future expansions, such as implementing more flexible control mechanisms, broader topic coverage, and enhanced visualization supports. Additionally, exploring large-scale classroom deployment and AR integrations could further enhance the system's learning impact.

Furthermore, there is a promising avenue in integrating AI-driven tutoring via multimodal LLMs to create adaptive learning experiences, potentially elevating the tool’s educational value and accessibility.

Conclusion

This paper showcases a significant stride in integrating cutting-edge technology into physics education, providing a framework for interactive and personalized learning experiences. While still in its developmental phase, Augmented Physics presents a viable approach toward modernizing traditional educational resources and enriching learners' interactions with core scientific concepts. Continued development and deployment could see its adaptation in broader educational contexts, supporting a more interactive and effective learning ecosystem.

Youtube Logo Streamline Icon: https://streamlinehq.com