Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A real-time spatiotemporal AI model analyzes skill in open surgical videos (2112.07219v1)

Published 14 Dec 2021 in cs.CV and cs.AI

Abstract: Open procedures represent the dominant form of surgery worldwide. AI has the potential to optimize surgical practice and improve patient outcomes, but efforts have focused primarily on minimally invasive techniques. Our work overcomes existing data limitations for training AI models by curating, from YouTube, the largest dataset of open surgical videos to date: 1997 videos from 23 surgical procedures uploaded from 50 countries. Using this dataset, we developed a multi-task AI model capable of real-time understanding of surgical behaviors, hands, and tools - the building blocks of procedural flow and surgeon skill. We show that our model generalizes across diverse surgery types and environments. Illustrating this generalizability, we directly applied our YouTube-trained model to analyze open surgeries prospectively collected at an academic medical center and identified kinematic descriptors of surgical skill related to efficiency of hand motion. Our Annotated Videos of Open Surgery (AVOS) dataset and trained model will be made available for further development of surgical AI.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (13)
  1. Emmett D. Goodman (1 paper)
  2. Krishna K. Patel (1 paper)
  3. Yilun Zhang (7 papers)
  4. William Locke (1 paper)
  5. Chris J. Kennedy (3 papers)
  6. Rohan Mehrotra (1 paper)
  7. Stephen Ren (1 paper)
  8. Melody Y. Guan (12 papers)
  9. Maren Downing (1 paper)
  10. Hao Wei Chen (1 paper)
  11. Jevin Z. Clark (1 paper)
  12. Gabriel A. Brat (51 papers)
  13. Serena Yeung (39 papers)
Citations (19)

Summary

We haven't generated a summary for this paper yet.