
YOLO -- You only look 10647 times (2201.06159v2)

Published 16 Jan 2022 in cs.CV

Abstract: In this work, we explain the "You Only Look Once" (YOLO) single-stage object detection approach as a parallel classification of 10647 fixed region proposals. We support this view by showing that each of YOLO's output pixels is attentive to a specific sub-region of previous layers, comparable to a local region proposal. This understanding reduces the conceptual gap between YOLO-like single-stage object detection models, RCNN-like two-stage region-proposal-based models, and ResNet-like image classification models. In addition, we created interactive exploration tools for a better visual understanding of the YOLO information processing streams: https://limchr.github.io/yolo_visualization
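The figure of 10647 can be reproduced with a short calculation. The sketch below assumes the common YOLOv3 configuration, which the abstract itself does not spell out: a 416x416 input, three detection scales with strides 32, 16, and 8, and three anchor boxes per grid cell. The function name `count_yolo_predictions` and its parameters are illustrative, not taken from the paper.

```python
def count_yolo_predictions(input_size=416, strides=(32, 16, 8), anchors_per_cell=3):
    """Count the fixed box predictions a YOLOv3-style head emits.

    Assumed setup (not stated in the abstract): each detection scale
    divides the input into a square grid of (input_size // stride) cells
    per side, and every cell predicts `anchors_per_cell` boxes.
    """
    total = 0
    for stride in strides:
        grid = input_size // stride          # 13, 26, 52 for a 416 input
        total += grid * grid * anchors_per_cell
    return total


if __name__ == "__main__":
    # (13*13 + 26*26 + 52*52) * 3 = 3549 * 3
    print(count_yolo_predictions())  # 10647
```

Under these assumptions, each of the 10647 outputs corresponds to one fixed grid cell and anchor pair, which is what allows them to be read as fixed region proposals classified in parallel. A different input resolution changes the count, for example 22743 for a 608x608 input.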

Authors (4)
  1. Christian Limberg (3 papers)
  2. Andrew Melnik (33 papers)
  3. Augustin Harter (4 papers)
  4. Helge Ritter (27 papers)
Citations (5)