2000 character limit reached
YOLO -- You only look 10647 times (2201.06159v2)
Published 16 Jan 2022 in cs.CV
Abstract: With this work we are explaining the "You Only Look Once" (YOLO) single-stage object detection approach as a parallel classification of 10647 fixed region proposals. We support this view by showing that each of YOLOs output pixel is attentive to a specific sub-region of previous layers, comparable to a local region proposal. This understanding reduces the conceptual gap between YOLO-like single-stage object detection models, RCNN-like two-stage region proposal based models, and ResNet-like image classification models. In addition, we created interactive exploration tools for a better visual understanding of the YOLO information processing streams: https://limchr.github.io/yolo_visualization
- Christian Limberg (3 papers)
- Andrew Melnik (33 papers)
- Augustin Harter (4 papers)
- Helge Ritter (27 papers)