MnasFPN: Learning Latency-aware Pyramid Architecture for Object Detection on Mobile Devices (1912.01106v2)

Published 2 Dec 2019 in cs.CV

Abstract: Despite the blooming success of architecture search for vision tasks in resource-constrained environments, the design of on-device object detection architectures has mostly been manual. The few automated search efforts are either centered around non-mobile-friendly search spaces or not guided by on-device latency. We propose MnasFPN, a mobile-friendly search space for the detection head, and combine it with latency-aware architecture search to produce efficient object detection models. The learned MnasFPN head, when paired with a MobileNetV2 body, outperforms MobileNetV3+SSDLite by 1.8 mAP at similar latency on Pixel. It is also both 1.0 mAP more accurate and 10% faster than NAS-FPNLite. Ablation studies show that the majority of the performance gain comes from innovations in the search space. Further explorations reveal an interesting coupling between the search space design and the search algorithm, and that the complexity of the MnasFPN search space may be at a local optimum.
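
As context for the latency-aware search mentioned in the abstract, the sketch below illustrates a MnasNet-style multi-objective reward that trades detection accuracy against measured on-device latency. This is a minimal illustration, not the authors' code: the function name, target latency, and exponent are assumed values chosen for readability.

```python
# Minimal sketch (not the authors' implementation) of a MnasNet-style
# latency-aware reward used to guide architecture search: accuracy is
# traded off against measured on-device latency via a soft power penalty.
# The target latency (200 ms) and exponent (-0.07) are illustrative assumptions.

def latency_aware_reward(map_score: float,
                         latency_ms: float,
                         target_ms: float = 200.0,
                         w: float = -0.07) -> float:
    """Combine detection accuracy (COCO mAP) with measured latency.

    reward = mAP * (latency / target) ** w, with w < 0 so that candidates
    slower than the target are penalized and faster ones mildly rewarded.
    """
    return map_score * (latency_ms / target_ms) ** w

# Hypothetical example: a candidate head scoring 26.1 mAP at 190 ms on a
# Pixel phone beats the latency target and so receives a slight boost.
print(latency_aware_reward(26.1, 190.0))
```

With a negative exponent, the reward softly penalizes candidates whose measured latency exceeds the target while only mildly rewarding faster ones, keeping the search focused on the accuracy-latency trade-off rather than raw speed.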

Authors (7)
  1. Bo Chen (309 papers)
  2. Golnaz Ghiasi (20 papers)
  3. Hanxiao Liu (35 papers)
  4. Tsung-Yi Lin (49 papers)
  5. Dmitry Kalenichenko (5 papers)
  6. Hartwig Adam (1 paper)
  7. Quoc V. Le (128 papers)
Citations (51)
