HyT-NAS: Hybrid Transformers Neural Architecture Search for Edge Devices (2303.04440v2)

Published 8 Mar 2023 in cs.CV and cs.LG

Abstract: Vision Transformers have enabled recent attention-based Deep Learning (DL) architectures to achieve remarkable results in Computer Vision (CV) tasks. However, due to the extensive computational resources they require, these architectures are rarely deployed on resource-constrained platforms. Current research investigates hybrid handcrafted models that combine convolution-based and attention-based blocks for CV tasks such as image classification and object detection. In this paper, we propose HyT-NAS, an efficient Hardware-aware Neural Architecture Search (HW-NAS) whose search space includes hybrid architectures targeting vision tasks on tiny devices. HyT-NAS improves on state-of-the-art HW-NAS by enriching the search space and enhancing both the search strategy and the performance predictors. Our experiments show that HyT-NAS reaches a similar hypervolume with roughly 5x fewer training evaluations. Our resulting architecture outperforms MLPerf MobileNetV1 by 6.3% in accuracy with 3.5x fewer parameters on Visual Wake Words.
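The efficiency claim above is stated in terms of the hypervolume indicator, a standard measure of multi-objective search quality: the volume of objective space dominated by a search's Pareto front, measured against a fixed reference point. Below is a minimal sketch of the two-objective case (e.g., error rate vs. latency, both minimized); it is an illustration of the metric, not the authors' code, and all function names and example numbers are hypothetical.

```python
# Sketch: 2-D hypervolume indicator for comparing multi-objective
# HW-NAS runs. Both objectives are assumed to be minimized, e.g.
# (error rate, latency), and the reference point must be dominated
# by every solution on the front.

from typing import List, Tuple

Point = Tuple[float, float]

def pareto_front(points: List[Point]) -> List[Point]:
    """Keep only non-dominated points (both objectives minimized)."""
    return [
        p for p in points
        if not any(q[0] <= p[0] and q[1] <= p[1] and q != p for q in points)
    ]

def hypervolume_2d(points: List[Point], ref: Point) -> float:
    """Area dominated by the Pareto front and bounded by `ref`."""
    # Sorting the front by the first objective makes the second
    # objective strictly decrease, so the dominated region is a
    # staircase of disjoint rectangles.
    front = sorted(pareto_front(points))
    hv, prev_y = 0.0, ref[1]
    for x, y in front:
        hv += (ref[0] - x) * (prev_y - y)
        prev_y = y
    return hv

# Example: candidate architectures as (error rate, latency in ms).
candidates = [(0.10, 30.0), (0.08, 45.0), (0.12, 20.0)]
print(hypervolume_2d(candidates, ref=(0.20, 60.0)))  # 0.0041
```

Under these assumptions, two search strategies can be compared by the hypervolume their fronts reach per training evaluation, which is the sense in which the abstract contrasts a similar hypervolume against a roughly 5x smaller evaluation budget.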

Authors (4)
  1. Lotfi Abdelkrim Mecharbat (3 papers)
  2. Hadjer Benmeziane (11 papers)
  3. Hamza Ouarnoughi (12 papers)
  4. Smail Niar (15 papers)
Citations (2)
