
Multi-objective Differentiable Neural Architecture Search (2402.18213v2)

Published 28 Feb 2024 in cs.LG, cs.CV, and stat.ML

Abstract: Pareto front profiling in multi-objective optimization (MOO), i.e. finding a diverse set of Pareto optimal solutions, is challenging, especially with expensive objectives like neural network training. Typically, in MOO neural architecture search (NAS), we aim to balance performance and hardware metrics across devices. Prior NAS approaches simplify this task by incorporating hardware constraints into the objective function, but profiling the Pareto front necessitates a computationally expensive search for each constraint. In this work, we propose a novel NAS algorithm that encodes user preferences for the trade-off between performance and hardware metrics, and yields representative and diverse architectures across multiple devices in just one search run. To this end, we parameterize the joint architectural distribution across devices and multiple objectives via a hypernetwork that can be conditioned on hardware features and preference vectors, enabling zero-shot transferability to new devices. Extensive experiments with up to 19 hardware devices and 3 objectives showcase the effectiveness and scalability of our method. Finally, we show that, without extra costs, our method outperforms existing MOO NAS methods across a broad range of qualitatively different search spaces and datasets, including MobileNetV3 on ImageNet-1k, an encoder-decoder transformer space for machine translation and a decoder-only transformer space for language modelling.
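The core idea in the abstract is a single hypernetwork that maps a user preference vector and a hardware descriptor to a distribution over architectures, so that one search run profiles the whole Pareto front across devices. Below is a minimal PyTorch sketch of that idea under stated assumptions: the class `PrefConditionedHypernet`, the Dirichlet preference sampling, and the per-operation cost tables `op_latency` and `op_quality` are hypothetical stand-ins for illustration, not the authors' implementation, which optimizes a supernet's task loss and measured or predicted hardware metrics.

```python
# Illustrative sketch only: a preference- and hardware-conditioned
# hypernetwork for multi-objective differentiable NAS. The toy objectives
# below stand in for a supernet's task loss and a device latency predictor.
import torch
import torch.nn as nn
import torch.nn.functional as F

class PrefConditionedHypernet(nn.Module):
    """Maps (preference vector, hardware features) to architecture logits."""

    def __init__(self, n_objectives: int, hw_dim: int, n_edges: int, n_ops: int):
        super().__init__()
        self.n_edges, self.n_ops = n_edges, n_ops
        self.net = nn.Sequential(
            nn.Linear(n_objectives + hw_dim, 128),
            nn.ReLU(),
            nn.Linear(128, n_edges * n_ops),
        )

    def forward(self, pref: torch.Tensor, hw: torch.Tensor) -> torch.Tensor:
        logits = self.net(torch.cat([pref, hw], dim=-1))
        return logits.view(-1, self.n_edges, self.n_ops)

hypernet = PrefConditionedHypernet(n_objectives=2, hw_dim=16, n_edges=14, n_ops=8)
optimizer = torch.optim.Adam(hypernet.parameters(), lr=1e-3)

# Toy per-operation cost tables (hypothetical stand-ins for real objectives).
op_latency = torch.rand(8)   # stand-in for a device latency predictor
op_quality = torch.rand(8)   # stand-in for the supernet's task performance

for step in range(100):
    # Sample a trade-off preference so a single run covers the whole front.
    pref = torch.distributions.Dirichlet(torch.ones(2)).sample().unsqueeze(0)
    hw = torch.randn(1, 16)  # descriptor of a (randomly chosen) device

    # Gumbel-softmax keeps the discrete architecture choice differentiable.
    arch = F.gumbel_softmax(hypernet(pref, hw), tau=1.0, hard=True, dim=-1)

    perf_loss = -(arch * op_quality).sum()    # better ops -> lower loss
    latency_loss = (arch * op_latency).sum()  # costlier ops -> higher loss

    # Scalarize the objectives with the sampled preference vector.
    loss = pref[0, 0] * perf_loss + pref[0, 1] * latency_loss
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```

After training, conditioning the hypernetwork on an unseen device's hardware descriptor and a desired preference vector yields an architecture directly, which is the mechanism behind the zero-shot transferability claimed in the abstract.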

Authors (6)
  1. Rhea Sanjay Sukthanker
  2. Arber Zela
  3. Benedikt Staffler
  4. Samuel Dooley
  5. Josif Grabocka
  6. Frank Hutter
Citations (1)