DiffusionNAG: Predictor-guided Neural Architecture Generation with Diffusion Models (2305.16943v4)

Published 26 May 2023 in cs.LG

Abstract: Existing NAS methods suffer from an excessive amount of time for repetitive sampling and training of many task-irrelevant architectures. To tackle such limitations of existing NAS methods, we propose a paradigm shift from NAS to a novel conditional Neural Architecture Generation (NAG) framework based on diffusion models, dubbed DiffusionNAG. Specifically, we consider the neural architectures as directed graphs and propose a graph diffusion model for generating them. Moreover, with the guidance of parameterized predictors, DiffusionNAG can flexibly generate task-optimal architectures with the desired properties for diverse tasks, by sampling from a region that is more likely to satisfy the properties. This conditional NAG scheme is significantly more efficient than previous NAS schemes which sample the architectures and filter them using the property predictors. We validate the effectiveness of DiffusionNAG through extensive experiments in two predictor-based NAS scenarios: Transferable NAS and Bayesian Optimization (BO)-based NAS. DiffusionNAG achieves superior performance with speedups of up to 35 times when compared to the baselines on Transferable NAS benchmarks. Furthermore, when integrated into a BO-based algorithm, DiffusionNAG outperforms existing BO-based NAS approaches, particularly in the large MobileNetV3 search space on the ImageNet 1K dataset. Code is available at https://github.com/CownowAn/DiffusionNAG.

Authors (5)
  1. Sohyun An (5 papers)
  2. Hayeon Lee (14 papers)
  3. Jaehyeong Jo (14 papers)
  4. Seanie Lee (28 papers)
  5. Sung Ju Hwang (178 papers)
Citations (7)

Summary

Overview of DiffusionNAG: Predictor-Guided Neural Architecture Generation with Diffusion Models

The paper "DiffusionNAG: Predictor-guided Neural Architecture Generation with Diffusion Models" presents an innovative framework for Neural Architecture Generation (NAG) utilizing diffusion models, termed DiffusionNAG. The primary objective of DiffusionNAG is to address the limitations of existing Neural Architecture Search (NAS) methodologies, which often suffer from high computational costs due to repetitive sampling and full training of numerous task-irrelevant architectures. DiffusionNAG introduces a paradigm shift from traditional NAS to a more efficient NAG process guided by predictors.

Key Contributions

  1. Conditional Neural Architecture Generation (NAG): The authors propose a conditional NAG framework that employs diffusion models for generating neural architectures as directed graphs. By incorporating parameterized predictors, the model flexibly generates task-optimal architectures with desired properties, significantly improving efficiency over traditional NAS methods.
  2. Predictor-Guided Diffusion Model: The approach builds on diffusion generative models, which gradually inject noise into data and learn to reverse this process, a technique proven effective across many domains. Parameterized predictors are integrated to guide generation toward architectures that satisfy specific objectives such as accuracy or robustness (a toy sampling sketch follows this list).
  3. Score Network for Valid Architecture Generation: The paper introduces a novel score network for neural architectures to ensure valid generation by capturing the computational flow and positional information in directed acyclic graphs.
  4. Applications and Performance: DiffusionNAG demonstrates superior performance in two predictor-based NAS scenarios: Transferable NAS and Bayesian Optimization (BO)-based NAS. It achieves speedups of up to 35 times over baselines on Transferable NAS benchmarks, and when integrated into a BO-based algorithm it outperforms existing BO-based NAS approaches, particularly in the large MobileNetV3 search space on ImageNet 1K.
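
To make items 2 and 3 concrete, below is a minimal, self-contained sketch of predictor-guided reverse-diffusion sampling over a relaxed (continuous) architecture encoding. It is not the authors' implementation: `ToyScoreNet`, `ToyPredictor`, the toy search-space dimensions, the noise schedule, and the use of the raw predictor gradient as the guidance signal are all illustrative assumptions made here for brevity.

```python
# Minimal sketch of predictor-guided reverse-diffusion sampling over a relaxed
# architecture encoding (a nodes-by-operations matrix). All names, shapes, and
# schedules here are illustrative stand-ins, not the paper's implementation.
import math
import torch

NUM_NODES, NUM_OPS = 8, 5          # toy search-space dimensions
SIGMA_MIN, SIGMA_MAX = 0.01, 5.0   # VE-SDE noise scales
STEPS = 200


class ToyScoreNet(torch.nn.Module):
    """Stand-in for a DAG-aware score network s_theta(A_t, t) ~ grad log p_t(A_t)."""

    def __init__(self):
        super().__init__()
        self.net = torch.nn.Sequential(
            torch.nn.Linear(NUM_NODES * NUM_OPS + 1, 128),
            torch.nn.ReLU(),
            torch.nn.Linear(128, NUM_NODES * NUM_OPS),
        )

    def forward(self, a, t):
        x = torch.cat([a.flatten(1), t[:, None]], dim=1)
        return self.net(x).view_as(a)


class ToyPredictor(torch.nn.Module):
    """Stand-in property predictor f_phi(A_t, t) -> scalar score (e.g. accuracy)."""

    def __init__(self):
        super().__init__()
        self.net = torch.nn.Sequential(
            torch.nn.Linear(NUM_NODES * NUM_OPS + 1, 64),
            torch.nn.ReLU(),
            torch.nn.Linear(64, 1),
        )

    def forward(self, a, t):
        x = torch.cat([a.flatten(1), t[:, None]], dim=1)
        return self.net(x).squeeze(-1)


def sigma(t):
    """VE-SDE noise level sigma(t) = sigma_min * (sigma_max / sigma_min)^t."""
    return SIGMA_MIN * (SIGMA_MAX / SIGMA_MIN) ** t


def sample(score_net, predictor, guidance=1.0, batch=4):
    """Euler-Maruyama integration of the predictor-guided reverse-time VE-SDE."""
    log_ratio = math.log(SIGMA_MAX / SIGMA_MIN)
    a = SIGMA_MAX * torch.randn(batch, NUM_NODES, NUM_OPS)   # sample from the prior
    ts = torch.linspace(1.0, 1e-3, STEPS)
    for t, t_next in zip(ts[:-1], ts[1:]):
        tb = t.expand(batch)
        with torch.no_grad():
            score = score_net(a, tb)                         # approximates grad log p_t(a)
        a_grad = a.detach().requires_grad_(True)
        prop = predictor(a_grad, tb).sum()                   # predicted property on noisy input
        guide = torch.autograd.grad(prop, a_grad)[0]         # simple gradient-based guidance
        g2 = sigma(t) ** 2 * 2.0 * log_ratio                 # g(t)^2 for the VE-SDE
        dt = t_next - t                                      # negative: time runs backward
        a = (a.detach()
             - g2 * (score + guidance * guide) * dt
             + math.sqrt(g2) * math.sqrt(-dt) * torch.randn_like(a))
    return a.argmax(dim=-1)                                  # discretize: one op id per node


if __name__ == "__main__":
    score_net, predictor = ToyScoreNet(), ToyPredictor()     # untrained stand-ins
    ops = sample(score_net, predictor, guidance=2.0)
    print(ops.shape)                                         # torch.Size([4, 8])
```

In DiffusionNAG itself the score network is DAG-aware (it encodes the computational flow and node positions of the architecture graph) and the predictor is trained on noisy architectures; here both are plain MLP stand-ins so that the control flow of guided sampling stays visible.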

Methodology

The diffusion process models the forward perturbation of the architecture distribution toward a known prior, which is then reversed to sample architectures. The framework uses a Variance Exploding (VE) SDE as the forward process together with a novel score network that captures the directed nature of neural architectures; the formulation is sketched below.
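
To make this explicit, the following is the standard conditional VE-SDE formulation (after Song et al., 2021), written for an architecture encoding $A_t$; the paper's exact noise schedule and guidance parameterization may differ from this generic form.

```latex
% Forward VE-SDE perturbing a clean architecture encoding A_0 toward a Gaussian prior:
dA_t \;=\; g(t)\, dW_t ,
\qquad
g(t) \;=\; \sigma_{\min}\!\left(\tfrac{\sigma_{\max}}{\sigma_{\min}}\right)^{t}
           \sqrt{2\log\tfrac{\sigma_{\max}}{\sigma_{\min}}} ,
\qquad t \in [0, 1].

% Predictor-guided reverse-time SDE used to sample architectures with a desired property y
% (time runs from t = 1 back to t = 0; \bar{W}_t is a reverse-time Wiener process):
dA_t \;=\; -\,g(t)^2 \left[ \nabla_{A_t}\log p_t(A_t)
           \;+\; \nabla_{A_t}\log p_t(y \mid A_t) \right] dt
       \;+\; g(t)\, d\bar{W}_t .
```

The first gradient is approximated by the score network and the second by the parameterized property predictor, which is what allows the same trained generator to be steered by different predictors.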

Implications

  • Efficiency in Search Space Exploration: DiffusionNAG's capability to generate architectures aligned closely with specified distributions leads to reduced computational overhead, making it highly efficient.
  • Versatility Across Tasks: The plug-and-play nature of the predictors allows the generative model to be adapted to various NAS tasks without retraining, covering diverse scenarios such as latency- or robustness-constrained NAS (see the sketch after this list).
  • Potential for Enhancements: The framework sets a foundation for future enhancement of diffusion-based models in NAS, suggesting further explorations into adaptive and context-aware architecture generation.
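
To illustrate the plug-and-play point concretely, the snippet below continues the hypothetical toy sketch from the Key Contributions section (reusing its `ToyScoreNet`, `ToyPredictor`, and `sample` stand-ins): the trained score network is left untouched and only the guiding predictor, i.e. the guidance term of the reverse process, is swapped.

```python
# Continues the toy sketch above: ToyScoreNet, ToyPredictor, and sample() are the
# illustrative stand-ins defined there, not the paper's actual modules.
score_net = ToyScoreNet()                 # trained once, then frozen and reused

accuracy_predictor = ToyPredictor()       # stand-in predictor of task accuracy
latency_predictor = ToyPredictor()        # stand-in predictor of device latency

# Guide sampling toward high predicted accuracy ...
accurate_archs = sample(score_net, accuracy_predictor, guidance=2.0)
# ... or toward low predicted latency by flipping the sign of the guidance weight.
fast_archs = sample(score_net, latency_predictor, guidance=-2.0)
```

Because the generative model itself is untouched, adapting to a new constraint only requires training (or reusing) a predictor for that property.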

Future Directions

The paper opens multiple avenues for exploration, including:

  • Better Exploration of Large Search Spaces: Given its efficiency, DiffusionNAG can be extended and refined for other expansive search spaces across different domains.
  • Integration with More Complex Predictors: Future work might explore integration with more sophisticated predictors that consider additional task constraints.
  • Enhanced Score Network Architecture: Refinements to the score network could further improve the fidelity of the architecture generation process.

In conclusion, DiffusionNAG offers a robust and efficient alternative to traditional NAS, indicating a significant step forward in the automation of neural architecture design. The proposed framework effectively balances performance and efficiency, promising contributions to several practical applications in neural architecture development.