
Efficacy of Neural Prediction-Based Zero-Shot NAS (2308.16775v3)

Published 31 Aug 2023 in cs.LG and cs.AI

Abstract: In prediction-based Neural Architecture Search (NAS), performance indicators derived from graph convolutional networks have shown remarkable success. These indicators, achieved by representing feed-forward structures as component graphs through one-hot encoding, face a limitation: their inability to evaluate architecture performance across varying search spaces. In contrast, handcrafted performance indicators (zero-shot NAS), which use the same architecture with random initialization, can generalize across multiple search spaces. Addressing this limitation, we propose a novel approach for zero-shot NAS using deep learning. Our method employs Fourier sum of sines encoding for convolutional kernels, enabling the construction of a computational feed-forward graph with a structure similar to the architecture under evaluation. These encodings are learnable and offer a comprehensive view of the architecture's topological information. An accompanying multi-layer perceptron (MLP) then ranks these architectures based on their encodings. Experimental results show that our approach surpasses previous methods using graph convolutional networks in terms of correlation on the NAS-Bench-201 dataset and exhibits a higher convergence rate. Moreover, our extracted feature representation trained on each NAS benchmark is transferable to other NAS benchmarks, showing promising generalizability across multiple search spaces. The code is available at: https://github.com/minh1409/DFT-NPZS-NAS
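The two ideas the abstract leans on, a sum-of-sines parameterization of a convolutional kernel and a rank-correlation score for comparing predicted and measured architecture quality, can be illustrated with a minimal sketch. This is not the authors' implementation (see the linked repository for that). The function names, the sampling of positions on [0, 1), and the choice of Kendall's tau as the correlation measure are all assumptions made for illustration; the abstract does not specify them.

```python
import math
from itertools import combinations


def sum_of_sines(x, amplitudes, frequencies, phases):
    """Evaluate sum_k a_k * sin(w_k * x + p_k) at a scalar position x.

    In a learnable setting the amplitudes, frequencies, and phases would be
    trainable parameters; here they are plain lists (hypothetical setup).
    """
    return sum(
        a * math.sin(w * x + p)
        for a, w, p in zip(amplitudes, frequencies, phases)
    )


def kernel_from_sines(size, amplitudes, frequencies, phases):
    """Fill a size x size kernel by sampling the sine sum on [0, 1).

    Assumption: kernel entries are read off at evenly spaced positions.
    """
    n = size * size
    flat = [
        sum_of_sines(i / n, amplitudes, frequencies, phases) for i in range(n)
    ]
    return [flat[r * size:(r + 1) * size] for r in range(size)]


def kendall_tau(predicted, actual):
    """Kendall's tau-a: (concordant - discordant) / number of pairs."""
    concordant = discordant = 0
    for i, j in combinations(range(len(predicted)), 2):
        sign = (predicted[i] - predicted[j]) * (actual[i] - actual[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    pairs = len(predicted) * (len(predicted) - 1) // 2
    return (concordant - discordant) / pairs


if __name__ == "__main__":
    # A 3 x 3 kernel built from two sine terms (made-up parameters).
    kernel = kernel_from_sines(3, [1.0, 0.5], [2.0, 7.0], [0.0, 1.0])
    for row in kernel:
        print([round(v, 3) for v in row])

    # Made-up predictor scores versus made-up accuracies for three candidates.
    print(kendall_tau([0.2, 0.9, 0.5], [70.1, 93.4, 88.0]))
```

A ranking predictor is judged only on ordering, so a score of 1.0 means it sorts every pair of candidates the same way the measured accuracies do, regardless of the scale of its raw outputs.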

