Automated Search-Space Generation Neural Architecture Search

Published 25 May 2023 in cs.LG, cs.AI, and cs.CV (arXiv:2305.18030v3)

Abstract: To search for an optimal sub-network within a general deep neural network (DNN), existing neural architecture search (NAS) methods typically rely on handcrafting a search space beforehand. Such requirements make it challenging to extend them to general scenarios without significant human expertise and manual intervention. To overcome these limitations, we propose Automated Search-Space Generation Neural Architecture Search (ASGNAS), perhaps the first automated system to train general DNNs that cover all candidate connections and operations and produce high-performing sub-networks in a one-shot manner. Technologically, ASGNAS delivers three noticeable contributions to minimize human effort: (i) automated search space generation for general DNNs; (ii) a Hierarchical Half-Space Projected Gradient (H2SPG) that leverages the hierarchy and dependency within the generated search space to ensure network validity during optimization, and reliably produces a solution with both high performance and hierarchical group sparsity; and (iii) automated sub-network construction upon the H2SPG solution. Numerically, we demonstrate the effectiveness of ASGNAS on a variety of general DNNs, including RegNet, StackedUnets, SuperResNet, and DARTS, over benchmark datasets such as CIFAR10, Fashion-MNIST, ImageNet, STL-10, and SVHN. The sub-networks computed by ASGNAS achieve competitive or even superior performance compared to the starting full DNNs and other state-of-the-art methods. The library will be released at https://github.com/tianyic/only_train_once.


Summary

  • The paper introduces ASGNAS, a framework that automates search-space generation in NAS and constructs sub-networks in a one-shot manner.
  • It employs a graph-based algorithm together with the H2SPG optimizer to automatically identify and remove redundant network structures while keeping the remaining network valid.
  • Results show ASGNAS delivers compact sub-networks with competitive performance across diverse architectures and datasets.

The paper "Automated Search-Space Generation Neural Architecture Search" presents HSPG, a novel framework that pioneers the automation of search-space generation and optimization in Neural Architecture Search (NAS). Traditional NAS frameworks necessitate extensive human expertise to define and explore different neural architectures, which limits their scalability and adaptability across diverse tasks. HSPG, however, reduces manual intervention and enables end-to-end automatic sub-network generation within a given deep neural network (DNN).

The core contributions of this paper are threefold:

  1. Automated Search Space Generation: The authors develop a graph-based algorithm that automatically constructs a search space from any general DNN. The algorithm identifies structures whose removal does not disrupt the functionality of the remaining architecture and represents them as a segment graph, so that ASGNAS can efficiently delineate which operations and connections are amenable to removal or modification (a hedged sketch of this idea appears after this list).
  2. Hierarchical Half-Space Projected Gradient (H2SPG) Algorithm: H2SPG is introduced as an optimizer designed for hierarchical structured-sparsity problems in DNNs. It leverages the hierarchical dependencies within the generated search space to ensure that the resulting sub-network remains valid and performant, zeroing out redundant components while preserving critical ones and thus balancing performance against model compactness (see the H2SPG sketch below).
  3. Automated Sub-Network Construction: Building upon the solution obtained from H2SPG, ASGNAS automatically constructs a more compact sub-network. This step removes the structures whose parameter groups were driven to zero and reconfigures dependent modules so that the simplified network operates seamlessly (see the construction sketch below).
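
The following is a minimal, illustrative sketch of the search-space generation idea: given a DNN expressed as a directed acyclic graph of operations, an operation is a removal candidate if deleting it still leaves a path from every input to every output. The graph library, function names, and the node-level granularity are assumptions for illustration and do not reflect the released library's actual API.

```python
import networkx as nx

def find_removable_operations(dag: nx.DiGraph, inputs, outputs):
    """Hypothetical sketch: split a DNN graph into removable candidates and a skeleton.

    An operation is treated as removable if deleting it still leaves a directed
    path from every input node to every output node, i.e. an alternative branch
    keeps the network functional. Removable operations form the generated search
    space; the rest constitute the skeleton that must be kept.
    """
    removable, skeleton = [], []
    for node in dag.nodes:
        if node in inputs or node in outputs:
            skeleton.append(node)
            continue
        pruned = dag.copy()
        pruned.remove_node(node)
        still_valid = all(nx.has_path(pruned, s, t) for s in inputs for t in outputs)
        (removable if still_valid else skeleton).append(node)
    return removable, skeleton
```

In ASGNAS the removable structures are associated with groups of trainable variables, which the H2SPG optimizer can then drive to zero without invalidating the network.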
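
To make the H2SPG contribution concrete, here is a minimal sketch of a single half-space projected gradient style update on flat per-group parameter arrays. The saliency-based partitioning of groups, the exact projection threshold, and the `hierarchy_ok` callback guarding graph validity are simplifications introduced for illustration; the real optimizer is specified in the paper and implemented in the only_train_once library.

```python
import numpy as np

def h2spg_step(params, grads, candidates, hierarchy_ok, lr=0.1, lam=1e-3, eps=0.0):
    """Hypothetical half-space projected gradient step with a hierarchy check.

    params/grads: dict mapping group name -> flat weight / gradient array.
    candidates:   names of redundant candidate groups targeted for sparsity.
    hierarchy_ok: callable returning True if zeroing this group keeps the
                  network graph valid given the groups already zeroed.
    """
    new_params = {}
    for name, x in params.items():
        norm = np.linalg.norm(x) + 1e-12
        # Gradient step with a group-lasso style shrinkage term.
        trial = x - lr * (grads[name] + lam * x / norm)
        if name in candidates and hierarchy_ok(name):
            # Half-space projection: if the trial point leaves the half-space
            # {y : <y, x> >= eps * ||x||^2}, project the whole group to zero.
            if np.dot(trial, x) < eps * norm ** 2:
                trial = np.zeros_like(x)
        new_params[name] = trial
    return new_params
```

Groups that survive keep training as usual, while groups projected to zero mark the structures that the construction step removes.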
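
Finally, a brief sketch of the construction step under the same graph representation: operations whose parameter groups were entirely zeroed by H2SPG are deleted from the graph, and because only groups whose removal preserves validity are zeroed, the remaining graph still connects inputs to outputs. Reconciling the input/output dimensions of dependent modules is omitted here, and the helper names are again illustrative.

```python
def build_subnetwork(dag, zero_groups, group_of):
    """Hypothetical sketch: erase structures whose groups were driven to zero."""
    pruned = dag.copy()
    pruned.remove_nodes_from(
        node for node in list(dag.nodes) if group_of(node) in zero_groups
    )
    return pruned
```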

The numerical results presented in the paper illustrate the efficacy of ASGNAS across a variety of neural architectures, including RegNet, StackedUnets, SuperResNet, and DARTS, evaluated on benchmark datasets such as CIFAR10, Fashion-MNIST, ImageNet, STL-10, and SVHN. The sub-networks generated automatically by ASGNAS exhibit competitive, if not superior, performance relative to their larger parent networks and other state-of-the-art models. For instance, in the StackedUnets experiments, ASGNAS reduces the number of network parameters while slightly improving top-1 accuracy over the original architecture.

Implications and Future Directions

This work represents a significant step forward in the NAS domain, showing that search-space generation and optimization can be automated effectively. By minimizing the need for human intervention, ASGNAS opens up possibilities for deploying neural architecture search in applications where rapid adaptation to new or varied tasks is essential. Cybersecurity, autonomous driving, and real-time data processing are potential areas that could benefit from such automated NAS systems.

On the theoretical side, building hierarchy and dependency constraints into the optimizer provides a robust framework that could influence future algorithmic research in structured sparse optimization and neural network interpretability.

As for future developments, the scope for improving ASGNAS lies in further enhancing its computational efficiency and extending its applicability to broader classes of neural networks, including those with non-trainable operations. Moreover, since the current framework relies on a one-shot search, integrating multi-level optimization features, similar to existing approaches but automated, could extend its functionality. Addressing these areas would not only enhance the practical utility of ASGNAS but could also contribute to more general advances in neural architecture search.

In conclusion, the paper presents a thoughtful and significant advancement in the field of NAS, providing tools and paradigms that could reshape current practices in search space exploration and automated architecture design.
