Hierarchical Representations for Efficient Architecture Search: A Formal Overview
The paper "Hierarchical Representations for Efficient Architecture Search" presents research on a novel framework for neural architecture search (NAS). Authored by Hanxiao Liu, Karen Simonyan, Oriol Vinyals, Chrisantha Fernando, and Koray Kavukcuoglu, the objective is to improve the efficiency and effectiveness of NAS methodologies using a hierarchical representation of neural network architectures.
Introduction
The motivation for this research is rooted in the increasing complexity and computational demands of NAS. Traditional approaches to architecture design—often reliant on human expertise—are progressively constrained by the expanding design space of neural topologies. The central challenge addressed by this paper is to optimize neural architectures automatically in a computationally efficient manner. This is achieved through a hierarchical genetic representation scheme and an expressive search space that supports complex topologies, which collectively facilitate the discovery of high-performance architectures.
Hierarchical Representation
The paper introduces a hierarchical representation mechanism that leverages modularity, reminiscent of manually designed networks such as VGGNet, ResNet, and Inception. The network is structured as a hierarchy in which lower-level motifs, small computational building blocks, are recursively composed into more complex higher-level motifs, allowing flexible network topologies to be expressed compactly.
A hierarchical representation consists of multiple levels: the lowest level contains primitive operations (for example convolutions, pooling, and identity), and each motif at a higher level is a small graph whose edges are labelled with motifs from the level below. Because a motif is reused wherever it appears, a change to a low-level motif propagates to every higher-level motif that contains it, which keeps the representation compact while still covering complex architectures.
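To make the encoding concrete, here is a minimal Python sketch of one way such a hierarchy could be represented and expanded. It is an illustration rather than the authors' code, and the primitive set, node counts, and motif contents below are assumptions chosen for brevity.

```python
# Illustrative hierarchical genotype: a level-1 "motif" is just a primitive
# operation; a level-l motif is a small DAG whose edges are labelled with
# indices of motifs from level l-1. Expanding the top-level motif recursively
# yields a nested description built entirely from primitives.

PRIMITIVES = ["identity", "1x1 conv", "3x3 conv", "3x3 max-pool"]  # assumed set

def expand(level, index, hierarchy):
    """Recursively expand motif `index` at `level` into a nested expression."""
    if level == 1:
        return PRIMITIVES[index]
    motif = hierarchy[level][index]  # dict: (src, dst) edge -> lower-level motif index
    return {edge: expand(level - 1, op, hierarchy) for edge, op in motif.items()}

# hierarchy[l] lists the motifs available at level l (level 1 is the primitive set).
hierarchy = {
    2: [  # two level-2 motifs, each a 3-node DAG over primitives
        {(0, 1): 2, (1, 2): 3, (0, 2): 0},
        {(0, 1): 1, (1, 2): 2},
    ],
    3: [  # a single level-3 motif whose edges are labelled with level-2 motifs
        {(0, 1): 0, (1, 2): 1, (0, 2): 0},
    ],
}

architecture = expand(3, 0, hierarchy)  # fully expanded top-level motif
print(architecture)
```

In the paper's formulation, each node of a motif also merges the outputs of its incoming edges via a fixed merge operation; the sketch above captures only the structural composition of motifs, not the resulting computation.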
Evolutionary Architecture Search
An evolutionary algorithm is employed to explore the hierarchical search space efficiently. A genotype in this context is the full set of hierarchical graph structures, i.e. the motifs at every level, and it is evolved through mutation and selection operations. A mutation selects a level, a motif at that level, and an edge within that motif, then resamples the operation labelling that edge, which injects diversity throughout the hierarchy. The population is initially diversified by applying a large number of random mutations to ensure broad coverage of the search space. Tournament selection is then used repeatedly to identify promising genotypes, which are mutated, evaluated, and added back to the population.
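The following is a compact Python sketch of this evolutionary loop under simplifying assumptions: the genotype is reduced to a single motif (a dictionary of edge labels), and `evaluate` returns a random number in place of the validation accuracy obtained by training a model built from the genotype. The population size, mutation counts, and tournament fraction are illustrative values, not the paper's settings.

```python
import random

NUM_OPS = 4     # size of the operation set one level below (illustrative)
NUM_NODES = 4   # nodes per motif graph (illustrative)

def random_genotype():
    """A single motif: every edge (i, j) with i < j gets a random operation label."""
    return {(i, j): random.randrange(NUM_OPS)
            for i in range(NUM_NODES) for j in range(i + 1, NUM_NODES)}

def mutate(genotype):
    """Resample the label of one randomly chosen edge; the paper's mutation acts
    analogously on a randomly chosen edge of a randomly chosen motif and level."""
    child = dict(genotype)
    edge = random.choice(list(child))
    child[edge] = random.randrange(NUM_OPS)
    return child

def evaluate(genotype):
    """Stand-in fitness; in the paper this is the validation accuracy of a small
    model assembled from the genotype and trained for a fixed budget."""
    return random.random()

def tournament_step(population, fitness, tournament_frac=0.05):
    """One evolution step: sample a tournament, mutate the winner, keep the child."""
    k = max(2, int(len(population) * tournament_frac))
    contenders = random.sample(range(len(population)), k)
    winner = max(contenders, key=lambda i: fitness[i])   # tournament selection
    child = mutate(population[winner])
    population.append(child)                             # genotypes are never removed
    fitness.append(evaluate(child))

# Initialise, diversify with extra random mutations, then evolve.
population = [random_genotype() for _ in range(50)]
population = [mutate(mutate(g)) for g in population]      # crude stand-in for the
                                                          # paper's heavy initial mutation
fitness = [evaluate(g) for g in population]
for _ in range(200):
    tournament_step(population, fitness)
best = max(range(len(population)), key=lambda i: fitness[i])
```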
A significant aspect of this work is the asynchronous, distributed implementation of the evolutionary algorithm, which uses many parallel workers to train and evaluate candidate architectures and thereby expedites the search. The paper details the mutation operators and the mechanism for maintaining and asynchronously updating the population of genotypes.
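A single-machine analogue of that asynchronous scheme can be sketched as follows, reusing the `population`, `fitness`, `mutate`, and `evaluate` stand-ins from the previous sketch. Threads play the role of distributed workers and the lock stands in for whatever shared storage coordinates them in a real deployment; this is an assumption-laden illustration, not the paper's infrastructure.

```python
import random
import threading
from concurrent.futures import ThreadPoolExecutor

# Shared state: the population and fitness lists from the previous sketch,
# guarded by a lock so that workers can read and append concurrently.
lock = threading.Lock()

def worker(population, fitness, steps):
    for _ in range(steps):
        # Read step: tournament selection over the current population.
        with lock:
            k = max(2, len(population) // 20)          # ~5% tournament size
            contenders = random.sample(range(len(population)), k)
            parent = population[max(contenders, key=lambda i: fitness[i])]
        # Expensive step: mutate and evaluate outside the lock, so other
        # workers are not blocked while a candidate is being "trained".
        child = mutate(parent)
        score = evaluate(child)
        # Write step: append the result; genotypes are never removed.
        with lock:
            population.append(child)
            fitness.append(score)

with ThreadPoolExecutor(max_workers=8) as pool:
    futures = [pool.submit(worker, population, fitness, 25) for _ in range(8)]
    for f in futures:
        f.result()   # propagate any worker exceptions
```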
Experimental Setup and Results
The efficacy of the hierarchical search methodology is validated through extensive experiments on the CIFAR-10 and ImageNet datasets. The search is conducted over convolutional cells rather than entire networks; the discovered cells are then placed inside larger models for evaluation, as sketched below.
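As a rough illustration of that setup, the sketch below stacks a single searched cell into a larger classification network. The stem, number of stages, and downsampling points are assumptions made for illustration and do not reproduce the paper's exact macro-architectures.

```python
# Illustrative macro-architecture: the searched cell is reused as a building
# block, with strided convolutions between stages to reduce spatial resolution,
# followed by a standard classification head.

def build_macro_architecture(cell_name, num_stages=3):
    layers = ["3x3 conv (stem)"]
    for stage in range(num_stages):
        layers.append(f"{cell_name} (stage {stage + 1})")
        if stage < num_stages - 1:
            layers.append("3x3 conv, stride 2 (downsample)")
    layers += ["global average pooling", "linear classifier"]
    return layers

print(build_macro_architecture("searched cell"))
```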
Key results include:
- An error rate of 3.6% on CIFAR-10 and a 20.3% top-1 error when the discovered cell is transferred to ImageNet, in line with state-of-the-art NAS approaches.
- Random search within the hierarchical representation space reached accuracy within 0.3% of evolutionary search on CIFAR-10 and within 0.1% top-1 accuracy on ImageNet, while requiring substantially less search time.
The hierarchical representation also compared favourably against flat representations, yielding better-performing architectures particularly under parameter-constrained settings reflective of practical deployment. The search process itself demonstrated substantial gains in both efficiency and effectiveness over previously reported NAS methods.
Implications and Future Directions
The research in this paper underscores the utility of hierarchical representations in NAS, showcasing their ability to simplify the search process while maintaining high-performance outcomes. The modular approach not only reduces computational demands but also facilitates better generalization of discovered architectures across different tasks.
Potential future directions include:
- Incorporating weight inheritance techniques to further expedite the evolutionary process.
- Extending the hierarchical framework to other domains of deep learning beyond image classification.
- Exploring the integration of more sophisticated search algorithms within the hierarchical representation space to uncover even better architectures.
Conclusion
This paper presents a methodologically robust approach for hierarchical NAS that balances computational efficiency with architectural performance. The hierarchical representation framework, coupled with simple yet effective search algorithms, offers a scalable solution to the growing complexity of neural network design, setting a foundation for future advancements in automated architecture discovery.
The promising results evidenced in this paper advocate for broader adoption and further exploration of hierarchical methods in the evolving landscape of NAS research.