Overview of AutoML: A Survey of the State-of-the-Art
The paper "AutoML: A Survey of the State-of-the-Art," authored by Xin He, Kaiyong Zhao, and Xiaowen Chu, provides a comprehensive review of the current advancements in automated machine learning (AutoML). Given the progress and complexity of deep learning (DL) techniques in tasks ranging from image recognition to LLMing, the authors highlight AutoML's potential to democratize machine learning by reducing reliance on human expertise. This essay will summarize the paper's content, analyze its implications and results, and propose future developments in AutoML and AI.
Introduction
Deep learning has demonstrated significant success in numerous AI tasks; however, developing and optimizing DL systems is arduous, requiring substantial expertise and computational resources. AutoML emerges as a promising alternative by automating the entire ML pipeline, including data preparation, feature engineering, hyperparameter optimization (HPO), and neural architecture search (NAS). The paper aims to provide an up-to-date survey of AutoML methodologies and their performance on benchmark datasets such as CIFAR-10 and ImageNet.
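To make the HPO step concrete, the following is a minimal sketch of random-search hyperparameter optimization using scikit-learn's RandomizedSearchCV; the model, dataset, and search ranges are illustrative assumptions, not details from the paper.

```python
# Minimal HPO sketch via random search (settings are illustrative assumptions).
from scipy.stats import loguniform, randint
from sklearn.datasets import load_digits
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import RandomizedSearchCV

X, y = load_digits(return_X_y=True)

# Hyperparameter ranges chosen purely for illustration.
param_distributions = {
    "n_estimators": randint(50, 500),
    "max_depth": randint(2, 20),
    "max_features": loguniform(1e-2, 1.0),
}

search = RandomizedSearchCV(
    RandomForestClassifier(random_state=0),
    param_distributions,
    n_iter=20,   # number of sampled configurations
    cv=3,        # 3-fold cross-validation per configuration
    random_state=0,
)
search.fit(X, y)
print(search.best_params_, search.best_score_)
```

Random search is only one of the strategies the survey covers; Bayesian (SMBO-style) optimizers follow the same evaluate-and-refine loop but choose the next configuration with a surrogate model instead of sampling blindly.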
Components of AutoML
Data Preparation
Data preparation is the initial and critical phase in the ML pipeline. It encompasses data collection, data cleaning, and data augmentation:
- Data Collection: The process involves gathering datasets from open sources or generating synthetic data with techniques such as GANs.
- Data Cleaning: This involves the removal of noise and inconsistencies in the data to ensure robust model performance.
- Data Augmentation: Transformations such as affine warps, elastic distortions, and neural-based methods are applied to increase data diversity and improve model generalization; a minimal sketch follows this list.
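As a concrete illustration of the affine- and elastic-style augmentations mentioned above, here is a minimal sketch using torchvision; the specific transforms and parameter values are assumptions chosen for illustration.

```python
# Minimal augmentation sketch with torchvision (parameters are illustrative).
from torchvision import transforms

augment = transforms.Compose([
    # Affine transformation: random rotation, shift, and scale.
    transforms.RandomAffine(degrees=15, translate=(0.1, 0.1), scale=(0.9, 1.1)),
    # Elastic distortion (requires torchvision >= 0.13).
    transforms.ElasticTransform(alpha=50.0),
    transforms.RandomHorizontalFlip(p=0.5),
    transforms.ToTensor(),
])

# Applied per sample during training, e.g.: augmented = augment(pil_image)
```

Neural-based augmentation (e.g., GAN-generated samples or learned augmentation policies) goes beyond such fixed transforms but serves the same goal of enlarging and diversifying the training distribution.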
Feature Engineering
Feature engineering focuses on optimizing the features extracted or constructed from raw data to improve model performance. It involves:
- Feature Selection: Identifying and selecting relevant features using filter, wrapper, or embedded methods.
- Feature Construction: Building new features from existing ones to expose patterns the raw representation does not capture.
- Feature Extraction: Reducing feature dimensionality with techniques such as PCA or autoencoders (see the sketch after this list).
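The sketch below illustrates two of these steps, filter-based feature selection followed by PCA extraction, using scikit-learn; the dataset and the choices of k and component count are illustrative assumptions.

```python
# Minimal feature-engineering sketch (dataset and settings are illustrative).
from sklearn.datasets import load_breast_cancer
from sklearn.decomposition import PCA
from sklearn.feature_selection import SelectKBest, f_classif

X, y = load_breast_cancer(return_X_y=True)

# Filter-method selection: keep the 10 features with the highest ANOVA F-score.
X_selected = SelectKBest(f_classif, k=10).fit_transform(X, y)

# Feature extraction: project the selected features onto 5 principal components.
X_reduced = PCA(n_components=5).fit_transform(X_selected)
print(X.shape, X_selected.shape, X_reduced.shape)  # (569, 30) (569, 10) (569, 5)
```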
Model Generation
Model generation is divided into two components: search space and optimization methods:
- Search Space: This defines the set of candidate architectures the search can explore, including entire-structured, cell-based, hierarchical, and morphism-based spaces. A significant focus is given to NAS methodologies within these spaces.
- Optimization Methods: Techniques include evolutionary algorithms (EA), reinforcement learning (RL), gradient descent (GD), surrogate model-based optimization (SMBO), and hybrid methods. DARTS, ENAS, and ProxylessNAS are notable for improving search efficiency while maintaining model performance; a DARTS-style sketch follows this list.
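To convey the intuition behind gradient-based NAS, the following is a simplified DARTS-style mixed operation in PyTorch: each candidate operation is weighted by a softmax over learnable architecture parameters, so the discrete choice of operation becomes differentiable. This is a sketch of the core idea, not the authors' implementation, and the candidate operation set is an assumption.

```python
# Minimal DARTS-style mixed operation (simplified sketch, not the official code).
import torch
import torch.nn as nn
import torch.nn.functional as F

class MixedOp(nn.Module):
    def __init__(self, channels):
        super().__init__()
        # Candidate operations (a small illustrative subset of a real search space).
        self.ops = nn.ModuleList([
            nn.Conv2d(channels, channels, 3, padding=1),
            nn.Conv2d(channels, channels, 5, padding=2),
            nn.MaxPool2d(3, stride=1, padding=1),
            nn.Identity(),
        ])
        # Learnable architecture parameters (alpha), one per candidate op.
        self.alpha = nn.Parameter(torch.zeros(len(self.ops)))

    def forward(self, x):
        # Continuous relaxation: softmax-weighted sum of all candidate outputs.
        weights = F.softmax(self.alpha, dim=0)
        return sum(w * op(x) for w, op in zip(weights, self.ops))

op = MixedOp(channels=16)
out = op(torch.randn(1, 16, 32, 32))  # shape: (1, 16, 32, 32)
```

After the search converges, the operation with the largest alpha weight is typically retained at each edge to derive the final discrete architecture.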
Model Evaluation
Model evaluation is pivotal for validating the performance of generated models. Low-fidelity methods use proxies (e.g., smaller models or fewer training epochs) to estimate results faster, while weight sharing and surrogate models drastically reduce the need for extensive computation. Techniques such as early stopping accelerate evaluation by terminating unpromising candidates early.
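Below is a minimal early-stopping sketch in the spirit of this evaluation strategy; the patience threshold and the train/evaluate callables are illustrative assumptions.

```python
# Minimal early-stopping sketch (training-loop details are illustrative).
def train_with_early_stopping(model, train_one_epoch, evaluate,
                              max_epochs=100, patience=5):
    """Stop once validation accuracy fails to improve for `patience`
    consecutive epochs, discarding unpromising candidates early."""
    best_acc, epochs_without_improvement = 0.0, 0
    for epoch in range(max_epochs):
        train_one_epoch(model)
        acc = evaluate(model)  # validation accuracy in [0, 1]
        if acc > best_acc:
            best_acc, epochs_without_improvement = acc, 0
        else:
            epochs_without_improvement += 1
        if epochs_without_improvement >= patience:
            break  # candidate judged unpromising; stop spending compute on it
    return best_acc
```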
Numerical Results and Performance Evaluation
The paper outlines the performance of various NAS methodologies on the CIFAR-10 and ImageNet benchmarks. For instance, AmoebaNet and NASNet-A achieve superior accuracy but at extremely high computational cost. In contrast, more efficient algorithms such as ENAS and DARTS drastically reduce the search cost, measured in GPU days, while maintaining competitive accuracy.
Implications and Future Developments
The research outcomes have both theoretical and practical implications:
- Scalability and Efficiency: The transition from hand-crafted, heuristic-driven design to automated methods has vastly improved the efficiency of DL system development.
- Flexibility in Search Space: Highly flexible search spaces such as that of AutoML-Zero minimize human bias, suggesting avenues for genuinely novel designs.
- Joint Optimization: Jointly optimizing hyperparameters and architectures (HPO together with NAS) is a promising direction for tuning models holistically rather than stage by stage.
- Robustness and Reproducibility: Addressing real-world data imperfections and ensuring reproducibility are paramount for practical AutoML application.
- Meta-Learning and Continual Learning: Future AutoML systems must be capable of lifelong learning and adapting to ever-evolving tasks without forgetting past knowledge.
In conclusion, the survey points toward a rapidly advancing field with profound potential to reshape AI and DL practice. A complete, automated pipeline, as the authors envision, would make AI more accessible, efficient, and adaptable across domains.
A clear understanding of these components, evaluation strategies, and future directions will equip researchers to push the boundaries of AutoML and AI further.