Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

AVATAR -- Machine Learning Pipeline Evaluation Using Surrogate Model (2001.11158v2)

Published 30 Jan 2020 in cs.LG and stat.ML

Abstract: The evaluation of ML pipelines is essential during automatic ML pipeline composition and optimisation. The previous methods such as Bayesian-based and genetic-based optimisation, which are implemented in Auto-Weka, Auto-sklearn and TPOT, evaluate pipelines by executing them. Therefore, the pipeline composition and optimisation of these methods requires a tremendous amount of time that prevents them from exploring complex pipelines to find better predictive models. To further explore this research challenge, we have conducted experiments showing that many of the generated pipelines are invalid, and it is unnecessary to execute them to find out whether they are good pipelines. To address this issue, we propose a novel method to evaluate the validity of ML pipelines using a surrogate model (AVATAR). The AVATAR enables to accelerate automatic ML pipeline composition and optimisation by quickly ignoring invalid pipelines. Our experiments show that the AVATAR is more efficient in evaluating complex pipelines in comparison with the traditional evaluation approaches requiring their execution.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Tien-Dung Nguyen (7 papers)
  2. Tomasz Maszczyk (11 papers)
  3. Katarzyna Musial (36 papers)
  4. Bogdan Gabrys (42 papers)
  5. Marc-Andre Zöller (1 paper)
Citations (11)

Summary

We haven't generated a summary for this paper yet.