Fast Benchmarking of Asynchronous Multi-Fidelity Optimization on Zero-Cost Benchmarks
Abstract: While deep learning has celebrated many successes, its results often hinge on the meticulous selection of hyperparameters (HPs). However, the time-consuming nature of deep learning training makes HP optimization (HPO) a costly endeavor, slowing down the development of efficient HPO tools. While zero-cost benchmarks, which provide performance and runtime without actual training, offer a solution for non-parallel setups, they fall short in parallel setups as each worker must communicate its queried runtime to return its evaluation in the exact order. This work addresses this challenge by introducing a user-friendly Python package that facilitates efficient parallel HPO with zero-cost benchmarks. Our approach calculates the exact return order based on the information stored in file system, eliminating the need for long waiting times and enabling much faster HPO evaluations. We first verify the correctness of our approach through extensive testing and the experiments with 6 popular HPO libraries show its applicability to diverse libraries and its ability to achieve over 1000x speedup compared to a traditional approach. Our package can be installed via pip install mfhpo-simulator.
- Optuna: A next-generation hyperparameter optimization framework. In International Conference on Knowledge Discovery & Data Mining.
- HPO-B: A large-scale reproducible benchmark for black-box HPO based on OpenML. arXiv:2106.06257.
- DEHB: Evolutionary HyperBand for scalable, robust and efficient hyperparameter optimization. arXiv:2105.09821.
- JAHS-Bench-201: A foundation for research on joint architecture and hyperparameter search. In Advances in Neural Information Processing Systems Datasets and Benchmarks Track.
- Algorithms for hyper-parameter optimization. Advances in Neural Information Processing Systems.
- HEBO: Pushing the limits of sample-efficient hyper-parameter optimisation. Journal of Artificial Intelligence Research, 74.
- NAS-Bench-201: Extending the scope of reproducible neural architecture search. arXiv:2001.00326.
- Efficient benchmarking of hyperparameter optimizers via surrogates. In AAAI Conference on Artificial Intelligence.
- HPOBench: A collection of reproducible multi-fidelity benchmark problems for HPO. arXiv:2109.06716.
- BOHB: Robust and efficient hyperparameter optimization at scale. In International Conference on Machine Learning.
- Non-stochastic best arm identification and hyperparameter optimization. In International Conference on Artificial Intelligence and Statistics.
- Multi-fidelity Bayesian optimisation with continuous approximations. In International Conference on Machine Learning.
- Tuning hyperparameters without grad students: Scalable and robust Bayesian optimisation with Dragonfly. Journal of Machine Learning Research, 21.
- Tabular benchmarks for joint architecture and hyperparameter optimization. arXiv:1905.04970.
- HyperBand: A novel bandit-based approach to hyperparameter optimization. Journal of Machine Learning Research, 18.
- A system for massively parallel hyperparameter tuning. Machine Learning and Systems, 2.
- Hyper-Tune: towards efficient hyper-parameter tuning at scale. arXiv:2201.06834.
- Tune: A research platform for distributed model selection and training. arXiv:1807.05118.
- SMAC3: A versatile Bayesian optimization package for hyperparameter optimization. Journal of Machine Learning Research, 23.
- NAS-Bench-Suite: NAS evaluation is (now) surprisingly easy. arXiv:2201.13396.
- TrivialAugment: Tuning-free yet state-of-the-art data augmentation. In International Conference on Computer Vision.
- Multiobjective tree-structured Parzen estimator. Journal of Artificial Intelligence Research, 73.
- Multiobjective tree-structured Parzen estimator for computationally expensive optimization problems. In Genetic and Evolutionary Computation Conference.
- YAHPO Gym – an efficient multi-objective multi-fidelity benchmark for hyperparameter optimization. In International Conference on Automated Machine Learning.
- Syne Tune: A library for large scale hyperparameter tuning and reproducible research. In International Conference on Automated Machine Learning.
- On the importance of architectures and hyperparameters for fairness in face recognition. arXiv:2210.09943.
- On the importance of hyperparameters and data augmentation for self-supervised learning. arXiv:2207.07875.
- Watanabe, S. (2023a). Python wrapper for simulating multi-fidelity optimization on HPO benchmarks without any wait. arXiv:2305.17595.
- Watanabe, S. (2023b). Tree-structured Parzen estimator: Understanding its algorithm components and their roles for better empirical performance. arXiv:2304.11127.
- c-TPE: Generalizing tree-structured Parzen estimator with inequality constraints for continuous and categorical hyperparameter optimization. arXiv:2211.14411.
- c-TPE: tree-structured Parzen estimator with inequality constraints for expensive hyperparameter optimization. In International Joint Conference on Artificial Intelligence.
- On the importance of hyperparameter optimization for model-based reinforcement learning. In International Conference on Artificial Intelligence and Statistics.
- Auto-PyTorch: Multi-fidelity metalearning for efficient and robust AutoDL. Transactions on Pattern Analysis and Machine Intelligence.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.