Towards Leveraging AutoML for Sustainable Deep Learning: A Multi-Objective HPO Approach on Deep Shift Neural Networks
Abstract: Deep Learning (DL) has advanced various fields by extracting complex patterns from large datasets. However, the computational demands of DL models pose environmental and resource challenges. Deep shift neural networks (DSNNs) offer a solution by leveraging shift operations to reduce computational complexity at inference. Following the insights from standard DNNs, we are interested in leveraging the full potential of DSNNs by means of AutoML techniques. We study the impact of hyperparameter optimization (HPO) on maximizing DSNN performance while minimizing resource consumption. Since this is a multi-objective (MO) optimization problem with accuracy and energy consumption as potentially complementary objectives, we propose to combine state-of-the-art multi-fidelity (MF) HPO with multi-objective optimization. Experimental results demonstrate the effectiveness of our approach, yielding models with over 80% accuracy and low computational cost. Overall, our method accelerates efficient model development while enabling sustainable AI applications.
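To make the multi-objective framing concrete, the following is a minimal sketch of how non-dominated configurations could be selected when accuracy is maximized and energy consumption is minimized. It is not the paper's implementation: the function name `pareto_front` and the candidate values are hypothetical and chosen purely for illustration.

```python
from typing import List, Tuple


def pareto_front(points: List[Tuple[float, float]]) -> List[Tuple[float, float]]:
    """Return the non-dominated (accuracy, energy) pairs.

    Accuracy is maximized and energy is minimized. A point is dominated if
    another point is at least as good in both objectives and strictly
    better in at least one.
    """
    front = []
    for i, (acc_i, en_i) in enumerate(points):
        dominated = any(
            (acc_j >= acc_i and en_j <= en_i) and (acc_j > acc_i or en_j < en_i)
            for j, (acc_j, en_j) in enumerate(points)
            if j != i
        )
        if not dominated:
            front.append((acc_i, en_i))
    # Sort by energy so the trade-off curve reads from cheapest to most accurate.
    return sorted(front, key=lambda p: p[1])


# Hypothetical (accuracy, energy) results for five configurations;
# these numbers are made up and do not come from the paper.
candidates = [(0.78, 1.0), (0.82, 1.6), (0.80, 2.0), (0.85, 2.4), (0.70, 1.2)]
front = pareto_front(candidates)
print(front)
```

In this toy example, the configurations at (0.80, 2.0) and (0.70, 1.2) are dropped because another configuration is both more accurate and cheaper; the remaining points form the trade-off set from which a practitioner would choose.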