Papers
Topics
Authors
Recent
Search
2000 character limit reached

Towards Leveraging AutoML for Sustainable Deep Learning: A Multi-Objective HPO Approach on Deep Shift Neural Networks

Published 2 Apr 2024 in cs.LG and cs.AI | (2404.01965v2)

Abstract: Deep Learning (DL) has advanced various fields by extracting complex patterns from large datasets. However, the computational demands of DL models pose environmental and resource challenges. Deep shift neural networks (DSNNs) offer a solution by leveraging shift operations to reduce computational complexity at inference. Following the insights from standard DNNs, we are interested in leveraging the full potential of DSNNs by means of AutoML techniques. We study the impact of hyperparameter optimization (HPO) to maximize DSNN performance while minimizing resource consumption. Since this combines multi-objective (MO) optimization with accuracy and energy consumption as potentially complementary objectives, we propose to combine state-of-the-art multi-fidelity (MF) HPO with multi-objective optimization. Experimental results demonstrate the effectiveness of our approach, resulting in models with over 80\% in accuracy and low computational cost. Overall, our method accelerates efficient model development while enabling sustainable AI applications.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (24)
  1. Multi-Fidelity Multi-Objective bayesian optimization: An output space entropy search approach. AAAI, 34(06):10035–10043, 2020.
  2. Hyperparameter optimization: Foundations, algorithms, best practices, and open challenges. WIREs Data. Mining. Knowl. Discov., 13(2), 2023.
  3. K. Deb. Multi-objective Optimization, pp.  403–449. Springer US, 2014.
  4. DeepShift: Towards multiplication-less neural networks. In IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPR Workshops 2021, virtual, June 19-25, 2021, pp.  2359–2368, 2021.
  5. Deep residual learning for image recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27-30, 2016, pp.  770–778, 2016.
  6. MobileNets: Efficient convolutional neural networks for mobile vision applications. arXiv:1704.04861, 2017.
  7. Sequential model-based optimization for general algorithm configuration. In C. Coello Coello (ed.), Learning and Intelligent Optimization - 5th International Conference, LION 5, Rome, Italy, January 17-21, 2011. Selected Papers, volume 6683, pp.  507–523, 2011.
  8. Automated Machine Learning - Methods, Systems, Challenges. Springer Publishing Company, Incorporated, 2019.
  9. K. Jamieson and A. Talwalkar. Non-stochastic best arm identification and Hyperparameter Optimization. In A. Gretton and C. Robert (eds.), Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics (AISTATS’16), volume 51. Proceedings of Machine Learning Research, 2016.
  10. Efficient global optimization of expensive black-box functions. J. Glob. Optim., 13(4):455–492, 1998.
  11. Multi-fidelity gaussian process bandit optimisation. J. Artif. Intell. Res., 66:151–196, 2019.
  12. J. Knowles. ParEGO: a hybrid algorithm with on-line landscape approximation for expensive multiobjective optimization problems. IEEE Trans. Evol. Comput., 10(1):50–66, 2006.
  13. Learning multiple layers of features from tiny images. University of Toronto, 2009.
  14. Quantifying the carbon emissions of machine learning. arXiv:1910.09700, 2019.
  15. Learning IoT in edge: Deep learning for the internet of things with edge computing. IEEE Netw., 32(1):96–101, 2018.
  16. Hyperband: A novel bandit-based approach to hyperparameter optimization. J. Mach. Learn. Res., 18:185:1–185:52, 2017.
  17. SMAC3: A versatile bayesian optimization package for hyperparameter optimization. J. Mach. Learn. Res., 23:54:1–54:9, 2022.
  18. Energy usage reports: Environmental awareness as part of algorithmic accountability. arXiv:1911.08354, 2019.
  19. C. Edward Rasmussen and C. Williams. Gaussian Processes for Machine Learning. The MIT Press, 2006.
  20. Green AI. Commun. ACM, 63(12):54–63, 2020.
  21. Efficient processing of deep neural networks: A tutorial and survey. Proc. IEEE, 105(12):2295–2329, 2017.
  22. Towards green automated machine learning: Status quo and future directions. J. Artif. Intell. Res., 77:427–457, 2023.
  23. T. Wada and H. Hino. Bayesian optimization for multi-objective optimization and multi-point search. arXiv:1905.02370, 2019.
  24. Edge intelligence: Paving the last mile of artificial intelligence with edge computing. Proc. IEEE, 107(8):1738–1762, 2019.
Citations (1)

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 1 tweet with 0 likes about this paper.