PFGE: Parsimonious Fast Geometric Ensembling of DNNs (2202.06658v8)

Published 14 Feb 2022 in cs.LG and cs.AI

Abstract: Ensemble methods are commonly used to enhance the generalization performance of machine learning models. However, they present a challenge in deep learning systems due to the high computational overhead required to train an ensemble of deep neural networks (DNNs). Recent advancements such as fast geometric ensembling (FGE) and snapshot ensembles have addressed this issue by training model ensembles in the same amount of time as a single model. Nonetheless, these techniques still require additional memory at test time compared to single-model-based methods. In this paper, we propose a new method called parsimonious FGE (PFGE), which employs a lightweight ensemble of higher-performing DNNs generated through successive stochastic weight averaging (SWA) procedures. Our experimental results on the CIFAR-{10,100} and ImageNet datasets, across various modern DNN architectures, demonstrate that PFGE achieves 5x better memory efficiency than previous methods without compromising generalization performance. Our code is available at https://github.com/ZJLAB-AMMI/PFGE.
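The core idea described in the abstract can be sketched concretely: run several SWA cycles back to back, let each cycle's weight average become one lightweight ensemble member, and warm-start the next cycle from that average. The PyTorch sketch below is a minimal illustration assembled from the abstract's description, not the authors' implementation (their code is at the GitHub link above); the function names, the once-per-epoch averaging schedule, and all hyperparameter values are illustrative assumptions.

```python
import copy

import torch
import torch.nn.functional as F
from torch.optim.swa_utils import AveragedModel, update_bn


def pfge_sketch(model, train_loader, num_cycles=3, epochs_per_cycle=4, swa_lr=0.05):
    """Successive SWA cycles; each cycle's weight average is one ensemble member.

    A hypothetical sketch of the idea in the abstract, not the authors' code;
    num_cycles, epochs_per_cycle, and swa_lr are placeholder values.
    """
    ensemble = []
    for _ in range(num_cycles):
        optimizer = torch.optim.SGD(model.parameters(), lr=swa_lr, momentum=0.9)
        swa_model = AveragedModel(model)  # running average of visited weights
        for _ in range(epochs_per_cycle):
            for x, y in train_loader:
                optimizer.zero_grad()
                loss = F.cross_entropy(model(x), y)
                loss.backward()
                optimizer.step()
            swa_model.update_parameters(model)  # fold current weights into the average
        update_bn(train_loader, swa_model)  # recompute BatchNorm stats for the averaged weights
        ensemble.append(copy.deepcopy(swa_model))
        model.load_state_dict(swa_model.module.state_dict())  # warm-start the next cycle
    return ensemble


def ensemble_predict(ensemble, x):
    """Average the members' predictive distributions at test time."""
    with torch.no_grad():
        for member in ensemble:
            member.eval()
        probs = torch.stack([member(x).softmax(dim=-1) for member in ensemble])
    return probs.mean(dim=0)
```

At test time the members' softmax outputs are averaged, the standard rule in FGE-style ensembling; because only a handful of averaged models are kept rather than every snapshot, test-time memory stays close to that of a single model.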

Authors (3)
  1. Hao Guo (172 papers)
  2. Jiyong Jin (3 papers)
  3. Bin Liu (441 papers)
Citations (1)
