
AdaPlus: Integrating Nesterov Momentum and Precise Stepsize Adjustment on AdamW Basis (2309.01966v2)

Published 5 Sep 2023 in cs.LG

Abstract: This paper proposes an efficient optimizer called AdaPlus, which integrates Nesterov momentum and precise stepsize adjustment on the basis of AdamW. AdaPlus combines the advantages of AdamW, Nadam, and AdaBelief and, in particular, introduces no extra hyper-parameters. We perform extensive experimental evaluations on three machine learning tasks to validate the effectiveness of AdaPlus. The results show that AdaPlus (i) performs most comparably to (and even slightly better than) SGD with momentum among all the evaluated adaptive methods on image classification tasks, and (ii) outperforms other state-of-the-art optimizers on language modeling tasks while exhibiting high stability when training GANs. The code for AdaPlus will be made available at: https://github.com/guanleics/AdaPlus.
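The abstract does not spell out the update rule, but its named ingredients suggest the general shape of the algorithm. Below is a minimal, hypothetical sketch of an AdaPlus-style step assembled from the standard formulations of AdamW (decoupled weight decay), Nadam (a Nesterov look-ahead on the first moment), and AdaBelief (a second moment that tracks the deviation of the gradient from its running mean). The function name and exact arithmetic are assumptions for illustration; consult the paper and the linked repository for the authoritative algorithm.

# Hypothetical sketch of an AdaPlus-style update, assembled from the standard
# formulations of its stated ingredients; the real AdaPlus update may differ.
import numpy as np

def adaplus_step(theta, grad, m, s, t,
                 lr=1e-3, betas=(0.9, 0.999), eps=1e-8, weight_decay=1e-2):
    """One parameter update (t is the 1-based step count). Uses only Adam's
    usual hyper-parameters, mirroring the paper's no-extra-hyper-parameters claim."""
    beta1, beta2 = betas

    # AdamW: decoupled weight decay applied directly to the weights,
    # not folded into the gradient.
    theta = theta - lr * weight_decay * theta

    # First moment (momentum) and an AdaBelief-style second moment, which
    # tracks the variance of the gradient around its running mean.
    m = beta1 * m + (1.0 - beta1) * grad
    s = beta2 * s + (1.0 - beta2) * (grad - m) ** 2 + eps

    # Bias correction, as in Adam.
    m_hat = m / (1.0 - beta1 ** t)
    s_hat = s / (1.0 - beta2 ** t)

    # Nadam: Nesterov look-ahead on the bias-corrected first moment.
    m_nesterov = beta1 * m_hat + (1.0 - beta1) * grad / (1.0 - beta1 ** t)

    theta = theta - lr * m_nesterov / (np.sqrt(s_hat) + eps)
    return theta, m, s

# Toy usage: minimize f(x) = ||x||^2 (gradient 2x) with the sketch above.
x = np.array([1.0, -2.0]); m = np.zeros_like(x); s = np.zeros_like(x)
for t in range(1, 201):
    x, m, s = adaplus_step(x, 2.0 * x, m, s, t, lr=0.1, weight_decay=0.0)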
