The Hidden Power of Pure 16-bit Floating-Point Neural Networks (2301.12809v2)

Published 30 Jan 2023 in cs.LG, cs.AI, and cs.PF

Abstract: Lowering the precision of neural networks from the prevalent 32-bit format has long been considered harmful to performance, despite the gains in space and time. Many works propose techniques to implement half-precision neural networks, but none study pure 16-bit settings. This paper investigates the unexpected performance gain of pure 16-bit neural networks over 32-bit networks in classification tasks. We present extensive experimental results comparing the performance of various 16-bit neural networks favorably to that of their 32-bit counterparts. In addition, we provide a theoretical analysis of the efficiency of 16-bit models, coupled with empirical evidence to support it. Finally, we discuss situations in which low-precision training is indeed detrimental.
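
The "pure 16-bit" setting studied here keeps every weight and activation in float16, in contrast to mixed-precision training, which retains a float32 master copy of the weights. Below is a minimal sketch of that distinction, assuming a recent PyTorch build; the network shape and the make_mlp helper are illustrative only, not taken from the paper, and on builds without half-precision CPU kernels the models and tensors would need to be moved to a CUDA device.

```python
import torch
import torch.nn as nn

def make_mlp(dtype):
    # Small illustrative classifier; every parameter is stored in `dtype`.
    return nn.Sequential(
        nn.Linear(784, 256),
        nn.ReLU(),
        nn.Linear(256, 10),
    ).to(dtype=dtype)

fp32_model = make_mlp(torch.float32)  # conventional 32-bit baseline
fp16_model = make_mlp(torch.float16)  # "pure" 16-bit: weights and activations in half precision

x = torch.randn(8, 784)
print(fp32_model(x).dtype)         # torch.float32
print(fp16_model(x.half()).dtype)  # torch.float16

# Parameter storage roughly halves when moving from 32-bit to 16-bit.
bytes_fp32 = sum(p.numel() * p.element_size() for p in fp32_model.parameters())
bytes_fp16 = sum(p.numel() * p.element_size() for p in fp16_model.parameters())
print(f"fp32 params: {bytes_fp32} bytes, fp16 params: {bytes_fp16} bytes")
```

A full pure 16-bit training run would additionally keep gradients and optimizer state in float16, rather than only the forward pass shown here.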

Authors (3)
  1. Juyoung Yun (15 papers)
  2. Byungkon Kang (8 papers)
  3. Zhoulai Fu (7 papers)
Citations (1)
