Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Survey on Large Scale Neural Network Training (2202.10435v1)

Published 21 Feb 2022 in cs.LG and cs.AI

Abstract: Modern Deep Neural Networks (DNNs) require significant memory to store weight, activations, and other intermediate tensors during training. Hence, many models do not fit one GPU device or can be trained using only a small per-GPU batch size. This survey provides a systematic overview of the approaches that enable more efficient DNNs training. We analyze techniques that save memory and make good use of computation and communication resources on architectures with a single or several GPUs. We summarize the main categories of strategies and compare strategies within and across categories. Along with approaches proposed in the literature, we discuss available implementations.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (11)
  1. Julia Gusak (13 papers)
  2. Daria Cherniuk (6 papers)
  3. Alena Shilova (6 papers)
  4. Alexander Katrutsa (2 papers)
  5. Daniel Bershatsky (7 papers)
  6. Xunyi Zhao (2 papers)
  7. Lionel Eyraud-Dubois (9 papers)
  8. Oleg Shlyazhko (1 paper)
  9. Denis Dimitrov (27 papers)
  10. Ivan Oseledets (187 papers)
  11. Olivier Beaumont (10 papers)
Citations (10)

Summary

We haven't generated a summary for this paper yet.