Toward Cross-Layer Energy Optimizations in AI Systems (2404.06675v2)
Abstract: The "AI for Science, Energy, and Security" report from DOE outlines a significant focus on developing and optimizing artificial intelligence workflows for a foundational impact on a broad range of DOE missions. With the pervasive use of AI and ML tools and techniques, their energy efficiency is likely to become the gating factor for adoption. This is because generative AI (GenAI) models are massive energy hogs: for instance, training a 200-billion-parameter LLM at Amazon is estimated to have taken 11.9 GWh, enough to power more than a thousand average U.S. households for a year. Inference consumes even more energy, because a model trained once serves millions of users. Given this scale, high energy efficiency is key to addressing the power-delivery problem of constructing and operating new supercomputers and datacenters specialized for AI workloads. In that regard, we outline software- and architecture-level research challenges and opportunities, setting the stage for creating cross-layer energy optimizations in AI systems.
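As a back-of-envelope check on the household comparison in the abstract, the sketch below converts the 11.9 GWh training estimate into household-years. It assumes the EIA's figure of roughly 10,800 kWh of electricity per year for an average U.S. home; the constant names are illustrative and not taken from the paper.

```python
# Back-of-envelope check of the abstract's household comparison (illustrative only).
# Assumes ~10,800 kWh/year for an average U.S. household (EIA estimate);
# the exact value varies by year and region.

TRAINING_ENERGY_GWH = 11.9        # estimated energy to train the 200B-parameter LLM
HOUSEHOLD_KWH_PER_YEAR = 10_800   # assumed average annual U.S. household consumption

training_energy_kwh = TRAINING_ENERGY_GWH * 1e6  # 1 GWh = 1,000,000 kWh
households_powered = training_energy_kwh / HOUSEHOLD_KWH_PER_YEAR
print(f"~{households_powered:,.0f} households powered for one year")
```

Under these assumptions the script prints roughly 1,100 household-years, consistent with the abstract's "more than a thousand households" figure.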
Authors: Jae-Won Chung, Mosharaf Chowdhury, Nishil Talati