
AI Tax: The Hidden Cost of AI Data Center Applications (2007.10571v1)

Published 21 Jul 2020 in cs.DC and cs.PF

Abstract: Artificial intelligence and machine learning are experiencing widespread adoption in industry and academia. This has been driven by rapid advances in the applications and accuracy of AI through increasingly complex algorithms and models; this, in turn, has spurred research into specialized hardware AI accelerators. Given the rapid pace of advances, it is easy to forget that they are often developed and evaluated in a vacuum without considering the full application environment. This paper emphasizes the need for a holistic, end-to-end analysis of AI workloads and reveals the "AI tax." We deploy and characterize Face Recognition in an edge data center. The application is an AI-centric edge video analytics application built using popular open-source infrastructure and ML tools. Despite using state-of-the-art AI and ML algorithms, the application relies heavily on pre- and post-processing code. As AI-centric applications benefit from the acceleration promised by accelerators, we find they impose stresses on the hardware and software infrastructure: storage and network bandwidth become major bottlenecks with increasing AI acceleration. By specializing for AI applications, we show that a purpose-built edge data center can be designed for the stresses of accelerated AI at 15% lower TCO than one derived from homogeneous servers and infrastructure.

Authors (12)
  1. Daniel Richins (2 papers)
  2. Dharmisha Doshi (1 paper)
  3. Matthew Blackmore (1 paper)
  4. Aswathy Thulaseedharan Nair (1 paper)
  5. Neha Pathapati (1 paper)
  6. Ankit Patel (16 papers)
  7. Brainard Daguman (1 paper)
  8. Daniel Dobrijalowski (1 paper)
  9. Ramesh Illikkal (3 papers)
  10. Kevin Long (6 papers)
  11. David Zimmerman (1 paper)
  12. Vijay Janapa Reddi (78 papers)
Citations (4)