Papers
Topics
Authors
Recent
Search
2000 character limit reached

Clover: Toward Sustainable AI with Carbon-Aware Machine Learning Inference Service

Published 19 Apr 2023 in cs.DC | (2304.09781v2)

Abstract: This paper presents a solution to the challenge of mitigating carbon emissions from hosting large-scale ML inference services. ML inference is critical to modern technology products, but it is also a significant contributor to carbon footprint. We introduce Clover, a carbon-friendly ML inference service runtime system that balances performance, accuracy, and carbon emissions through mixed-quality models and GPU resource partitioning. Our experimental results demonstrate that Clover is effective in substantially reducing carbon emissions while maintaining high accuracy and meeting service level agreement (SLA) targets.

Citations (20)

Summary

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.