Sustainability in HPC: Vision and Opportunities (2309.13473v1)
Abstract: Tackling climate change by reducing and eventually eliminating carbon emissions is a significant milestone on the path toward establishing an environmentally sustainable society. As we transition into the exascale era, marked by an increasing demand and scale of HPC resources, the HPC community must embrace the challenge of reducing carbon emissions from designing and operating modern HPC systems. In this position paper, we describe challenges and highlight different opportunities that can aid HPC sites in reducing the carbon footprint of modern HPC systems.
- [n.d.]. Aurora. https://www.anl.gov/aurora. Accessed: 2023-08-3.
- [n.d.]. Average vs Marginal Carbon intensity. https://adgefficiency.com/average-vs-marginal-carbon-emissions/. Accessed: [2023-08-03].
- [n.d.]. Carbon emissions for Coal. https://www.eia.gov/tools/faqs/faq.php?id=74&t=11. Accessed: [2023-08-03].
- [n.d.]. Energy and Climate. https://tinyurl.com/2p8khujf. Accessed: 2023-02-13.
- [n.d.]. Enviornment Sustainability Report. https://www.microsoft.com/en-us/corporate-responsibility/sustainability/report. Accessed: [2023-08-3].
- [n.d.]. ExaMUC. https://examuc-procurement.lrz.de/. Accessed: [2023-08-03].
- [n.d.]. GHG Protocol. https://ghgprotocol.org/. Accessed: [2023-08-03].
- [n.d.]. Hawk. https://www.hlrs.de/solutions/systems/hpe-apollo-hawk. Accessed: [2023-08-03].
- [n.d.]. The HPC PowerStack. https://hpcpowerstack.github.io/. Accessed: [2023-08-03].
- [n.d.]. Juwels Booster. https://apps.fz-juelich.de/jsc/hps/juwels/booster-overview.html. Accessed: [2023-08-03].
- [n.d.]a. Lifetime of different systems at LRZ. https://doku.lrz.de/decommissioned-supermuc-10746010.html. Accessed: [2023-08-03].
- [n.d.]b. LRZ supercomputing center. https://www.lrz.de/. Accessed: [2023-08-03].
- [n.d.]. Paris Agreement. https://tinyurl.com/47v3xe2r. Accessed: 2023-02-13.
- [n.d.]a. Performance development. https://www.top500.org/statistics/perfdevel/. Accessed: [2023-08-03].
- [n.d.]. REGALE: Open Architecture for Exascale Supercomputers. https://regale-project.eu/. Accessed: [2023-08-03].
- [n.d.]. RISC-V. https://riscv.org/. Accessed: [2023-08-03].
- [n.d.]a. SuperMUC-NG. https://doku.lrz.de/hardware-of-supermuc-ng-11482553.html. Accessed: [2023-08-03].
- [n.d.]b. SuperMUC-NG Phase 2. https://www.lrz.de/presse/ereignisse/2021-05-04-SuperMUC-NG-Phase-2_ENG/. Accessed: [2023-08-03].
- [n.d.]. Sustainable High Performance Computing. https://enterpriseviewpoint.com/sustainable-high-performance-computing/. Accessed: 2023-08-3.
- [n.d.]b. TOP500. https://top500.org/lists/top500/list/2023/06/. Accessed: [2023-08-03].
- [n.d.]. World’s Climate goal. http://tiny.cc/xsp4vz. Accessed: 2023-02-13.
- Anders S. G. Andrae and Tomas Edler. 2015. On Global Electricity Usage of Communication Technology: Trends to 2030. Challenges 6 (2015), 117–157. https://api.semanticscholar.org/CorpusID:110716199
- On the Convergence of Malleability and the HPC PowerStack: Exploiting Dynamism in Over-Provisioned and Power-Constrained HPC Systems. In International Conference on High Performance Computing. Springer, 206–217. https://doi.org/10.1007/978-3-031-23220-6_14
- Countdown: a run-time library for performance-neutral energy saving in MPI applications. IEEE Trans. Comput. 70, 5 (2020), 682–695.
- Mohak Chadha. 2020. Adaptive Resource-Aware Batch Scheduling for HPC Systems. (2020). https://mediatum.ub.tum.de/doc/1539159/1539159.pdf
- Mohak Chadha and Michael Gerndt. 2019. Modelling DVFS and UFS for Region-Based Energy Aware Tuning of HPC Applications. In 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS). 805–814. https://doi.org/10.1109/IPDPS.2019.00089
- Extending slurm for dynamic resource-aware adaptive batch scheduling. In 2020 IEEE 27th International Conference on High Performance Computing, Data, and Analytics (HiPC). IEEE, 223–232. https://doi.org/10.1109/HiPC50609.2020.00036
- Infrastructure and api extensions for elastic execution of mpi applications. In Proceedings of the 23rd European MPI Users’ Group Meeting. 82–97. https://doi.org/10.1145/2966884.2966917
- Kali Frost et al. 2020. The use of decision support tools to accelerate the development of circular economic business models for hard disk drives and rare-earth magnets. MRS Energy & Sustainability 7 (2020).
- The international race towards Exascale in Europe. CCF Transactions on High Performance Computing 1, 1 (2019), 3–13.
- Ponte Vecchio: A multi-tile 3D stacked processor for exascale computing. In 2022 IEEE International Solid-State Circuits Conference (ISSCC), Vol. 65. IEEE, 42–44.
- ACT: Designing sustainable computer systems with an architectural carbon modeling tool. In ISCA. ACM, 784–799. https://doi.org/10.1145/3470496.3527408
- Chasing carbon: The elusive environmental footprint of computing. In HPCA. IEEE, 854–867. https://doi.org/10.1109/HPCA51647.2021.00076
- Towards dynamic resource management with MPI sessions and PMIx. In Proceedings of the 29th European MPI Users’ Group Meeting. 57–67. https://doi.org/10.1145/3555819.3555856
- David Keys. 2023. Sustainable High Performance Computing. https://enterpriseviewpoint.com/sustainable-high-performance-computing/. Accessed: 2023-08-10.
- Evaluation of Power Management Control on the Supercomputer Fugaku. In 2020 IEEE International Conference on Cluster Computing (CLUSTER). 484–493. https://doi.org/10.1109/CLUSTER49012.2020.00069
- Sustainable HPC: Modeling, Characterization, and Implications of Carbon Footprint in Modern HPC Systems. https://doi.org/10.48550/arXiv.2306.13177 arXiv:2306.13177 [cs.DC]
- Huaicheng Li et al. 2023a. Pond: CXL-Based Memory Pooling Systems for Cloud Platforms. In Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2. ACM, 574–587. https://doi.org/10.1145/3575693.3578835
- Myths and Misconceptions Around Reducing Carbon Embedded in Cloud Platforms. In HotCarbon (Boston, MA, USA). ACM, Article 7, 7 pages. https://doi.org/10.1145/3604930.3605717
- Hyrax: Fail-in-Place Server Operation in Cloud Platforms. In 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI 23). USENIX Association, 287–304. https://www.usenix.org/conference/osdi23/presentation/lyu
- Recalibrating global data center energy-use estimates. Science 367, 6481 (2020), 984–986.
- Sophie Mclean. 2023. The Environmental Impact of ChatGPT: A Call for Sustainable Practices In AI Development. https://earth.org/environmental-impact-chatgpt/. Accessed: 2023-08-03.
- From facility to application sensor data: modular, continuous and holistic monitoring with DCDB. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. 1–27. https://doi.org/10.1145/3295500.3356191
- On the Inevitability of Integrated HPC Systems and How They Will Change HPC System Operations. In Proceedings of the 11th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies. Article 2, 6 pages. https://doi.org/10.1145/3468044.3468046
- Climate Change 2022: Mitigation of Climate Change. Cambridge University Press, Cambridge, UK and New York, NY, USA. (2022). https://doi.org/10.1017/9781009157926
- Jaylen Wang et al. 2023. Peeling Back the Carbon Curtain: Carbon Optimization Challenges in Cloud Computing. In Proceedings of the 2nd Workshop on Sustainable Computer Systems. ACM, Article 8, 7 pages. https://doi.org/10.1145/3604930.3605718
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.