CXL and the Return of Scale-Up Database Engines (2401.01150v2)
Abstract: The trend toward specialized processing devices such as TPUs, DPUs, GPUs, and FPGAs has exposed the weaknesses of PCIe in interconnecting these devices and their hosts. Several attempts have been proposed to improve, augment, or downright replace PCIe, and more recently, these efforts have converged into a standard called Compute Express Link (CXL). CXL is already on version 2.0 in terms of commercial availability, but its potential to radically change the conventional server architecture has only just started to surface. For example, CXL can increase the bandwidth and quantity of memory available to any single machine beyond what that machine can originally provide, most importantly, in a manner that is fully transparent to software applications. We argue, however, that CXL can have a broader impact beyond memory expansion and deeply affect the architecture of data-intensive systems. In a nutshell, while the cloud favored scale-out approaches that grew in capacity by adding full servers to a rack, CXL brings back scale-up architectures that can grow by fine-tuning individual resources, all while transforming the rack into a large shared-memory machine. In this paper, we describe why such architectural transformations are now possible, how they benefit emerging heterogeneous hardware platforms for data-intensive systems, and the associated research challenges.
- Josep Aguilar-Saborit et al. 2020. POLARIS: The Distributed SQL Engine in Azure Synapse. Proc. VLDB Endow. 13, 12 (2020), 3204–3216. https://doi.org/10.14778/3415478.3415545
- Memory disaggregation: why now and what are the challenges. ACM SIGOPS Oper. Syst. Rev. 57, 1 (2023). https://doi.org/10.1145/3606557.3606563
- AMD. [n.d.]. Infinity Architecture: A New Era in Accelerated System Connectivity. https://www.amd.com/en/technologies/infinity-architecture.
- Jeff Barr. 2021. AQUA (Advanced Query Accelerator) – A Speed Boost for Your Amazon Redshift Queries. https://aws.amazon.com/blogs/aws/new-aqua-advanced-query-accelerator-for-amazon-redshift/
- Strong consistency is not hard to get: Two-Phase Locking and Two-Phase Commit on Thousands of Cores. Proc. VLDB Endow. 12, 13 (2019). https://doi.org/10.14778/3358701.3358702
- The End of Slow Networks: It’s Time for a Redesign. Proc. VLDB Endow. 9, 7 (2016). https://doi.org/10.14778/2904483.2904485
- Hardware Acceleration of Compression and Encryption in SAP HANA. Proc. of the VLDB Endowment 15, 12 (2022). https://doi.org/10.14778/3554821.3554822
- Hong-Tai Chou and David J. DeWitt. 1985. An Evaluation of Buffer Management Strategies for Relational Database Systems. In VLDB’85, Proceedings of 11th International Conference on Very Large Data Bases, August 21-23, 1985, Stockholm, Sweden. https://doi.org/10.1007/BF01840450
- CCIX Consortium. [n.d.]. An Introduction to CCIX. https://www.ccixconsortium.com/wp-content/uploads/2019/11/CCIX-White-Paper-Rev111219.pdf.
- CXL. [n.d.]. Consortium Member List. https://www.computeexpresslink.org/members.
- Bill Dally. 2011. Power, programmability, and granularity: The challenges of exascale computing. In 2011 IEEE International Test Conference. IEEE Computer Society, 12–12. https://doi.org/10.1109/IPDPS.2011.420
- Domain-specific hardware accelerators. Commun. ACM 63, 7 (2020), 48–57. https://doi.org/10.1145/3361682
- Accelerating Raw Data Analysis with the ACCORDA Software and Hardware Architecture. Proceedings of the VLDB Endowment 12, 11 (2019). https://doi.org/10.14778/3342263.3342634
- GenZ. [n.d.]. GenZ Archive. https://www.computeexpresslink.org/projects-3.
- Direct Access, High-Performance Memory Disaggregation with DirectCXL (USENIX ATC’22’). https://www.usenix.org/conference/atc22/presentation/gouk
- Farview: Disaggregated Memory with Operator Off-loading for Database Engines. In 12th Conference on Innovative Data Systems Research, CIDR 2022, Chaminade, CA, USA, January 9-12, 2022. https://www.cidrdb.org/cidr2022/papers/p11-korolija.pdf
- Database Kernels: Seamless Integration of Database Systems and Fast Storage via CXL. In 14th Conference on Innovative Data Systems Research, CIDR 2024, Chaminade, CA, USA, January 9-12, 2022. http://exascale.info/assets/pdf/lee2024cidr.pdf
- Pond: CXL-Based Memory Pooling Systems for Cloud Platforms (ASPLOS 2023). https://doi.org/10.1145/3575693.3578835
- Pump Up the Volume: Processing Large Data on GPUs with Fast Interconnects (SIGMOD’20). https://doi.org/10.1145/3318464.3389705
- Optimizing Database Architecture for the New Bottleneck: Memory Access. The VLDB Journal 9, 3 (dec 2000). https://doi.org/10.1007/s007780000031
- TPP: Transparent Page Placement for CXL-Enabled Tiered-Memory (ASPLOS 2023). https://doi.org/10.1145/3582016.3582063
- Fabio Maschi and Gustavo Alonso. 2023. The Difficult Balance Between Modern Hardware and Conventional CPUs (DaMoN’23). https://doi.org/10.1145/3592980.3595314
- Micron. [n.d.]. Flexible memory capacity expansion for data intensive workloads. https://www.micron.com/solutions/server/cxl.
- NVidia. [n.d.]a. NVidia ConnectX-7 400G Ethernet. https://www.nvidia.com/content/dam/en-zz/Solutions/networking/ethernet-adapters/connectx-7-datasheet-Final.pdf.
- NVidia. [n.d.]b. NVLink and NVSwitch: The building blocks of advanced multi-GPU communication—within and between servers. https://www.nvidia.com/en-us/data-center/nvlink/.
- NVIDIAG. [n.d.]. NVIDIA Opens NVLink for Custom Silicon Integration. https://nvidianews.nvidia.com/news/nvidia-opens-nvlink-for-custom-silicon-integration.
- OpenCAPI. [n.d.]. OpenCAPI Archive. https://www.computeexpresslink.org/occ-archive.
- Oracle. [n.d.]. Why Oracle Exadata platforms are the best for Oracle Database. https://www.oracle.com/engineered-systems/exadata/.
- Oracle. 2015. SPARC S7 Processor. https://www.oracle.com/a/ocom/docs/servers/sparc/sparc-s7-processor-ds-3042417.pdf.
- Samsung. [n.d.]. Samsung Electronics Introduces Industry’s First 512GB CXL Memory Module. https://news.samsung.com/global/samsung-electronics-introduces-industrys-first-512gb-cxl-memory-module.
- Debendra Das Sharma. [n.d.]. Compute Express Link. https://docs.wixstatic.com/ugd/0c1418_d9878707bbb7427786b70c3c91d5fbd1.pdf.
- PCIe SIG. [n.d.]. Announcing the PCIe 7.0 Specification. https://pcisig.com/blog/announcing-pcie%C2%AE-70-specification-doubling-data-rate-128-gts-next-generation-computing.
- Micro-Architectural Analysis of In-Memory OLTP (SIGMOD ’16). https://doi.org/10.1145/2882903.2882916
- A Primer on Memory Consistency and Cache Coherence. Morgan & Claypool Publishers. https://doi.org/10.1007/978-3-031-01764-3
- Demystifying CXL Memory with Genuine CXL-Ready Systems and Devices (MICRO’23). https://doi.org/10.1145/3613424.3614256
- Tailwind: Fast and Atomic RDMA-Based Replication (USENIX ATC’18). https://www.usenix.org/conference/atc18/presentation/taleb
- Neil C. Thompson and Svenja Spanuth. 2021. The Decline of Computers as a General Purpose Technology. Commun. ACM 64, 3 (feb 2021). https://doi.org/10.1145/3430936
- Alexandre Verbitski et al. 2018. Amazon Aurora: On Avoiding Distributed Consensus for I/Os, Commits, and Membership Changes (SIGMOD ’18). https://doi.org/10.1145/3183713.3196937
- Amazon Aurora: Design Considerations for High Throughput Cloud-Native Relational Databases (SIGMOD’17). https://doi.org/10.1145/3035918.3056101
- The Case for Distributed Shared-Memory Databases with RDMA-Enabled Memory Disaggregation. Proc. VLDB Endow. 16, 1 (2022). https://doi.org/10.14778/3561261.3561263
- The Yin and Yang of Processing Data Warehousing Queries on GPU Devices. Proceedings of the VLDB Endowment 6, 10 (2013), 817–828. https://doi.org/10.14778/2536206.2536210
- Rethinking Database High Availability with RDMA Networks. Proc. VLDB Endow. 12, 11 (2019). https://doi.org/10.14778/3342263.3342639
- Redy: Remote Dynamic Memory Cache. Proc. VLDB Endow. 15, 4 (2021). https://doi.org/10.14778/3503585.3503587
- Mark Zhao et al. 2022. Understanding data storage and ingestion for large-scale deep recommendation model training: industrial product (ISCA’22). https://doi.org/10.1145/3470496.3533044
- Alberto Lerner (1 paper)
- Gustavo Alonso (45 papers)