A Comprehensive Survey on SmartNICs: Architectures, Development Models, Applications, and Research Directions (2405.09499v1)
Abstract: The end of Moore's Law and Dennard Scaling has slowed processor improvements in the past decade. While multi-core processors have improved performance, they are limited by the application's level of parallelism, as prescribed by Amdahl's Law. This has led to the emergence of domain-specific processors that specialize in a narrow range of functions. Smart Network Interface Cards (SmartNICs) can be seen as an evolutionary technology that combines heterogeneous domain-specific processors and general-purpose cores to offload infrastructure tasks. Despite the impressive advantages of SmartNICs and their importance in modern networks, the literature has been missing a comprehensive survey. To this end, this paper provides a background encompassing an overview of the evolution of NICs from basic to SmartNICs, describing their architectures, development environments, and advantages over legacy NICs. The paper then presents a comprehensive taxonomy of applications offloaded to SmartNICs, covering network, security, storage, and machine learning functions. Challenges associated with SmartNIC development and deployment are discussed, along with current initiatives and open research issues.