
ORCA: A Network and Architecture Co-design for Offloading μs-scale Datacenter Applications (2203.08906v2)

Published 16 Mar 2022 in cs.AR, cs.DC, and cs.NI

Abstract: Diverse solutions, including SmartNIC-based ones, have been proposed in response to the "datacenter tax" and "killer microseconds" problems facing datacenter applications. Nonetheless, they often suffer from the high overhead of communication over network and/or PCIe links. To tackle the limitations of the current solutions, this paper proposes ORCA, a holistic network and architecture co-design solution that leverages current RDMA and emerging cache-coherent off-chip interconnect technologies. Specifically, ORCA consists of four hardware and software components: (1) unified abstraction of inter- and intra-machine communications managed by one-sided RDMA write and cache-coherent memory write; (2) efficient notification of requests to accelerators assisted by cache coherence; (3) cache-coherent accelerator architecture directly processing requests received by the NIC; and (4) adaptive device-to-host data transfer for modern server memory systems consisting of both DRAM and NVM, exploiting state-of-the-art features in CPUs and PCIe. We prototype ORCA with a commercial system and evaluate three popular datacenter applications: an in-memory key-value store, a chain-replication-based distributed transaction system, and deep learning recommendation model inference. The evaluation shows that ORCA provides 30.1% to 69.1% lower latency, up to 2.5x higher throughput, and 3x higher power efficiency than the current state-of-the-art solutions.

Authors (10)
  1. Yifan Yuan (14 papers)
  2. Jinghan Huang (5 papers)
  3. Yan Sun (309 papers)
  4. Tianchen Wang (17 papers)
  5. Jacob Nelson (12 papers)
  6. Dan R. K. Ports (5 papers)
  7. Yipeng Wang (20 papers)
  8. Ren Wang (72 papers)
  9. Charlie Tai (6 papers)
  10. Nam Sung Kim (30 papers)
Citations (2)
