
Compiling Halide Programs to Push-Memory Accelerators (2105.12858v1)

Published 26 May 2021 in cs.AR

Abstract: Image processing and machine learning applications benefit tremendously from hardware acceleration, but existing compilers target either FPGAs, which sacrifice power and performance for flexible hardware, or ASICs, which rapidly become obsolete as applications change. Programmable domain-specific accelerators have emerged as a promising middle ground between these two extremes, but such architectures have traditionally been difficult compiler targets. The main obstacle is that these accelerators often use a different memory abstraction than CPUs and GPUs: push memories that send a data stream from one computation kernel to other kernels, possibly reordered. To address the compilation challenges caused by push memories, we propose that the representation of memory in the middle and backend of the compiler be altered to combine storage with address generation and control logic in a single structure -- a unified buffer. We show that this compiler abstraction can be implemented efficiently on a programmable accelerator, and design a memory mapping algorithm that combines polyhedral analysis and software vectorization techniques to target our accelerator. Our evaluation shows that the compiler supports programmability while maintaining high performance. It can compile a wide range of image processing and machine learning applications to our accelerator with 4.7x better runtime and 4.3x better energy efficiency compared to an FPGA.
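
To make the push-memory idea concrete, here is a minimal Python sketch of a buffer that bundles storage with its own address generation and schedule, so that it accepts an input stream and pushes out a reordered stream without being asked for addresses. It is an illustration only, not the paper's unified buffer or its compiler output: the names `AffineGenerator`, `UnifiedBuffer`, and `make_transpose_buffer`, the affine form of the generators, and the 2x3 transpose example are all assumptions chosen for brevity.

```python
from itertools import product


class AffineGenerator:
    """Hypothetical helper: walks a rectangular iteration domain and yields
    offset + sum(stride_k * index_k) for each point, outermost loop first."""

    def __init__(self, extents, strides, offset=0):
        self.extents = extents
        self.strides = strides
        self.offset = offset

    def __iter__(self):
        for idx in product(*(range(e) for e in self.extents)):
            yield self.offset + sum(s * i for s, i in zip(self.strides, idx))


class UnifiedBuffer:
    """Illustrative push memory: storage plus the address and schedule
    generators that decide when each write and read happens."""

    def __init__(self, capacity, write_addr, write_time, read_addr, read_time):
        self.mem = [None] * capacity
        # Map each scheduled cycle to the address accessed in that cycle.
        self.writes = dict(zip(write_time, write_addr))
        self.reads = dict(zip(read_time, read_addr))

    def run(self, stream):
        """Consume the input stream and return the pushed output stream."""
        stream = iter(stream)
        out = []
        last_cycle = max(max(self.writes), max(self.reads))
        for cycle in range(last_cycle + 1):
            if cycle in self.writes:  # a write lands before a same-cycle read
                self.mem[self.writes[cycle]] = next(stream)
            if cycle in self.reads:
                out.append(self.mem[self.reads[cycle]])
        return out


def make_transpose_buffer():
    """Buffer that takes a 2x3 tile in row-major order and pushes it out
    in column-major order, starting reads two cycles after the first write."""
    return UnifiedBuffer(
        capacity=6,
        write_addr=AffineGenerator((2, 3), (3, 1)),       # 0,1,2,3,4,5
        write_time=AffineGenerator((6,), (1,)),            # cycles 0..5
        read_addr=AffineGenerator((3, 2), (1, 3)),         # 0,3,1,4,2,5
        read_time=AffineGenerator((6,), (1,), offset=2),   # cycles 2..7
    )


if __name__ == "__main__":
    print(make_transpose_buffer().run(range(6)))
```

In this sketch the consumer never issues an address; the read schedule is fixed in advance, which is why the read offset has to be chosen so that every read follows the write of the same location.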

Authors (11)
  1. Qiaoyi Liu (4 papers)
  2. Dillon Huff (1 paper)
  3. Jeff Setter (2 papers)
  4. Maxwell Strange (3 papers)
  5. Kathleen Feng (2 papers)
  6. Kavya Sreedhar (3 papers)
  7. Ziheng Wang (48 papers)
  8. Keyi Zhang (8 papers)
  9. Mark Horowitz (21 papers)
  10. Priyanka Raina (11 papers)
  11. Fredrik Kjolstad (26 papers)
Citations (4)