Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
167 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

An Efficient and Scalable Clocking Assignment Algorithm for Multi-Threaded Multi-Phase Single Flux Quantum Circuits (2401.06411v1)

Published 12 Jan 2024 in cs.ET

Abstract: A key distinguishing feature of single flux quantum (SFQ) circuits is that each logic gate is clocked. This feature forces the introduction of path-balancing flip-flops to ensure proper synchronization of inputs at each gate. This paper proposes a polynomial time complexity approximation algorithm for clocking assignments that minimizes the insertion of path balancing buffers for multi-threaded multi-phase clocking of SFQ circuits. Existing SFQ multi-phase clocking solutions have been shown to effectively reduce the number of required buffers inserted while maintaining high throughput, however, the associated clock assignment algorithms have exponential complexity and can have prohibitively long runtimes for large circuits, limiting the scalability of this approach. Our proposed algorithm is based on a linear program (LP) that leads to solutions that are experimentally on average within 5% of the optimum and helps accelerate convergence towards the optimal integer linear program (ILP) based solution. The improved LP and ILP runtimes permit multi-phase clocking schemes to scale to larger SFQ circuits than previous state of the art clocking assignment methods. We further extend the existing algorithm to support fanout sharing of the added buffers, saving, on average, an additional 10% of the inserted DFFs. Compared to traditional full path balancing (FPB) methods across 10 benchmarks, our enhanced LP saves 79.9%, 87.8%, and 91.2% of the inserted buffers for 2, 3, and 4 clock phases respectively. Finally, we extend this approach to the generation of circuits that completely mitigate potential hold-time violations at the cost of either adding on average less than 10% more buffers (for designs with 3 or more clock phases) or, more generally, adding a clock phase and thereby reducing throughput.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (20)
  1. T. N. Theis and H.-S. P. Wong, “The end of moore’s law: A new beginning for information technology,” Computing in Science & Engineering, vol. 19, no. 2, pp. 41–50, 2017.
  2. K. Likharev and V. Semenov, “RSFQ logic/memory family: a new josephson-junction technology for sub-terahertz-clock-frequency digital systems,” IEEE TASC, vol. 1, no. 1, pp. 3–28, 1991.
  3. S. Tolpygo, “Superconductor digital electronics: Scalability and energy efficiency issues (review article),” Low Temperature Physics, vol. 42, pp. 361–379, 05 2016.
  4. D. K. Brock, E. K. Track, and J. M. Rowell, “Superconductor ICs: the 100-GHz second generation,” IEEE Spectrum, vol. 37, pp. 40–46, 2000.
  5. X. Li, M. Pan, T. Liu, and P. A. Beerel, “Multi-phase clocking for multi-threaded gate-level-pipelined superconductive logic,” in 2022 IEEE Computer Society Annual Symposium on VLSI (ISVLSI), 2022, pp. 62–67.
  6. X. Li, “Multi-phase clocking and hold time fixing for single flux quantum circuits,” Ph.D. dissertation, University of Southern California, 2022.
  7. P. Bunyk, K. Likharev, and D. Zinoviev, “RSFQ technology: Physics and devices,” International Journal of High Speed Electronics and Systems, vol. 11, 03 2001.
  8. I. Vernik, Q. Herr, K. Gaij, and M. Feldman, “Experimental investigation of local timing parameter variations in RSFQ circuits,” IEEE TASC, vol. 9, no. 2, pp. 4341–4344, 1999.
  9. K. Gaj, E. G. Friedman, and M. J. Feldman, “Timing of multi-gigahertz rapid single flux quantum digital circuits,” Journal of VLSI signal processing systems for signal, image and video technology, vol. 16, pp. 247–276, 1997.
  10. S. Kim, K. Lee, W. Cho, J. H. Cheon, and R. A. Rutenbar, “FPGA-based accelerators of fully pipelined modular multipliers for homomorphic encryption,” in 2019 International Conference on ReConFigurable Computing and FPGAs (ReConFig), 2019, pp. 1–8.
  11. V. Shoup, “NTL: A library for doing number theory,” http://www.shoup.net/ntl/.
  12. O. A. Mukhanov, “Energy-efficient single flux quantum technology,” IEEE TASC, vol. 21, no. 3, pp. 760–769, 2011.
  13. K. Ishida, M. Tanaka, I. Nagaoka, T. Ono, S. Kawakami, T. Tanimoto, A. Fujimaki, and K. Inoue, “32 GHz 6.5 mW gate-level-pipelined 4-bit processor using superconductor single-flux-quantum logic,” in 2020 IEEE Symposium on VLSI Circuits, 2020, pp. 1–2.
  14. Q. Herr, A. Y. Herr, O. T. Oberg, and A. Ioannidis, “Ultra-low-power superconductor logic,” Journal of Applied Physics, vol. 109, p. 103903, 2011.
  15. G. Pasandi and M. Pedram, “An efficient pipelined architecture for superconducting single flux quantum logic circuits utilizing dual clocks,” IEEE TASC, vol. 30, no. 2, pp. 1–12, 2020.
  16. M. Li, B. Zhang, and M. Pedram, “Striking a good balance between area and throughput of RSFQ circuits containing feedback loops,” IEEE TASC, vol. 33, no. 5, pp. 1–6, 2023.
  17. K. A. Sakallah, T. N. Mudge, and O. A. Olukotun, “Optimal Clocking of Synchronous Systems,” in TAU Workshop, 1990, pp. 1–21.
  18. R. N. Tadros, A. Fayyazi, M. Pedram, and P. A. Beerel, “SystemVerilog modeling of SFQ and AQFP circuits,” IEEE TASC, vol. 30, no. 2, pp. 1–13, 2020.
  19. R. Saito, C. L. Ayala, and N. Yoshikawa, “Buffer reduction via N-phase clocking in adiabatic quantum-flux-parametron benchmark circuits,” IEEE TASC, vol. 31, no. 6, pp. 1–8, 2021.
  20. R. Bairamkulov, T. Jabbari, and E. G. Friedman, “QuCTS—single-flux quantum clock tree synthesis,” IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 41, no. 10, pp. 3346–3358, 2022.
Citations (2)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com