
Accelerating Seed Location Filtering in DNA Read Mapping Using a Commercial Compute-in-SRAM Architecture (2401.11685v1)

Published 22 Jan 2024 in cs.AR and q-bio.GN

Abstract: DNA sequence alignment is an important workload in computational genomics. Reference-guided DNA assembly involves aligning many read sequences against candidate locations in a long reference genome. To reduce the computational load of this alignment, candidate locations can be pre-filtered using simpler alignment algorithms such as edit distance. Prior work has explored accelerating filtering on simulated compute-in-DRAM, motivated by the massive parallelism of compute-in-memory architectures. In this paper, we present work-in-progress on accelerating filtering using a commercial compute-in-SRAM accelerator. We leverage the recently released Gemini accelerator platform from GSI Technology, which is, to our knowledge, the first commercial-scale compute-in-SRAM system. We accelerate Myers' bit-parallel edit distance algorithm, achieving an average speedup of 14.1x over single-core CPU performance. Individual query/candidate alignments achieve speedups of up to 24.1x. These early results suggest this novel architecture is well-suited to accelerating the filtering step of sequence-to-sequence DNA alignment.
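
For readers unfamiliar with the filtering kernel named in the abstract, the following is a minimal CPU-side sketch of a Myers-style bit-parallel edit distance, written in the global-distance formulation commonly attributed to Hyyrö. It is not the paper's Gemini implementation. The function names `myers_edit_distance` and `passes_filter`, and the threshold parameter `max_edits`, are assumptions introduced here for illustration only.

```python
def myers_edit_distance(read: str, candidate: str) -> int:
    """Global edit distance between `read` and `candidate`.

    One bit per read character encodes the vertical +1/-1 differences
    of a dynamic-programming column, so each candidate character is
    processed with a constant number of bitwise operations.
    """
    m = len(read)
    if m == 0:
        return len(candidate)

    mask = (1 << m) - 1      # keep only the m low bits
    high = 1 << (m - 1)      # bit for the last row of the DP column

    # peq[c] has bit i set when read[i] == c
    peq = {}
    for i, c in enumerate(read):
        peq[c] = peq.get(c, 0) | (1 << i)

    pv, mv = mask, 0         # first column: every vertical delta is +1
    score = m                # D[m][0] = m

    for c in candidate:
        eq = peq.get(c, 0)
        xv = eq | mv
        xh = (((eq & pv) + pv) ^ pv) | eq
        ph = mv | (~(xh | pv) & mask)   # horizontal +1 deltas
        mh = pv & xh                    # horizontal -1 deltas

        # Track the bottom-row value D[m][j].
        if ph & high:
            score += 1
        elif mh & high:
            score -= 1

        # Shift in a 1: for global distance the top row grows by +1.
        ph = ((ph << 1) | 1) & mask
        mh = (mh << 1) & mask
        pv = mh | (~(xv | ph) & mask)
        mv = ph & xv

    return score


def passes_filter(read: str, candidate: str, max_edits: int) -> bool:
    """Hypothetical pre-alignment filter: keep a candidate location
    only if its edit distance to the read is within `max_edits`."""
    return myers_edit_distance(read, candidate) <= max_edits


if __name__ == "__main__":
    print(myers_edit_distance("ACGTACGT", "ACGTTCGT"))
    print(passes_filter("ACGTACGT", "ACGTTCGT", max_edits=2))
```

Python integers are arbitrary precision, so this sketch handles reads of any length in a single bit vector; a fixed-width implementation would need to split the read across machine words.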

