Accelerating Seed Location Filtering in DNA Read Mapping Using a Commercial Compute-in-SRAM Architecture (2401.11685v1)
Abstract: DNA sequence alignment is an important workload in computational genomics. Reference-guided DNA assembly involves aligning many read sequences against candidate locations in a long reference genome. To reduce the computational load of this alignment, candidate locations can be pre-filtered using simpler alignment algorithms like edit distance. Prior work has explored accelerating filtering on simulated compute-in-DRAM, due to the massive parallelism of compute-in-memory architectures. In this paper, we present work-in-progress on accelerating filtering using a commercial compute-in-SRAM accelerator. We leverage the recently released Gemini accelerator platform from GSI Technology, which is the first, to our knowledge, commercial-scale compute-in-SRAM system. We accelerate the Myers' bit-parallel edit distance algorithm, producing average speedups of 14.1x over single-core CPU performance. Individual query/candidate alignments produce speedups of up to 24.1x. These early results suggest this novel architecture is well-suited to accelerating the filtering step of sequence-to-sequence DNA alignment.
- Towards a Reconfigurable Bit-Serial/Bit-Parallel Vector Accelerator Using In-Situ Processing-In-SRAM. Int’l Conf. on Circuits and Systems (ISCAS), May 2020.
- Shouji: A Fast and Efficient Pre-Alignment Filter for Sequence Alignment. Bioinformatics, 35(21):4255–4263, Mar 2019.
- GateKeeper: A New Hardware Architecture for Accelerating Pre-Alignment in DNA Short Read Mapping. Bioinformatics, 33(21):3355–3363, May 2017.
- GenASM: A High-Performance, Low-Power Approximate String Matching Acceleration Framework for Genome Sequence Analysis. Int’l Symp. on Microarchitecture (MICRO), Oct 2020.
- SeGraM: A Universal Hardware Accelerator for Genomic Sequence-to-Graph and Sequence-to-Sequence Mapping. Int’l Symp. on Computer Architecture (ISCA), Jun 2022.
- Neural Cache: Bit-Serial In-Cache Acceleration of Deep Neural Networks. Int’l Symp. on Computer Architecture (ISCA), Jul 2018.
- Duality Cache for Data Parallel Acceleration. Int’l Symp. on Computer Architecture (ISCA), Jun 2019.
- GenAx: A Genome Sequencing Accelerator. Int’l Symp. on Computer Architecture (ISCA), Jun 2018.
- SeedEx: A Genome Sequencing Accelerator for Optimal Alignments in Subminimal Space. Int’l Symp. on Microarchitecture (MICRO), Oct 2020.
- L. Gwennap. GSI Offers In-Memory Computing. Microprocessor Report, Jul 2020.
- M. Holtgrewe. Mason: A Read Simulator for Second Generation Sequencing Data. Technical Report FU Berlin, #962, Oct 2010.
- A Configurable TCAM/BCAM/SRAM Using 28nm Push-Rule 6T Bit Cell. Symp. on Very Large-Scale Integration Circuits (VLSIC), Jun 2015.
- A 28 nm Configurable Memory (TCAM/BCAM/SRAM) Using Push-Rule 6T BitCell Enabling Logic-in-Memory. IEEE Journal of Solid-State Circuits (JSSC), Apr 2016.
- GRIM-Filter: Fast Seed Location Filtering in DNA Read Mapping Using Processing-in-Memory Technologies. BMC Genomics, 19(2), May 2018.
- H. Li. Minimap2: Pairwise Alignment for Nucleotide Sequences. Bioinformatics, 34(18):3094–3100, May 2018.
- G. Myers. A Fast Bit-Vector Algorithm for Approximate String Matching Based on Dynamic Programming. Journal of the ACM, 46(3):395–415, May 1999.
- GenCache: Leveraging In-Cache Operators for Efficient Sequence Alignment. Int’l Symp. on Microarchitecture (MICRO), Oct 2019.
- The SeqAn C++ Template Library for Efficient Sequence Analysis: A Resource for Programmers. Journal of Biotechnology, 261:157–168, Nov 2017.
- T. Smith and M. F. Waterman. Identification of Common Molecular Subsequences. Journal of Molecular Biology, 147(1):195–197, Mar 1981.
- Darwin: A Genomics Co-processor Provides up to 15,000× acceleration on long read assembly. Int’l Conf. on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Mar 2018.
- A Compute SRAM with Bit-Serial Integer/Floating-Point Operations for Programmable In-Memory Vector Acceleration. Int’l Solid-State Circuits Conf. (ISSCC), Feb 2019.