2000 character limit reached
Matching reads to many genomes with the $r$-index (1908.01263v1)
Published 4 Aug 2019 in cs.DS and q-bio.GN
Abstract: The $r$-index is a tool for compressed indexing of genomic databases for exact pattern matching, which can be used to completely align reads that perfectly match some part of a genome in the database or to find seeds for reads that do not. This paper shows how to download and install the programs ri-buildfasta and ri-align; how to call ri-buildfasta on a FASTA file to build an $r$-index for that file; and how to query that index with ri-align. Availability: The source code for these programs is released under GPLv3 and available at https://github.com/alshai/r-index .
- Taher Mun (3 papers)
- Alan Kuhnle (27 papers)
- Christina Boucher (17 papers)
- Travis Gagie (123 papers)
- Ben Langmead (11 papers)
- Giovanni Manzini (38 papers)