Permutation Recovery Problem against Deletion Errors for DNA Data Storage (2403.15827v1)

Published 23 Mar 2024 in cs.IT and math.IT

Abstract: Owing to its immense storage density and durability, DNA has emerged as a promising storage medium. However, due to technological constraints, data can only be written onto many short DNA molecules called data blocks that are stored in an unordered way. To handle the unordered nature of DNA data storage systems, a unique address is typically prepended to each data block to form a DNA strand. However, DNA storage systems are prone to errors and generate multiple noisy copies of each strand called DNA reads. Thus, we study the permutation recovery problem against deletions errors for DNA data storage. The permutation recovery problem for DNA data storage requires one to reconstruct the addresses or in other words to uniquely identify the noisy reads. By successfully reconstructing the addresses, one can essentially determine the correct order of the data blocks, effectively solving the clustering problem. We first show that we can almost surely identify all the noisy reads under certain mild assumptions. We then propose a permutation recovery procedure and analyze its complexity.

References (14)

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/Encoding/status/1772542763097096448

Permutation Recovery Problem against Deletion Errors for DNA Data Storage (2403.15827v1)

Summary

Related Papers

Tweets