2000 character limit reached
Shift-Interleave Coding for DNA-Based Storage: Correction of IDS Errors and Sequence Losses (2401.14594v1)
Published 26 Jan 2024 in cs.IT and math.IT
Abstract: We propose a novel coding scheme for DNA-based storage systems, called the shift-interleave (SI) coding, designed to correct insertion, deletion, and substitution (IDS) errors, as well as sequence losses. The SI coding scheme employs multiple codewords from two binary low-density parity-check codes. These codewords are processed to form DNA base sequences through shifting, bit-to-base mapping, and interleaving. At the receiver side, an efficient non-iterative detection and decoding scheme is employed to sequentially estimate codewords. The numerical results demonstrate the excellent performance of the SI coding scheme in correcting both IDS errors and sequence losses.
- G. M. Church, Y. Gao, and S. Kosuri, “Next-generation digital information storage in DNA,” Science, vol.337, no.6102, pp.1628–1628, Sep. 2012.
- S. M. H. T. Yazdi, H. M. Kiah, E. Garcia-Ruiz, J. Ma, H. Zhao, and O. Milenkovic, “DNA-based storage: trends and methods,” IEEE Trans. Mol. Biol. Multi-Scale Commun., vol.1, no.3, pp.230–248, Sep. 2015.
- Y. Dong, F. Sun, Z. Ping, Q. Ouyang, and L. Qian, “DNA storage: research landscape and future prospects,” Natl Sci Rev, vol.7, no.6, pp.1092–1107, Jun. 2020.
- L. Xiang, Q. Liu, S. Chen, K. Yan, W. Wu, and K. Yang, “A tutorial on coding methods for DNA-based molecular communications and storage,” IEEE Internet Things J., early access, Nov. 2023.
- Y. Erlich and D. Zielinski, “DNA Fountain enables a robust and efficient storage architecture,” Science, vol.355, no.6328, pp.950–954, Mar. 2017.
- W. H. Press, J. A. Hawkins, S. K. Jones Jr, J. M. Schaub, and I. J. Finkelstein, “HEDGES error-correcting code for DNA storage corrects indels and allows sequence constraints,” Proc. Natl. Acad. Sci. U. S. A., vol. 117, no. 31, pp. 18489–18496, Aug. 2020.
- K. Cai, Y. M. Chee, R. Gabrys, H. M. Kiah, and T. T. Nguyen, “Correcting a single indel/edit for DNA-based data storage: linear-time encoders and order-optimality,” IEEE Trans. Inf. Theory, vol.67, no.6, pp.3438–3451, Jun. 2021.
- P. Fei and Z. Wang, “LDPC codes for portable DNA storage,” Proc. IEEE International Symposium on Information Theory (ISIT), Paris, France, Jul. 2019, pp.76–80.
- J. Tong, G. Han, and Y. Sun, “An improved marker code scheme based on nucleotide bases for DNA data storage,” Appl. Sci., vol.13, no.6, 3632, Mar. 2023.
- I. Maarouf, A. Lenz, L. Welter, A. Wachter-Zeh, E. Rosnes, and A. G. i. Amat, “Concatenated codes for multiple reads of a DNA sequence,” IEEE Trans. Inf. Theory, vol.69, no.2, pp.910–927, Feb. 2023.
- R. Shibata, “Hierarchical interleaving and chained recovery schemes for noisy insertion and deletion channels,” submitted to the IEEE Trans. Commun., 2023.
- R. Shibata and H. Yashima, “Delayed coding scheme for channels with insertion, deletion, and substitution errors,” Proc. IEEE Global Communications Conference (GLOBECOM), Rio de Janeiro, Brazil, Dec. 2022, pp.1–6.
- A. Lenz, P. H. Siegel, A. Wachter-Zeh, and E. Yaakobi, “The noisy drawing channel: reliable data storage in DNA sequences,” IEEE Trans. Inf. Theory, vol.69, no.5, pp.2757–2778, May 2023.
- F. Wang, D. Fertonani, and T. M. Duman, “Symbol-level synchronization and LDPC code design for insertion/deletion channels,” IEEE Trans. Commun., vol.59, no.5, pp.1287–1297, May 2011.
- M. C. Davey and D. J. C. Mackay, “Reliable communication over channels with insertions, deletions, and substitutions,” IEEE Trans. Inf. Theory, vol.47, no.2, pp.687–698, Feb. 2001.
- R. Shibata, G. Hosoya, and H. Yashima, “Design and construction of irregular LDPC codes for channels with synchronization errors: new aspect of degree profiles,” IEICE Trans. Fundamentals, vol.E103-A, no.10, pp.1237–1247, Oct. 2020.
- F. R. Kschischang, B. J. Frey, and H.-A. Loeliger, “Factor graphs and the sum-product algorithm,” IEEE Trans. Inf. Theory, vol.47, no.2, pp. 498–519, Feb. 2001.
- T. J. Richardson, M. A. Shokrollahi, and R. L. Urbanke, “Design of capacity-approaching irregular low-density parity-check codes,” IEEE Trans. Inf. Theory, vol.47, no.2, pp.619–637, Feb. 2001.