Codes with Biochemical Constraints and Single Error Correction for DNA-Based Data Storage
Abstract
In DNA-based data storage, DNA codes with biochemical constraints and error correction are designed to protect data reliability. Single-stranded DNA sequences with secondary structure avoidance (SSA) avoid undesirable secondary structures, which may cause chemical inactivity. The homopolymer run-length limit and the GC-balanced limit also help to reduce the error probability of DNA sequences during synthesis and sequencing. In this letter, based on a recent work bib7, we construct DNA codes that are free of secondary structures of stem length $\geq m$ and have homopolymer run-length $\leq \ell$, for odd $m \leq 11$ and $\ell \geq 3$, with rate $1+\log_2\rho_m-3/(2^{\ell-1}+\ell+1)$, where $\rho_m$ is given in Table tm. In particular, when $m=3$ and $\ell=4$, the rate tends to 1.3206 bits/nt, beating a previous work by Benerjee et al. We also construct DNA codes that satisfy all three of the above constraints and additionally correct a single error. Finally, codes with the GC-locally balanced constraint are presented.
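To make the three constraints named in the abstract concrete, the following is a minimal Python sketch of checkers for a single strand. It is not the construction from the letter. The function names are hypothetical, and the stem test assumes one common formalization of secondary structure avoidance: a strand has a stem of length m if two non-overlapping windows of length m are reverse complements of each other. The letter's exact definition may differ.

```python
# Illustrative checkers only; names and the stem definition are assumptions.
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def max_homopolymer_run(seq: str) -> int:
    """Length of the longest run of identical consecutive nucleotides."""
    longest = 0
    run = 0
    prev = None
    for base in seq:
        run = run + 1 if base == prev else 1
        longest = max(longest, run)
        prev = base
    return longest


def gc_content(seq: str) -> float:
    """Fraction of nucleotides that are G or C (seq must be non-empty)."""
    return sum(base in "GC" for base in seq) / len(seq)


def reverse_complement(seq: str) -> str:
    """Watson-Crick complement of the strand, read in reverse."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def has_stem(seq: str, m: int) -> bool:
    """True if two non-overlapping length-m windows are reverse complements.

    Assumed formalization of a 'secondary structure of stem length m';
    brute-force quadratic scan, fine for short strands.
    """
    n = len(seq)
    for i in range(n - m + 1):
        target = reverse_complement(seq[i:i + m])
        for j in range(i + m, n - m + 1):
            if seq[j:j + m] == target:
                return True
    return False
```

For instance, `has_stem("AAGCCTT", 3)` is true because the window `AAG` is the reverse complement of the later window `CTT`, while `max_homopolymer_run("AACCCGT")` is 3 and so would satisfy a run-length limit of 3 or more.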