On the Palindromic/Reverse-Complement Duplication Correcting Codes
Abstract
Motivated by applications in in-vivo DNA storage, we study codes for correcting duplications. A reverse-complement duplication of length k is the insertion of the reversed and complemented copy of a substring of length k adjacent to its original position, while a palindromic duplication only inserts the reversed copy without complementation. We first construct an explicit code with a single redundant symbol capable of correcting an arbitrary number of reverse-complement duplications (respectively, palindromic duplications), provided that all duplications have length k 3 q n and are disjoint. Next, we derive a Gilbert-Varshamov bound for codes that can correct a reverse-complement duplication (respectively, palindromic duplication) of arbitrary length, showing that the optimal redundancy is upper bounded by 2q n + qq n + O(1). Finally, for q 4, we present two explicit constructions of codes that can correct t length-one reverse-complement duplications. The first construction achieves a redundancy of 2tq n + O(qq n) with encoding complexity O(n) and decoding complexity O(n(2 n)4). The second construction achieves an improved redundancy of (2t-1)q n + O(qq n), but with encoding and decoding complexities of O(n · poly(2 n)).