Correcting One Deletion and One Substitution with a Constant Number of Reads
Abstract
In this paper, we investigate the problem of designing (n, N; B)-reconstruction codes for N∈ \14,11,9,5\, where B is the single-deletion single-substitution ball function that maps a sequence to the set of all sequences obtainable via one deletion and one substitution. Such a code is defined by the requirement that the intersection size of any two distinct single-deletion single-substitution balls is strictly less than the given number of noisy reads N. Note that for any 1 N<N', an (n, N; B)-reconstruction code is also an (n, N'; B)-reconstruction code. It follows that the problem of designing (n, N; B)-reconstruction codes with less redundancy becomes more challenging as N decreases, particularly because the problem for N=1 already reduces to the coding problem of single-deletion and single-substitution correcting codes. To the best of our knowledge, most existing results focus on the case where N is a linear function of n, while only a limited number consider constant N. When N=1, the best known (n, 1; B)-reconstruction codes (single-deletion and single-substitution correcting codes) require (4+o(1)) n redundant bits. In this work, we show that this redundancy can be reduced to 3 n+4 when N=5. As N increases further to 9 and 11, the redundancy can be improved to 2 n+12 n+O(1) and n +12 n+O(1), respectively. Finally, for N=14, we provide a reconstruction code with n+3 bits of redundancy, which is only two bits more than the best known (n, 18; B)-reconstruction codes.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.