Deletion-Correcting Codes for the -Symbol Read Channel

Abstract

This paper studies deletion-correcting codes for the -symbol read channel, whose noiseless output is the vector of all consecutive -mers of a transmitted sequence. This model is motivated by overlapping-read mechanisms arising in nanopore sequencing, racetrack memories with consecutive read heads, and related sequence-labeling problems. We consider an adversarial setting in which a fixed number of -mers are deleted from the read vector. Our first contribution is a structural characterization of the effect of such deletions: after a minimum number of -mers are inserted to restore consistency, the resulting sequence is obtained from the transmitted sequence by deleting symbols from certain periodic substrings; when t -2, these deletions correspond to complete minimum periods. Based on this characterization, we introduce check patterns and construct -read deletion-correcting codes via power-sum syndromes. For every 2, we obtain single-deletion correcting codes with redundancy (n+2)/(-1). For 2 t /2, we construct q-ary -read t-deletion correcting codes with redundancy t n+O(1), and for =2t-1 with t3, we construct codes with redundancy (2t-1) n+O(1). We also study the sporadic parameter pairs (,t)∈\(2,2),(3,2),(3,3)\ and obtain improved constructions, including binary -read 2-deletion correcting codes with redundancy 2 n+O(1) for =2,3, a non-binary 3-read 2-deletion correcting code with redundancy 3 n+O(1), a binary 3-read 3-deletion correcting code with redundancy 5 n+O(1), and a non-binary 3-read 3-deletion correcting code with redundancy 7 n+O(1).

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…