On the Number of Subsequences in the Nonbinary Deletion Channel
Abstract
In the deletion channel, an important problem is to determine the number of subsequences derived from a string U of length n when subjected to t deletions. It is well-known that the number of subsequences in the setting exhibits a strong dependence on the number of runs in the string U, where a run is defined as a maximal substring of identical characters. In this paper we study the number of subsequences of a non-binary string in this scenario, and propose some improved bounds on the number of subsequences of r-run non-binary strings. Specifically, we characterize a family of r-run non-binary strings with the maximum number of subsequences under any t deletions, and show that this number can be computed in polynomial time.
Turn this paper into a full lesson
ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.