Successor-bispecial strings with minimum Burrows--Wheeler transform runs
Abstract
We study successor-bispecial strings over an alphabet Σ of size σ, a minimal-branching analogue of de Bruijn strings, and ask how few Burrows--Wheeler transform (BWT) runs are possible. In a de Bruijn string of order k, every (k-1)-gram has all σ right-extensions; here, every (k-1)-gram has exactly two right-extensions, determined by a successor rule, which also forces two left-extensions. For order 3, we construct an explicit family Bσ(3), for every σ≥ 2, whose cyclic BWT has rc = σ2 + 2 runs. A suitable terminated linearization has the same run count, r = rc = σ2 + 2, while the smallest suffixient set has size χ= 2σ2 + 1. The ratio χ/r = 2 - 3/(σ2 + 2) nearly saturates the known bound χ/r ≤ 2, which we have previously shown to be asymptotically tight. Compared with our earlier general construction, this improves the gap from O(1/σ) to O(1/σ2). We also show that the order-3 pattern appears as a blockwise two-row projection of normalized linear-feedback shift register (LFSR) de Bruijn sequences over Fq, when primitive trinomials x3 - x + c exist. For higher orders, we prove a general lower bound rc ≥ σk-1 + 2 for every σ≥ 3 in the exact-length regime and analyze the boundary-merged higher-order candidate using the last-to-first (LF) permutation: it fails for k = 4 and all σ≥ 3, while verified k = 5 instances for σ∈ 3,4 yield χ/r ratios exceeding 1.96.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.