Construction of Sparse Suffix Trees and LCE Indexes in Optimal Time and Space

Abstract

The notions of synchronizing and partitioning sets are recently introduced variants of locally consistent parsings with great potential in problem-solving. In this paper we propose a deterministic algorithm that constructs, for a given read-only string of length $n$ over the alphabet $\{0,1,\ldots,n^{O(1)}\}$, a variant of a $\tau$-partitioning set with size $O(b)$ and $\tau = \frac{n}{b}$ using $O(b)$ space and $O(\frac{1}{\epsilon}n)$ time provided $b \ge n^{\epsilon}$, for $\epsilon > 0$. As a corollary, for $b \ge n^{\epsilon}$ and constant $\epsilon > 0$, we obtain linear construction algorithms with $O(b)$ space on top of the string for two major small-space indexes: a sparse suffix tree, which is a compacted trie built on $b$ chosen suffixes of the string, and a longest common extension (LCE) index, which occupies $O(b)$ space and allows us to compute the longest common prefix for any pair of substrings in $O(n/b)$ time. For both, the $O(b)$ construction storage is asymptotically optimal since the tree itself takes $O(b)$ space and any LCE index with $O(n/b)$ query time must occupy at least $O(b)$ space by a known trade-off (at least for $b \ge \Omega(n / \log n)$). In the case of arbitrary $b \ge \Omega(\log^2 n)$, we present construction algorithms for the partitioning set, sparse suffix tree, and LCE index with $O(n \log_b n)$ running time and $O(b)$ space, thus also improving the state of the art.
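
To make the two query problems concrete, here is a minimal brute-force sketch in Python of what an LCE query and a sparse suffix ordering compute. It is not the paper's algorithm and makes no attempt at the $O(b)$ space or $O(n/b)$ query bounds; the function names `lce` and `sparse_suffix_order` are hypothetical and chosen only for illustration.

```python
def lce(s: str, i: int, j: int) -> int:
    """Length of the longest common prefix of s[i:] and s[j:].

    Brute force: compares character by character, so a query costs
    time proportional to the answer rather than O(n/b).
    """
    k = 0
    while i + k < len(s) and j + k < len(s) and s[i + k] == s[j + k]:
        k += 1
    return k


def sparse_suffix_order(s: str, positions: list) -> list:
    """Return the chosen suffix start positions in lexicographic order.

    This is the leaf order of a sparse suffix tree built on those
    suffixes. Naive: materializes every chosen suffix as a sort key,
    which is far from O(b) working space.
    """
    return sorted(positions, key=lambda p: s[p:])


if __name__ == "__main__":
    text = "abracadabra"
    print(lce(text, 0, 7))                        # suffixes "abracadabra" and "abra"
    print(sparse_suffix_order("banana", [0, 2, 4]))
```

An index of the kind described in the abstract answers the same queries, but it stores only $O(b)$ words in addition to the read-only string instead of scanning or copying suffixes.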
