Approximating LZ77 via Small-Space Multiple-Pattern Matching

Abstract

We generalize Karp-Rabin string matching to handle multiple patterns in O(n n + m) time and O(s) space, where n is the length of the text and m is the total length of the s patterns, returning correct answers with high probability. As a prime application of our algorithm, we show how to approximate the LZ77 parse of a string of length n. If the optimal parse consists of z phrases, using only O(z) working space we can return a parse consisting of at most (1+)z phrases in O(-1n n) time, for any ∈ (0,1]. As previous quasilinear-time algorithms for LZ77 use (n/polylog n) space, but z can be exponentially small in n, these improvements in space are substantial.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…