Lempel-Ziv: a "one-bit catastrophe" but not a tragedy
Abstract
The so-called "one-bit catastrophe" for the compression algorithm LZ'78 asks whether the compression ratio of an infinite word can change when a single bit is added in front of it. We answer positively this open question raised by Lutz and others: we show that there exists an infinite word w such that sup(w)=0 but inf(0w)>0, where sup and inf are respectively the and the of the compression ratios of the prefixes. To that purpose we explore the behaviour of LZ'78 on finite words and show the following results: - There is a constant C>0 such that, for any finite word w and any letter a, (aw)≤ C(w)|w|. Thus, sufficiently compressible words ((w)=o(1/|w|)) remain compressible with a letter in front; - The previous result is tight up to a multiplicative constant for any compression ratio (w)=O(1/|w|). In particular, there are infinitely many words w satisfying (w)=O(1/|w|) but (0w)=(1).
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.