Lempel-Ziv: a "one-bit catastrophe" but not a tragedy

Abstract

The so-called "one-bit catastrophe" for the compression algorithm LZ'78 asks whether the compression ratio of an infinite word can change when a single bit is added in front of it. We answer positively this open question raised by Lutz and others: we show that there exists an infinite word w such that sup(w)=0 but inf(0w)>0, where sup and inf are respectively the and the of the compression ratios of the prefixes. To that purpose we explore the behaviour of LZ'78 on finite words and show the following results: - There is a constant C>0 such that, for any finite word w and any letter a, (aw)≤ C(w)|w|. Thus, sufficiently compressible words ((w)=o(1/|w|)) remain compressible with a letter in front; - The previous result is tight up to a multiplicative constant for any compression ratio (w)=O(1/|w|). In particular, there are infinitely many words w satisfying (w)=O(1/|w|) but (0w)=(1).

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…