Rice-Marlin Codes: Tiny and Efficient Variable-to-Fixed Codes
Abstract
Marlin is a Variable-to-Fixed (VF) codec optimized for high decoding speed through the use of small dictionaries that fit in the L1 cache of most CPUs. While Marlin dictionaries are small enough for fast decoding, they are still too large to allow fast encoding. We address this problem by proposing two techniques to reduce the alphabet size. The first is to encode rare symbols in their own segment, and the second is to combine Marlin dictionaries with Rice encoding, hence the name Rice-Marlin for our new codec. Using these techniques, we reduce the size of Marlin dictionaries by a factor of 16, which not only enables faster encoding but also achieves better compression efficiency.
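As a rough illustration of how a Rice-style split can shrink the alphabet that a dictionary coder has to handle, the sketch below separates each 8-bit symbol into its high bits and its k low bits. Everything here is an assumption made for illustration: the function names `rice_split` and `rice_merge`, the choice k = 4, and the idea that only the high part would be passed to a dictionary coder are not taken from the abstract and are not the authors' implementation.

```python
def rice_split(data, k):
    """Split each byte into (high, low) parts, Rice-style.

    high = symbol >> k          : small-alphabet part (hypothetically dictionary coded)
    low  = symbol & (2**k - 1)  : k raw bits stored verbatim
    """
    mask = (1 << k) - 1
    high = [b >> k for b in data]
    low = [b & mask for b in data]
    return high, low


def rice_merge(high, low, k):
    """Inverse of rice_split: rebuild the original bytes."""
    return bytes((h << k) | l for h, l in zip(high, low))


# Illustrative input; k = 4 is an assumed parameter.
data = bytes([0, 3, 17, 200, 255])
high, low = rice_split(data, 4)

# With k = 4, the high parts take values in 0..15 instead of 0..255.
restored = rice_merge(high, low, 4)
```

With k = 4, the high part of an 8-bit symbol can take only 16 distinct values, so a dictionary built over it deals with a much smaller alphabet, while the low bits are cheap to copy in both directions.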