Bijective BWT based compression schemes

Abstract

We investigate properties of the bijective Burrows-Wheeler transform (BBWT). We show that for any string w, a bidirectional macro scheme of size O(rB) can be induced from the BBWT of w, where rB is the number of maximal character runs in the BBWT. We also show that rB = O(z2 n), where n is the length of w and z is the number of Lempel-Ziv 77 factors of w. Then, we show a separation between BBWT and BWT by a family of strings with rB = ( n) but having only r=2 maximal character runs in the standard Burrows--Wheeler transform (BWT). However, we observe that the smallest rB among all cyclic rotations of w is always at most r. While an o(n2) algorithm for computing an optimal rotation giving the smallest rB is still open, we show how to compute the Lyndon factorizations -- a component for computing BBWT -- of all cyclic rotations in O(n) time. Furthermore, we conjecture that we can transform two strings having the same Parikh vector to each other by BBWT and rotation operations, and prove this conjecture for the case of binary alphabets and permutations.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…