Improving Pyramid Vector Quantizer with power projection
Abstract
Pyramid Vector Quantizer (PVQ) is a promising technique, especially for multimedia data compression; it is already used in the Opus audio codec and has been considered for the AV1 video codec. It quantizes vectors from the Euclidean unit sphere by first projecting them onto the $L^1$-norm unit sphere, then quantizing and encoding them there. This paper shows that the standard radial projection is suboptimal and proposes to tune its deformations by using a parameterized power projection, $x \to x^p/\|x^p\|$, instead, where the optimized power $p$ is applied coordinate-wise. This usually gives an improvement of $\geq 0.5$ dB compared to radial projection.
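The projection-then-quantize pipeline described in the abstract can be illustrated with a minimal Python sketch. Several details here are assumptions, not taken from the paper: the norm in $\|x^p\|$ is taken to be the $L^1$ norm, the power is applied as $\mathrm{sign}(x_i)\,|x_i|^p$ so that signs survive, the quantizer uses a simple largest-remainder rounding to $K$ pulses, and the function names `power_projection`, `pvq_quantize`, and `reconstruct` are hypothetical.

```python
import math

def power_projection(x, p):
    """Map x onto the L1 unit sphere via sign(x_i)*|x_i|^p / sum_j |x_j|^p.

    Assumption: the normalizing norm is L1 and the sign is kept separately.
    """
    powered = [math.copysign(abs(v) ** p, v) for v in x]
    norm1 = sum(abs(v) for v in powered)
    if norm1 == 0.0:
        raise ValueError("cannot project the zero vector")
    return [v / norm1 for v in powered]

def pvq_quantize(y, K):
    """Round a point y on the L1 unit sphere to integers with sum |k_i| = K.

    Assumption: largest-remainder rounding, used here only for illustration.
    """
    scaled = [abs(v) * K for v in y]
    k = [math.floor(s) for s in scaled]
    remaining = K - sum(k)
    # Hand the leftover pulses to the coordinates with the largest fractions.
    order = sorted(range(len(y)), key=lambda i: scaled[i] - k[i], reverse=True)
    for i in order[:remaining]:
        k[i] += 1
    return [int(math.copysign(ki, v)) if ki else 0 for ki, v in zip(k, y)]

def reconstruct(k, p):
    """Undo the power (exponent 1/p) and renormalize to the Euclidean sphere."""
    inv = [math.copysign(abs(v) ** (1.0 / p), v) for v in k]
    norm2 = math.sqrt(sum(v * v for v in inv))
    return [v / norm2 for v in inv]

# p = 1 reproduces the standard radial projection.
x = [0.6, -0.8, 0.0]
y = power_projection(x, 1.0)
k = pvq_quantize(y, 7)
x_hat = reconstruct(k, 1.0)
```

With `p = 1.0` the sketch reduces to the radial projection that the abstract uses as its baseline; other values of `p` deform the projection. The abstract does not state the optimized value of `p`, so none is assumed here.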