Compressed Dynamic Range Majority and Minority Data Structures

Abstract

In the range α-majority query problem, we are given a sequence S[1..n] and a fixed threshold α ∈ (0, 1), and are asked to preprocess S such that, given a query range [i..j], we can efficiently report the symbols that occur more than α (j-i+1) times in S[i..j], which are called the range α-majorities. In this article we first describe a dynamic data structure that represents S in compressed space --- nHk+ o(n σ) bits for any k = o(σ n), where σ is the alphabet size and Hk H0 σ is the k-th order empirical entropy of S --- and answers queries in O ( nα n ) time while supporting insertions and deletions in S in O ( nα ) amortized time. We then show how to modify our data structure to receive some β α at query time and report the range β-majorities in O ( nβ n ) time, without increasing the asymptotic space or update-time bounds. The best previous dynamic solution has the same query and update times as ours, but it occupies O(n) words and cannot take advantage of being given a larger threshold β at query time. [ABSTRACT CLIPPED DUE TO LENGTH.]

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…