Computing q-gram Frequencies on Collage Systems

Abstract

Collage systems are a general framework for representing outputs of various text compression algorithms. We consider the all q-gram frequency problem on compressed string represented as a collage system, and present an O((q+h n)n)-time O(qn)-space algorithm for calculating the frequencies for all q-grams that occur in the string. Here, n and h are respectively the size and height of the collage system.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…