Space-Efficient String Indexing for Wildcard Pattern Matching

Abstract

In this paper we describe compressed indexes that support pattern matching queries for strings with wildcards. For a constant size alphabet our data structure uses $O(n\log^{\varepsilon} n)$ bits for any $\varepsilon>0$ and reports all $\mathrm{occ}$ occurrences of a wildcard string in $O(m+\sigma^{g}\cdot\mu(n)+\mathrm{occ})$ time, where $\mu(n)=o(\log\log\log n)$, $\sigma$ is the alphabet size, $m$ is the number of alphabet symbols and $g$ is the number of wildcard symbols in the query string. We also present an $O(n)$-bit index with $O((m+\sigma^{g}+\mathrm{occ})\log^{\varepsilon} n)$ query time and an $O(n(\log\log n)^{2})$-bit index with $O((m+\sigma^{g}+\mathrm{occ})\log\log n)$ query time. These are the first non-trivial data structures for this problem that need $o(n\log n)$ bits of space.
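To make the query concrete, here is a minimal Python sketch of the problem being indexed, not of the paper's data structures. It assumes that a wildcard in the query string matches any single alphabet symbol. The marker `?` and the function names `naive_wildcard_occurrences` and `expansions` are hypothetical, chosen only for illustration. The first function is an index-free baseline that scans the text directly. The second enumerates every way to fill the g wildcards, which is one way to see where a factor of σ raised to the power g can come from in the query bounds.

```python
from itertools import product

WILDCARD = "?"  # hypothetical wildcard marker, not notation from the paper


def naive_wildcard_occurrences(text, pattern):
    """Return every start position where pattern matches text (direct scan, no index)."""
    k = len(pattern)
    return [
        i
        for i in range(len(text) - k + 1)
        # A wildcard position matches any symbol; other positions must be equal.
        if all(p == WILDCARD or p == c for p, c in zip(pattern, text[i:i + k]))
    ]


def expansions(pattern, alphabet):
    """Yield all sigma**g wildcard-free strings obtained by filling in each wildcard."""
    # Each wildcard contributes the whole alphabet; each fixed symbol contributes itself.
    slots = [alphabet if p == WILDCARD else p for p in pattern]
    for combo in product(*slots):
        yield "".join(combo)
```

For example, `naive_wildcard_occurrences("abracadabra", "a?a")` returns `[3, 5]`, and over the five-symbol alphabet `"abcdr"` the pattern `"a?a"` has five expansions. The direct scan takes time proportional to the product of the text and pattern lengths, which is the cost an index is meant to avoid.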
