Parameterized DAWGs: efficient constructions and bidirectional pattern searches

Abstract

Two strings x and y over of equal length are said to parameterized match (p-match) if there is a renaming bijection f: → that is identity on and transforms x to y (or vice versa). The p-matching problem is to look for substrings in a text that p-match a given pattern. In this paper, we propose parameterized suffix automata (p-suffix automata) and parameterized directed acyclic word graphs (PDAWGs) which are the p-matching versions of suffix automata and DAWGs. While suffix automata and DAWGs are equivalent for standard strings, we show that p-suffix automata can have (n2) nodes and edges but PDAWGs have only O(n) nodes and edges, where n is the length of an input string. We also give an O(n || (|| + ||))-time O(n)-space algorithm that builds the PDAWG in a left-to-right online manner. As a byproduct, it is shown that the parameterized suffix tree for the reversed string can also be built in the same time and space, in a right-to-left online manner. This duality also leads us to two further efficient algorithms for p-matching: Given the parameterized suffix tree for the reversal of the input string T, one can build the PDAWG of T in O(n) time in an offline manner; One can perform bidirectional p-matching in O(m (||+||) + occ) time using O(n) space, where m denotes the pattern length and occ is the number of pattern occurrences in the text T.

0

Discussion (0)

Sign in to join the discussion.

Loading comments…