A note on the longest common substring with k-mismatches problem
Abstract
The recently introduced longest common substring with k-mismatches (k-LCF) problem is to find, given two sequences S1 and S2 of length n each, a longest substring A1 of S1 and A2 of S2 such that the Hamming distance between A1 and A2 is at most k. So far, the only subquadratic time result for this problem was known for k = 1~FGKU2014. We first present two output-dependent algorithms solving the k-LCF problem and show that for k = O(1- n), where > 0, at least one of them works in subquadratic time, using O(n) words of space. The choice of one of these two algorithms to be applied for a given input can be done after linear time and space preprocessing. Finally we present a tabulation-based algorithm working, in its range of applicability, in O(n2(k+0, σ)/ n) time, where 0 is the length of the standard longest common substring.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.