Tractable Gap-Constraint Languages for Complex Event Recognition

Abstract

For strings u, D ∈ Σ*, a subsequence embedding of u in D is a function e \1, 2, …, |u|\ \1, 2, …, |D|\ with e(i) < e(i+1) for every i ∈ \1, 2, …, |u|-1\ and the i-th symbol of u equals the e(i)-th symbol of D. A gap-constraint for u is a triple (i, j, L) with 1 ≤ i < j ≤ |u| and L is a regular language over Σ. An embedding e satisfies a gap-constraint (i, j, L) if the factor of D strictly between positions e(i) and e(j) is a word from L. We investigate the subsequence matching problem with gap-constraints, which is relevant in the context of complex event recognition (CER): given u, D ∈ Σ* and a set C of gap-constraints, find an embedding of u in D that satisfies all gap-constraints from C. In general, subsequence matching is NP-complete and the only known tractable variants restrict the interval structure of the gap-constraints. In this work, we show that we can solve subsequence matching with gap-constraints with an arbitrary interval structure rather efficiently (in fact, optimally under SETH) in time O(|D| (|u| + |C|)) if the gap-constraint languages satisfy a property which we dub left-convexity: whenever u v w ∈ L and v ∈ L, then also uv ∈ L. Left-convex languages are sufficiently expressive to model interesting real-world scenarios considered in CER, e.g., length constraints L = \w a ≤ |w| ≤ b\ for a, b ∈ N. We also show how our algorithm can be used in order to efficiently enumerate all satisfying embeddings, which is particularly relevant for possible applications in CER. Finally, we show how non-left-convex languages can lead to intractability, i.e., if in addition to length constraints we allow \aa, ε\ as the only non-left-convex constraint language, then the problem is NP-complete again.

0

Discussion (0)

Sign in to join the discussion.

Loading comments…