OVT-MLCS: An Online Visual Tool for MLCS Mining from Long or Big Sequences

Abstract

Mining multiple longest common subsequences (MLCS) from a set of sequences of three or more over a finite alphabet (a classical NP-hard problem) is an important task in a wide variety of application fields. Unfortunately, there is still no exact MLCS algorithm/tool that can handle long (length 1,000) or big (length 10,000) sequences, which seriously hinders the development and utilization of massive long or big sequences from various application fields today. To address the challenge, we first propose a novel key point-based MLCS algorithm for mining big sequences, called KP-MLCS, and then present a new method, which can compactly represent all mined MLCSs and quickly reveal common patterns among them. Furthermore, by introducing some new techniques, e.g., real-time graphic visualization and serialization, we have developed a new online visual MLCS mining tool, called OVT-MLCS. OVT-MLCS demonstrates that it not only enables effective online mining, storing, and downloading of MLCSs in the form of graphs and text from long or big sequences with a scale of 3 to 5000 but also provides user-friendly interactive functions to facilitate inspection and analysis of the mined MLCSs. We believe that the functions provided by OVT-MLCS will promote stronger and wider applications of MLCS.

0

Turn this paper into a full lesson

ArcXiv compiles a staged curriculum from this paper: 8-12 lessons across beginner → advanced, synthesised section guides, visuals, flashcards, a quiz, exercises, and on-demand deep dives per section. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…