An Upper Bound on the Convergence Rate of a Second Functional in Optimal Sequence Alignment
Abstract
Consider finite sequences X[1,n]=X1… Xn and Y[1,n]=Y1… Yn of length n, consisting of i.i.d.\ samples of random letters from a finite alphabet, and let S and T be chosen i.i.d.\ randomly from the unit ball in the space of symmetric scoring functions over this alphabet augmented by a gap symbol. We prove a probabilistic upper bound of linear order in n0.75 for the deviation of the score relative to T of optimal alignments with gaps of X[1,n] and Y[1,n] relative to S. It remains an open problem to prove a lower bound. Our result contributes to the understanding of the microstructure of optimal alignments relative to one given scoring function, extending a theory begun by the first two authors.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.