An Upper Bound on the Convergence Rate of a Second Functional in Optimal Sequence Alignment

Abstract

Consider finite sequences X[1,n]=X1… Xn and Y[1,n]=Y1… Yn of length n, consisting of i.i.d.\ samples of random letters from a finite alphabet, and let S and T be chosen i.i.d.\ randomly from the unit ball in the space of symmetric scoring functions over this alphabet augmented by a gap symbol. We prove a probabilistic upper bound of linear order in n0.75 for the deviation of the score relative to T of optimal alignments with gaps of X[1,n] and Y[1,n] relative to S. It remains an open problem to prove a lower bound. Our result contributes to the understanding of the microstructure of optimal alignments relative to one given scoring function, extending a theory begun by the first two authors.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…