Asymptotic distribution of motifs in a stochastic context-free grammar model of RNA folding

Abstract

We analyze the distribution of RNA secondary structures given by the Knudsen-Hein stochastic context-free grammar used in the prediction program Pfold. We prove that the distribution of base pairs, helices and various types of loops in RNA secondary structures in this probabilistic model is asymptotically Gaussian, for a generic choice of the grammar probabilities. Our proofs are based on singularity analysis of probability generating functions. Finally, we use our results to discuss how this model reflects the properties of some known ribosomal secondary structures.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…