A distribution weighting a set of laws whose initial states are grouped into classes

Abstract

Let I be a finite alphabet and ⊂ I be a nonempty strict subset. The sequences in I are organized into connected regions which always start with a symbol in . The regions are labelled by types C(s), thus a region starting at s'∈ C(s) has the same type as one starting at s. Let (s: s∈ ) be a family of distributions on I where each s charges sequences starting with the symbol s. We can define a natural distribution on I, that counts the number of visits to the states from s, properly weighted. A dynamics of interest is such that at the first occurrence of s'∈ C(s) the law regenerates with distribution s'. In this case we are able to find simple conditions for to be stationary. In addition, we study the following more complex model: once a symbol s'∈ C(s) has been encountered, there is a decision to be made, either a new region of type C(s') governed by s' starts or the region continues to be a C(s) region. This decision is modeled as random and depends on s'. In this setting a similar distribution to can be constructed and the conditions for stationarity are supplied. These models are inspired by genomic sequences where I is the set of codons, the classes (C(s): s∈ ) group codons defining similar genomic classes, e.g. in bacteria there are two classes corresponding to the start and stop codons, and the random decision to continue a region or to begin a new region of a different class reflects the well-known fact that not every appearance of a start codon marks the beginning of a new coding region.

0

Turn this paper into a lesson

ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.

Discussion (0)

Sign in to join the discussion.

Loading comments…