A stationary distribution associated to a set of laws whose initial states are grouped into classes. an application in genomics
Artículo
Open/ Download
Access note
Acceso a solo metadatos
Publication date
2016Metadata
Show full item record
Cómo citar
Martínez Aguilera, Servet
Cómo citar
A stationary distribution associated to a set of laws whose initial states are grouped into classes. an application in genomics
Author
Abstract
Let I be a finite set and S be a nonempty strict subset of I which is partitioned into classes, and let C(s) be the class containing s is an element of S. Let (P-s : s is an element of S) be a family of distributions on I-N, where each P-s applies to sequences starting with the symbol s. To this family, we associate a class of distributions P-(pi) on I-N which depends on a probability vector pi. Our main results assume that, for each s is an element of S, Ps regenerates with distribution P-s' when it encounters s' is an element of S \ C(s). From semiregenerative theory, we determine a simple condition on p for P(p) to be time stationary. We give a similar result for the following more complex model. Once a symbol s' is an element of S \ C(s) has been encountered, there is a decision to be made: either a new region of type C(s') governed by P-s' starts or the region continues to be a C(s) region. This decision is modeled as a random event and its probability depends on s and s'. The aim in studying these kinds of models is to attain a deeper statistical understanding of bacterial DNA sequences. Here I is the set of codons and the classes (C(s) : s is an element of S) identify codons that initiate similar genomic regions. In particular, there are two classes corresponding to the start and stop codons which delimit coding and noncoding regions in bacterial DNA sequences. In addition, the random decision to continue the current region or begin a new region of a different class reflects the well- known fact that not every appearance of a start codon marks the beginning of a new coding region.
Patrocinador
Center for Mathematical Modeling (CMM) Basal CONICYT Program PFB 03 INRIA-CHILE program CIRIC
Indexation
Artículo de publicación ISI
Quote Item
Journal of Applied Probability Volumen: 53 Número: 2 Páginas: 315-326 Jun 2016
Collections