Statistical inference for DNA sequences of promoters: a non-stationary qualitative model

被引:0
|
作者
Li, Xiaoyin [1 ]
Doukhan, Paul [2 ,3 ]
Feugeas, Jean-Paul [4 ]
机构
[1] Case Western Reserve Univ, Dept Epidemiol & Biostat, Cleveland, OH 44106 USA
[2] Univ Cergy Pontoise, AGM, Dept Math, Ile De France, Cergy Pontoise, France
[3] IUF, Ile De France, Cergy Pontoise, France
[4] INSERM, UMR 1137, Paris, France
关键词
Modelling DNA; dependence; non-stationary; promoters; strong invariance principle; simultaneous confidence bounds; NUCLEOTIDE COMPOSITION; PERIODICITY; TRENDS;
D O I
10.1080/02331888.2016.1261474
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Gene promoters have variable repartition of AGCT nucleotides according to some probabilistic behaviours essentially depending on their position in a string. The paper aims to provide a model for this configuration. With this model we derive non-uniform confidence bounds for those probability distributions in the strings. A uniform bound deriving from previous works in Wu and Zhao [Inference of trends in time series. J R Stat Soc B. 2007; 69: 391-410] is more demanding for the model. A data-based study allows to clarify our suggestions and open the way for applications in molecular biology.
引用
收藏
页码:154 / 166
页数:13
相关论文
共 50 条