Finding haplotype block boundaries by using the minimum-description-length principle

被引:63
|
作者
Anderson, EC [1 ]
Novembre, J [1 ]
机构
[1] Univ Calif Berkeley, Dept Integrat Biol, Berkeley, CA 94720 USA
关键词
D O I
10.1086/377106
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We present a method for detecting haplotype blocks that simultaneously uses information about linkage-disequilibrium decay between the blocks and the diversity of haplotypes within the blocks. By use of phased single-nucleotide polymorphism data, our method partitions a chromosome into a series of adjacent, nonoverlapping blocks. The partition is made by choosing among a family of Markov models for block structure in a chromosomal region. Specifically, in the model, the occurrence of haplotypes within blocks follows a time-inhomogeneous Markov process along the chromosome, and we choose among possible partitions by using the two-stage minimum-description-length criterion. When applied to data simulated from the coalescent with recombination hotspots, our method reliably situates block boundaries at the hotspots and infrequently places block boundaries at sites with background levels of recombination. We apply three previously published block-finding methods to the same data, showing that they either are relatively insensitive to recombination hotspots or fail to discriminate between background sites of recombination and hotspots. When applied to the 5q31 data of Daly et al., our method identifies more block boundaries in agreement with those found by Daly et al. than do other methods. These results suggest that our method may be useful for designing association-based mapping studies that exploit haplotype blocks.
引用
收藏
页码:336 / 354
页数:19
相关论文
共 50 条
  • [1] Minimum-description-length based approach to CT reconstruction using truncated projections from objects with unknown boundaries
    Yuasa, T
    Devaraj, B
    Watanabe, Y
    Sato, T
    Sasaki, Y
    Hoshino, A
    Inaba, H
    Akatsuka, T
    STATISTICAL AND STOCHASTIC METHODS IN IMAGE PROCESSING II, 1997, 3167 : 186 - 195
  • [2] Model selection using the minimum description length principle
    Bryant, PG
    Cordero-Braña, OI
    AMERICAN STATISTICIAN, 2000, 54 (04): : 257 - 268
  • [3] Minimum description length block finder, a method to identify haplotype blocks and to compare the strength of block boundaries (Retraction of vol 73, pg 86, 2003)
    Mannila, H
    Koivisto, M
    Perola, M
    Varilo, T
    Hennah, W
    Ekelund, J
    Lukk, M
    Peltonen, L
    Ukkonen, E
    AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (02) : 444 - 444
  • [4] Introducing the minimum description length principle
    Grünwald, P
    ADVANCES IN MINIMUM DESCRIPTION LENGTH THEORY AND APPLICATIONS, 2005, : 3 - 21
  • [5] A minimum description length principle for perception
    Chater, N
    ADVANCES IN MINIMUM DESCRIPTION LENGTH THEORY AND APPLICATIONS, 2005, : 385 - 409
  • [6] Identifying Career Boundaries Using Minimum Description Length on a Graph
    Uliana, Ronie M.
    De Castro, Leandro N.
    IEEE ACCESS, 2018, 6 : 42407 - 42421
  • [7] INFERRING DECISION TREES USING THE MINIMUM DESCRIPTION LENGTH PRINCIPLE
    QUINLAN, JR
    RIVEST, RL
    INFORMATION AND COMPUTATION, 1989, 80 (03) : 227 - 248
  • [8] Regression spline smoothing using the minimum description length principle
    Lee, TCM
    STATISTICS & PROBABILITY LETTERS, 2000, 48 (01) : 71 - 82
  • [9] The minimum description length principle in coding and modeling
    Barron, A
    Rissanen, J
    Yu, B
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1998, 44 (06) : 2743 - 2760
  • [10] Incremental Learning with the Minimum Description Length Principle
    Murena, Pierre-Alexandre
    Cornuejols, Antoine
    Dessalles, Jean-Louis
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 1908 - 1915