A parallel algorithm for subset selection

Cited by: 1
Authors
Poston, WL
Wegman, EJ
Solka, JL
Affiliations
[1] USN, Ctr Surface Warfare, Dahlgren Div, Dahlgren, VA 22448 USA
[2] George Mason Univ, Ctr Computat Stat, Fairfax, VA 22030 USA
Keywords
parallel subset selection; information matrix; effective independence distribution; hat matrix
DOI
10.1080/00949659808811869
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Prior to performing an analysis of a large data set, it is often desirable to process a subset of the data only. Current methods of subset selection choose points in a random manner, which can lead to poor solutions. The method for selection described in this paper employs the Effective Independence Distribution (EID) method that chooses observations that optimize the determinant of the information matrix. Since the method requires repeated calculations of three matrix multiplications and a matrix inverse, it is computationally intensive for extremely large data sets. A recursive form of the EID is developed here which is suitable for parallelization. The parallel method is described in detail, and load balancing and communication issues are addressed. Implementation results on the Intel Paragon show that this is an effective parallel algorithm.
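
The abstract does not reproduce the EID formulas, so the following is only a minimal sketch under stated assumptions, not the authors' algorithm. It assumes the information matrix is X^T X, that each observation's EID value is its leverage (the corresponding diagonal entry of the hat matrix X (X^T X)^{-1} X^T), and that the point with the smallest value is deleted at each step. By the matrix determinant lemma, deleting row x_i scales the determinant by (1 - h_i), so removing the smallest-leverage point keeps the determinant as large as possible. The function names are hypothetical, and neither the recursive form nor the parallel decomposition from the paper is attempted here.

```python
def information_matrix(rows):
    """Return X^T X for a data matrix given as a list of rows."""
    p = len(rows[0])
    return [[sum(r[a] * r[b] for r in rows) for b in range(p)] for a in range(p)]


def solve(matrix, rhs):
    """Solve matrix @ z = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [list(matrix[i]) + [rhs[i]] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("information matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    z = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(aug[i][j] * z[j] for j in range(i + 1, n))
        z[i] = (aug[i][n] - tail) / aug[i][i]
    return z


def leverages(rows):
    """Assumed EID values: h_i = x_i^T (X^T X)^{-1} x_i for every row x_i."""
    m = information_matrix(rows)
    return [sum(a * b for a, b in zip(r, solve(m, r))) for r in rows]


def select_subset(rows, k):
    """Greedily delete the lowest-leverage row until k rows remain.

    Returns the indices (into the original rows) of the retained points.
    """
    keep = list(range(len(rows)))
    while len(keep) > k:
        h = leverages([rows[i] for i in keep])
        drop = min(range(len(keep)), key=lambda j: h[j])
        del keep[drop]
    return keep


if __name__ == "__main__":
    # Intercept column plus one predictor; values chosen only for illustration.
    data = [[1.0, x] for x in [0.0, 1.0, 2.0, 3.0, 10.0]]
    print(select_subset(data, 2))
```

Every pass of this direct version rebuilds the information matrix and solves against it for each remaining point, which illustrates why the abstract calls the computation intensive for very large data sets and motivates a recursive update.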
Pages: 1 / 17
Page count: 17
Related papers
50 in total
  • [41] A polynomial algorithm for best-subset selection problem
    Zhu, Junxian
    Wen, Canhong
    Zhu, Jin
    Zhang, Heping
    Wang, Xueqin
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (52) : 33117 - 33123
  • [42] A genetic algorithm applied to optimal gene subset selection
    Ding, SD
    Liu, J
    Wu, CL
    Yang, Q
    CEC2004: PROCEEDINGS OF THE 2004 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2004, : 1654 - 1660
  • [43] SUBSET SELECTION OF MYOELECTRIC CHANNELS A Genetic Algorithm for Subset Selection of Myoelectric Channels for Patients following TMR Surgery
    Kvas, Gernot
    Velik, Rosemarie
    BIOSIGNALS 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIO-INSPIRED SYSTEMS AND SIGNAL PROCESSING, 2009, : 222 - +
  • [44] A New MPR Selection Algorithm based on the Optimal Subset
    Zhang, Hong
    Fan, Wen-jie
    Wang, Chuan-zhen
    2015 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATIONS (CSA), 2015, : 109 - 112
  • [45] An Improved Approximation Algorithm for the Column Subset Selection Problem
    Boutsidis, Christos
    Mahoney, Michael W.
    Drineas, Petros
    PROCEEDINGS OF THE TWENTIETH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2009, : 968 - +
  • [46] Gabor filter subset selection using a genetic algorithm
    Mandriota, C
    Ancona, N
    Stella, E
    Distante, A
    OPTOMECHATRONIC SYSTEMS III, 2002, 4902 : 707 - 714
  • [47] From Sequential Algorithm Selection to Parallel Portfolio Selection
    Lindauer, M.
    Hoos, Holger H.
    Hutter, F.
    LEARNING AND INTELLIGENT OPTIMIZATION, LION 9, 2015, 8994 : 1 - 16
  • [48] A Fast Parallel Selection Algorithm on GPUs
    Bakunas-Milanowski, Darius
    Rego, Vernon
    Sang, Janche
    Yu, Chansu
    2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), 2015, : 609 - 614
  • [49] Parallel genetic algorithm with fading selection
    Akopov, Andranik S.
    INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2014, 49 (3-4) : 325 - 331
  • [50] Towards scalable rough set based attribute subset selection for intrusion detection using parallel genetic algorithm in MapReduce
    El-Alfy, El-Sayed M.
    Alshammari, Mashaan A.
    SIMULATION MODELLING PRACTICE AND THEORY, 2016, 64 : 18 - 29