A parallel algorithm for subset selection

被引:1
|
作者
Poston, WL
Wegman, EJ
Solka, JL
机构
[1] USN, Ctr Surface Warfare, Dahlgren Div, Dahlgren, VA 22448 USA
[2] George Mason Univ, Ctr Computat Stat, Fairfax, VA 22030 USA
关键词
parallel subset selection; information matrix; effective independence distribution; hat matrix;
D O I
10.1080/00949659808811869
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Prior to performing an analysis of a large data set, it is often desirable to process a subset of the data only. Current methods of subset selection choose points in a random manner, which can lead to poor solutions. The method for selection described in this paper employs the Effective Independence Distribution (EID) method that chooses observations that optimize the determinant of the information matrix. Since the method requires repeated calculations of three matrix multiplications and a matrix inverse, it is computationally intensive for extremely large data sets. A recursive form of the EID is developed here which is suitable for parallelization. The parallel method is described in detail, and load balancing and communication issues are addressed. Implementation results on the Intel Paragon show that this is an effective parallel algorithm.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 50 条
  • [31] Generalized Branch and Bound Algorithm for feature subset selection
    Viswanath, P.
    Kumar, P. Vinay
    Babu, V. Suresh
    Kumar, M. Venkateswara
    ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL II, PROCEEDINGS, 2007, : 214 - +
  • [32] Binary Owl Search Algorithm for Feature Subset Selection
    Mandal, Ashis Kumar
    Sen, Rikta
    Chakraborty, Basabi
    2019 IEEE 10TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST 2019), 2019, : 186 - 191
  • [33] An evolutionary algorithm for subset selection in causal inference models
    Cho, Wendy K. Tam
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2018, 69 (04) : 630 - 644
  • [34] A Feature Subset Selection Algorithm Automatic Recommendation Method
    Wang, Guangtao
    Song, Qinbao
    Sun, Heli
    Zhang, Xueying
    Xu, Baowen
    Zhou, Yuming
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2013, 47 : 1 - 34
  • [35] On the optimality of the backward greedy algorithm for the subset selection problem
    Couvreur, C
    Bresler, Y
    SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2000, 21 (03) : 797 - 808
  • [36] A Conservative Feature Subset Selection Algorithm with Missing Data
    Aussem, Alex
    de Morais, Sergio Rodrigues
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 725 - 730
  • [37] An Optimal Subset Selection Algorithm for Distributed Hypothesis Test
    Li, Jiarui
    Guo, Guangbao
    IAENG International Journal of Applied Mathematics, 2024, 54 (12) : 2811 - 2815
  • [38] An Improved Greedy Algorithm for Subset Selection in Linear Estimation
    Dutta, Shamak
    Wilde, Nils
    Smith, Stephen L.
    2022 EUROPEAN CONTROL CONFERENCE (ECC), 2022, : 1067 - 1072
  • [39] Feature subset selection by gravitational search algorithm optimization
    Han, XiaoHong
    Chang, XiaoMing
    Quan, Long
    Xiong, XiaoYan
    Li, JingXia
    Zhang, ZhaoXia
    Liu, Yi
    INFORMATION SCIENCES, 2014, 281 : 128 - 146
  • [40] A conservative feature subset selection algorithm with missing data
    Aussem, Alex
    de Morais, Sergio Rodrigues
    NEUROCOMPUTING, 2010, 73 (4-6) : 585 - 590