Variables Extraction on Large Binary Variables in Discriminant Analysis based on Mixed Variables Location Model

被引:0
|
作者
Mei, Long Mei [1 ]
Hamid, Hashibah [1 ]
Aziz, Nazrina [1 ]
机构
[1] Univ Utara Alalatisia, Sch Quantitat Sci, UUM Sintok, Sintok 06010, Malaysia
关键词
PRINCIPAL-COMPONENTS-ANALYSIS; CLASSIFICATION;
D O I
10.1063/1.4937096
中图分类号
O59 [应用物理学];
学科分类号
摘要
The natural performance of the location model is a potential tool for allocating an object into one of the two observed groups involving mixtures of continuous and binary variables. In constructing location model, continuous variable is used to estimate parameters while binary variable is utilized to create segmentation in each group. Such segmentation is called as multinomial cells. Basically, the multinomial cells will grow exponentially according to the number of the binary variable. These multinomial cells will become empty when there is no object can be assigned into some of them. Then the occurring of empty cells will lead to unreliable parameter estimation. Consequently, the construction of the discriminant rule based on location model is impossible. Therefore, this paper attempts to discuss how the location model based on maximum likelihood estimation can be constructed even dealing with many measured binary variables. In other word, how is location model able to deal with the issue of many empty cells for classifying an object into correct group? For remedy this problem, this paper adapts nonlinear principal component analysis in order to reduce large binary variables considered in the study. This new strategy can be expected as an alternative discriminant tool practically when large number of binary variables are considered in a classification tasks.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Variable selection in discriminant analysis based on the location model for mixed variables
    Mahat, Nor Idayu
    Krzanowski, Wojtek Janusz
    Hernandez, Adolfo
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2007, 1 (02) : 105 - 122
  • [2] Variable selection in discriminant analysis based on the location model for mixed variables
    Nor Idayu Mahat
    Wojtek Janusz Krzanowski
    Adolfo Hernandez
    Advances in Data Analysis and Classification, 2007, 1 : 105 - 122
  • [3] Multiple Correspondence Analysis for Handling Large Binary Variables in Smoothed Location Model
    Huong, Penny Ngu Ai
    Hamid, Hashibah Binti
    Aziz, Nazrina Binti
    INNOVATION AND ANALYTICS CONFERENCE AND EXHIBITION (IACE 2015), 2015, 1691
  • [4] DISCRIMINANT-ANALYSIS BASED ON BINARY AND CONTINUOUS-VARIABLES
    TU, CT
    HAN, CP
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1982, 77 (378) : 447 - 454
  • [5] Discriminant analysis with mixed non normal variables
    Mbaeyi, George Chinanu
    Nweke, Chijioke Joel
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2023, 52 (01) : 39 - 45
  • [6] A comparison of discriminant procedures for binary variables
    Asparoukhov, OK
    Krzanowski, WJ
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2001, 38 (02) : 139 - 160
  • [7] Minimum distance probability discriminant analysis for mixed variables
    Núñez, M
    Villarroya, A
    Oller, JM
    BIOMETRICS, 2003, 59 (02) : 248 - 253
  • [8] Variable selection in discriminant analysis for mixed continuous-binary variables and several groups
    Mbina, Alban Mbina
    Nkiet, Guy Martial
    Obiang, Fulgence Eyi
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2019, 13 (03) : 773 - 795
  • [9] Variable selection in discriminant analysis for mixed continuous-binary variables and several groups
    Alban Mbina Mbina
    Guy Martial Nkiet
    Fulgence Eyi Obiang
    Advances in Data Analysis and Classification, 2019, 13 : 773 - 795
  • [10] SELECTION OF VARIABLES IN DISCRIMINANT ANALYSIS
    MERSCH, G
    ANNALES DE LA SOCIETE SCIENTIFIQUE DE BRUXELLES SERIES 1-SCIENCES MATHEMATIQUES ASTRONOMIQUES ET PHYSIQUES, 1973, 87 (03): : 299 - 309