Performance Analysis of Data Mining Algorithms Based on PCA

被引:0
|
作者
Bai, Ruifeng [1 ]
Wang, Jie [2 ]
Yang, Lin [2 ]
Pan, Jingchang [2 ]
机构
[1] Shandong Univ, Coll Business, Weihai 264209, Peoples R China
[2] Shandong Univ, Scholl Mech Elect & Informat Engn, Weihai 264209, Peoples R China
关键词
PCA; Classification; Clustering; Spectrum; Cataclysmic Variable Star; DIGITAL SKY SURVEY; CATACLYSMIC VARIABLES;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Data mining algorithms behave differently under different application context. It is an important topic to find out the characteristics of the relevant algorithms. This paper studied PCA based dimension reduction and the functional performance of data mining algorithms (ANN, Bayes, KNN, K-means) under different dimension reduction rates in finding Cataclysmic Variable Stars(CVs) in a hybrid celestial spectra dataset. The dataset was selected from SDSS(Sloan Digital Sky Survey), 1417 spectra altogether. In the dataset, there are 15 CVs, along with other type of celestial bodies. ANN, Bayes, KNN and K-means were chosen to test their performances in finding CVs and time cost under different PCA dimensions. The classification accuracy and time cost were analyzed of the four mentioned algorithms in detail under different PCA dimensions. A series of experiments were done to carry out our research. Through this study, we can understand the inherent characteristics of the four algorithms and make better choices in future data mining applications.
引用
收藏
页码:1506 / 1509
页数:4
相关论文
共 50 条
  • [1] PERFORMANCE ANALYSIS OF DATA MINING ALGORITHMS FOR SOFTWARE QUALITY PREDICTION
    Gayatri, N.
    Nickolas, S.
    Reddy, A. V.
    Chitra, R.
    2009 INTERNATIONAL CONFERENCE ON ADVANCES IN RECENT TECHNOLOGIES IN COMMUNICATION AND COMPUTING (ARTCOM 2009), 2009, : 393 - 395
  • [2] ASYMPTOTIC PERFORMANCE ANALYSIS OF PCA ALGORITHMS BASED ON THE WEIGHTED SUBSPACE CRITERION
    Delmas, Jean Pierre
    Gabillon, Victor
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3237 - +
  • [3] Data Mining Algorithms(KNN & DT) Based Predictive Analysis on Selected Candidates in Academic Performance
    Ramalingam, M.
    Ilakkiya, R.
    2021 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2021), 2021, : 332 - 337
  • [4] Performance analysis of data mining algorithms for diagnosing COVID-19
    Nopour, Raoof
    Kazemi-Arpanahi, Hadi
    Shanbehzadeh, Mostafa
    Azizifar, Akbar
    JOURNAL OF EDUCATION AND HEALTH PROMOTION, 2021, 10 (01)
  • [5] Performance Analysis and Ranking of Data Mining Algorithms Across Multiple Datasets
    Nasor, Mohamed
    Ali, Sharaz
    2019 IEEE 19TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2019), 2019,
  • [6] Comparison Performance of Qualitative Bankruptcy Classification Based on Data Mining Algorithms
    Sjarif, Nilam Nur Amir
    Lim, Yee Fang
    Azmi, NurulHuda Firdaus Mohd
    Kamardin, Kamalia
    Ten, Doris Wong Hooi
    Abas, Hafiza
    Ali, Al Fahim Mubarak
    ADVANCED SCIENCE LETTERS, 2018, 24 (10) : 7602 - 7606
  • [7] Performance Comparison of ADRS and PCA as a Preprocessor to ANN for Data Mining
    Navaroli, Nicholas
    Turner, David
    Concepcion, Arturo I.
    Lynch, Robert S.
    ISDA 2008: EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 1, PROCEEDINGS, 2008, : 47 - +
  • [8] Mining performance data through nonlinear PCA with optimal scaling
    Costantini, Paola
    Linting, Marielle
    Porzio, Giovanni C.
    APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2010, 26 (01) : 85 - 101
  • [9] Performance Analysis ofUtility Mining algorithms
    Sivamathi, C.
    Vijayarani, S.
    2016 INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT), VOL 3, 2015, : 101 - 104
  • [10] Diabetes Recognition in Pregnant Women by Extracting Features Using PCA and Data Mining Algorithms
    Rahman, Mafizur
    Islam, Linta
    2019 IEEE PUNE SECTION INTERNATIONAL CONFERENCE (PUNECON), 2019,