Feature Selection for Clustering Online Learners

被引:33
|
作者
Huang, Lei [1 ]
Wang, Xinghui [1 ]
Wu, Zhouhua [1 ]
Wang, Feiyu [2 ]
机构
[1] Guangxi Radio & TV Univ, Smart Educ Lab, Nanning, Peoples R China
[2] Miami Univ, Oxford, OH 45056 USA
关键词
clustering; learning analytics; educational data mining; feature selection; dimensionality reduction;
D O I
10.1109/EITT.2019.00009
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
As one of the most important approaches for learning analytics and educational data mining, various clustering algorithms have been explored and compared in the analysis of online learners by their behavior. However, choosing which features for clustering has a strong impact on the quality of clustering, and has not received enough attention yet. By using an entropy-based feature selection method, this research broadens the range of candidate initial features and provides efficient algorithms for extracting the most important features. Experiment with real-life data reveals that this method not only overcomes the data sparsity and complexity problems for clustering in high-dimensional feature space but also surpasses dimensionality reduction methods like PCA in interpretability.
引用
收藏
页码:1 / 6
页数:6
相关论文
共 50 条
  • [1] Online Feature Selection Based on Fuzzy Clustering and Its Applications
    Thanh Minh Nguyen
    Wu, Q. M. Jonathan
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2016, 24 (06) : 1294 - 1306
  • [2] An Online Unsupervised Streaming Features Selection Through Dynamic Feature Clustering
    Yan X.
    Homaifar A.
    Sarkar M.
    Lartey B.
    Gupta K.D.
    IEEE Transactions on Artificial Intelligence, 2023, 4 (05): : 1281 - 1292
  • [3] On online high-dimensional spherical data clustering and feature selection
    Amayri, Ola
    Bouguila, Nizar
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (04) : 1386 - 1398
  • [4] Feature selection for clustering
    Dash, M
    Liu, H
    KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS: CURRENT ISSUES AND NEW APPLICATIONS, 2000, 1805 : 110 - 121
  • [5] Unsupervised Feature Selection with Feature Clustering
    Cheung, Yiu-ming
    Jia, Hong
    2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 1, 2012, : 9 - 15
  • [6] Improving detection of untrustworthy online reviews using ensemble learners combined with feature selection
    Heredia B.
    Khoshgoftaar T.M.
    Prusa J.D.
    Crawford M.
    Social Network Analysis and Mining, 2017, 7 (1)
  • [7] Data-Driven Feature Word Selection for Clustering Online News Comments
    Cho, Heeryon
    Lee, Jong-Seok
    2016 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2016, : 494 - 497
  • [8] Feature Selection for Visual Clustering
    Alagambigai, P.
    Thangavel, K.
    2009 INTERNATIONAL CONFERENCE ON ADVANCES IN RECENT TECHNOLOGIES IN COMMUNICATION AND COMPUTING (ARTCOM 2009), 2009, : 498 - +
  • [9] On feature selection through clustering
    Butterworth, R
    Piatetsky-Shapiro, G
    Simovici, DA
    FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 581 - 584
  • [10] Feature selection and document clustering
    Dhillon, I
    Kogan, J
    Nicholas, C
    SURVEY OF TEXT MINING: CLUSTERING, CLASSIFICATION, AND RETRIEVAL, 2004, : 73 - 100