Feature selection strategies for automated classification of digital media content

Cited by: 5
Authors
Rocha, Rocio [1]
Cobo, Angel [2]
Affiliations
[1] Univ Cantabria, Dept Business Adm, E-39005 Santander, Spain
[2] Univ Cantabria, Dept Appl Math & Computat Sci, E-39005 Santander, Spain
Keywords
automatic classification; clustering; digital media; feature selection; machine learning; text mining
DOI
10.1177/0165551511412028
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper proposes strategies for feature selection of digital news articles that allow an effective implementation of learning algorithms for the unsupervised classification of news articles. With the appropriate selection of a small subset of features a correct identification of related news can be achieved, thus enabling organizations and individual users to keep track of current events. The paper defines a quality measure of the discriminatory power of each feature and verifies that the selection of a feature subset with higher quality values allows obtaining good classification results. A Particle Swarm Optimization (PSO) based selection method is also proposed. Both proposals are validated on two collections of press clippings collated from news search services in digital media. Experimental results reveal that good classification accuracy can be achieved with small subsets of between 3 per cent and 6 per cent of the features.
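The idea of ranking features by a per-feature score and keeping only a small top fraction can be sketched as follows. This is a minimal illustration, not the authors' method: the record does not define the paper's quality measure, so the variance of each term's weight across documents is used here as an assumed stand-in, and the function name `select_top_features`, the toy vocabulary, and the weights are all hypothetical.

```python
from statistics import pvariance


def select_top_features(doc_term_rows, vocabulary, fraction=0.05):
    """Keep the highest-scoring `fraction` of features.

    The score (population variance of each term's weight across documents)
    is an illustrative stand-in, NOT the quality measure defined in the paper.
    """
    # Always keep at least one feature.
    n_keep = max(1, round(len(vocabulary) * fraction))
    # zip(*rows) transposes the matrix so each item is one term's column.
    scores = [pvariance(column) for column in zip(*doc_term_rows)]
    # Rank feature indices from highest to lowest score.
    ranked = sorted(range(len(vocabulary)), key=lambda j: scores[j], reverse=True)
    # Return the kept terms in their original vocabulary order.
    kept = sorted(ranked[:n_keep])
    return [vocabulary[j] for j in kept]


# Toy term-weight matrix: rows are documents, columns are terms.
vocabulary = ["the", "election", "goal", "said"]
rows = [
    [1.0, 0.9, 0.0, 0.5],
    [1.0, 0.8, 0.0, 0.5],
    [1.0, 0.0, 0.9, 0.5],
    [1.0, 0.0, 0.7, 0.5],
]
selected = select_top_features(rows, vocabulary, fraction=0.5)
print(selected)
```

In this toy case the terms whose weights are constant across documents score zero and are dropped, while the two topic-specific terms are kept. The reduced matrix would then be passed to a clustering algorithm; with real collections, `fraction` would be set in the 0.03 to 0.06 range reported in the abstract.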
Pages: 418-428
Number of pages: 11