Feature selection strategies for automated classification of digital media content

Cited by: 5
Authors
Rocha, Rocio [1]
Cobo, Angel [2]
Affiliations
[1] Univ Cantabria, Dept Business Adm, E-39005 Santander, Spain
[2] Univ Cantabria, Dept Appl Math & Computat Sci, E-39005 Santander, Spain
Keywords
automatic classification; clustering; digital media; feature selection; machine learning; text mining
DOI
10.1177/0165551511412028
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper proposes strategies for feature selection of digital news articles that allow an effective implementation of learning algorithms for the unsupervised classification of news articles. With the appropriate selection of a small subset of features a correct identification of related news can be achieved, thus enabling organizations and individual users to keep track of current events. The paper defines a quality measure of the discriminatory power of each feature and verifies that the selection of a feature subset with higher quality values allows obtaining good classification results. A Particle Swarm Optimization (PSO) based selection method is also proposed. Both proposals are validated on two collections of press clippings collated from news search services in digital media. Experimental results reveal that good classification accuracy can be achieved with small subsets of between 3 per cent and 6 per cent of the features.
Pages: 418-428
Page count: 11
Related papers
50 in total
  • [31] FEATURE SELECTION METHOD FOR ML/DL CLASSIFICATION OF NETWORK ATTACKS IN DIGITAL FORENSICS
    Grakovski, Alexander
    Krivchenkov, Aleksandr
    Misnevs, Boriss
    TRANSPORT AND TELECOMMUNICATION JOURNAL, 2022, 23 (02) : 131 - 141
  • [32] Accelerating Crisis Response: Automated Image Classification for Geolocating Social Media Content
    Firmansyah, Hafiz Budi
    Fernandez-Marquez, Jose Luis
    Oguz Mulayim, Mehmet
    Gomes, Jorge
    Lorini, Valerio
    PROCEEDINGS OF THE 2023 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING, ASONAM 2023, 2023, : 77 - 81
  • [33] A classification algorithm and optimal feature selection methodology for automated solder joint defect inspection
    Oyeleye, O
    Lehtihet, EA
    JOURNAL OF MANUFACTURING SYSTEMS, 1998, 17 (04) : 251 - 262
  • [34] An Automated Text Classification Method: Using Improved Fuzzy Set Approach for Feature Selection
    Abbasi, Bushra Zaheer
    Hussain, Shahid
    Faisal, Muhammad Imran
    PROCEEDINGS OF 2019 16TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2019, : 666 - 670
  • [35] Optimal feature selection for automated classification of FDG-PET in patients with suspected dementia
    Serag, Ahmed
    Wenzel, Fabian
    Thiele, Frank
    Buchert, Ralph
    Young, Stewart
    MEDICAL IMAGING 2009: COMPUTER-AIDED DIAGNOSIS, 2009, 7260
  • [36] Feature Selection for Classification with QAOA
    Turati, Gloria
    Dacrema, Maurizio Ferrari
    Cremonesi, Paolo
    2022 IEEE INTERNATIONAL CONFERENCE ON QUANTUM COMPUTING AND ENGINEERING (QCE 2022), 2022, : 782 - 785
  • [37] Feature Selection for Collective Classification
    Senliol, Baris
    Aral, Atakan
    Cataltepe, Zehra
    2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 285 - 290
  • [38] ONLINE FEATURE SELECTION AND CLASSIFICATION
    Kalkan, Habil
    Cetisli, Bayram
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 2124 - 2127
  • [39] Feature Selection for Twitter Classification
    Ostrowski, David Alfred
    2014 IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2014, : 267 - 272
  • [40] Feature Selection for Monotonic Classification
    Hu, Qinghua
    Pan, Weiwei
    Zhang, Lei
    Zhang, David
    Song, Yanping
    Guo, Maozu
    Yu, Daren
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2012, 20 (01) : 69 - 81