Cluster-Based News Representative Generation with Automatic Incremental Clustering

被引:0
|
作者
Shabirin, Irsal [1 ]
Barakbah, Ali Ridho [1 ]
Syarif, Iwan [1 ]
机构
[1] Politekn Elekt Negeri Surabaya, Grad Sch Informat & Comp Engn, Jl Raya Its Sukolilo Sur 60111, Indonesia
关键词
Clustering; Metadata Aggregation; Automatic Incremental Clustering; Representative News;
D O I
10.24003/emitter.v7i2.378
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Nowadays we are facing much abundant information, especially news, and makes us confused in sorting out the information, so that it wastes our time in filtering that information. Though the news often contains similar contents that should save our time for reading. In this paper, we propose a new approach to provide aggregation mechanisms from cluster-based news and produce representative news, using our proposed Automatic Incremental Clustering. This approach presents a mechanism for clustering incremental news data and dynamically providing an automatic creation of new clusters. This approach consists of six main functions, which are (1) Data acquisition with incremental news sources from several news service providers, (2) Keyword extraction for term representation of news data, (3) Metadata aggregation for creating vector space of terms, (4) Automatic clustering for initiating news cluster generation, (5) Automatic incremental clustering for clustering incoming news data to pre-determined clusters or creating a new cluster of news data, and (6) News representation for selecting the most representative news of data clusters. For experimental study, we involved 95 news data service providers with 751 news data for for creating initial clusters with automatic clustering and 110 news data for incremental automatic clustering. Our approach performed 85.14% accuracy for incremental automatic clustering, and is able to dynamically create new clusters for incremental news data.
引用
收藏
页码:467 / 479
页数:13
相关论文
共 50 条
  • [1] Automatic Representative News Generation using On-Line Clustering
    Sigita, Marlisa
    Barakbah, Ali Ridho
    Kusumaningtyas, Entin Martiana
    Winarno, Idris
    EMITTER-INTERNATIONAL JOURNAL OF ENGINEERING TECHNOLOGY, 2013, 1 (01) : 107 - 114
  • [2] REPRESENTATIVE POINTS AND CLUSTER ATTRIBUTES BASED INCREMENTAL SEQUENCE CLUSTERING ALGORITHM
    Wu, Di
    Ren, Jiadong
    COMPUTING AND INFORMATICS, 2017, 36 (06) : 1361 - 1384
  • [3] Automatic parallelization of representative-based clustering algorithms for multicore cluster systems
    Saiyedul Islam
    Sundar Balasubramaniam
    Shruti Gupta
    Shikhar Brajesh
    Rohan Badlani
    Nitin Labhishetty
    Abhinav Baid
    Poonam Goyal
    Navneet Goyal
    International Journal of Data Science and Analytics, 2020, 10 : 135 - 159
  • [4] Automatic parallelization of representative-based clustering algorithms for multicore cluster systems
    Islam, Saiyedul
    Balasubramaniam, Sundar
    Gupta, Shruti
    Brajesh, Shikhar
    Badlani, Rohan
    Labhishetty, Nitin
    Baid, Abhinav
    Goyal, Poonam
    Goyal, Navneet
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2020, 10 (02) : 135 - 159
  • [5] An incremental cluster-based approach to spam filtering
    Hsiao, Wen-Feng
    Chang, Te-Min
    EXPERT SYSTEMS WITH APPLICATIONS, 2008, 34 (03) : 1599 - 1608
  • [6] Adaptive and incremental query expansion for cluster-based browsing
    Eguchi, K
    Ito, H
    Kumamoto, A
    Kanata, Y
    6TH INTERNATIONAL CONFERENCE ON DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 1999, : 25 - 34
  • [7] Decoupling of clustering and classification steps in a cluster-based classification
    Hashemi, RR
    Bahar, M
    Childers, C
    Tyler, AA
    ICMLA 2005: FOURTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2005, : 285 - 290
  • [8] Cluster-Based Routing Algorithm for WSN Based on Subtractive Clustering
    Chen, Ling
    Liu, Wenwen
    Gong, Daofu
    Chen, Yan
    2020 16TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC, 2020, : 403 - 406
  • [9] The Core Cluster-Based Subspace Weighted Clustering Ensemble
    Huang, Xuan
    Qin, Fang
    Lin, Lin
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [10] Incremental Clustering of News Reports
    Azzopardi, Joel
    Staff, Christopher
    ALGORITHMS, 2012, 5 (03) : 364 - 378