Research on small files classification based on improved KNN algorithm and pretreatment strategy

被引:0
|
作者
Shi, Hengliang [1 ,2 ]
Bai, Xiaolei [1 ]
Zhen, Lintao [1 ]
机构
[1] Information Engineering College, Henan University of Science and Technology, No. 263, Kaiyuan Road, Luoyang, China
[2] Noah (Suzhou) IT Solution Co., Ltd, Suzhou, China
来源
ICIC Express Letters | 2015年 / 9卷 / 02期
关键词
Data handling - Learning algorithms - Information retrieval systems;
D O I
暂无
中图分类号
学科分类号
摘要
This article which combines MapReduce model with mass data processing innovatively, proposes small files classification and pretreatment strategy research on mass data. The described method provides more convenience for the parallel computing characteristics of MapReduce architecture, and saves a large amount of processing time. Meanwhile, the classification method is proved to be efficient and reliable through some experiments. The strategy of the paper can be widely applied to document classification and clustering research and application. © 2015, ICIC International.
引用
收藏
页码:603 / 608
相关论文
共 50 条
  • [41] An Efficient GNSS Coordinate Classification Strategy with an Adaptive KNN Algorithm for Epidemic Management
    Chen, Jong-Shin
    Kuo, Chun-Ming
    MATHEMATICS, 2024, 12 (04)
  • [42] Application of kNN Improved Algorithm in Automatic Classification of Network Public Proposal Cases
    Jiang Fuji
    Chu Chu
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA 2017), 2017, : 82 - 86
  • [43] Improved KNN Algorithm for Fine-Grained Classification of Encrypted Network Flow
    Ma, Chencheng
    Du, Xuehui
    Cao, Lifeng
    ELECTRONICS, 2020, 9 (02)
  • [44] A Research on Behavior of Sleepy Lizards Based on KNN Algorithm
    Guo, Xiaolv
    Chu, Shu-Chuan
    Tang, Lin-Lin
    Roddick, John F.
    Pan, Jeng-Shyang
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2012), PT II, 2012, 7197 : 109 - 118
  • [45] The Domain Classification Algorithm Based on KNN in Micro-blog
    Zhu, Guofeng
    Zhou, Zhurong
    Han, Fengjiao
    Ying, Zhongyun
    PROCEEDINGS OF 2013 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2012, : 188 - 192
  • [46] Study on microblogging marketing system based on KNN classification algorithm
    Meng, Qingqiang
    Han, Xue
    Metallurgical and Mining Industry, 2015, 7 (09): : 1096 - 1101
  • [47] Fast kNN classification algorithm based on partial distance search
    Chung Yuan Christian Univ, Chungli, Taiwan
    Electron Lett, 21 (2062-2063):
  • [48] Fast kNN classification algorithm based on partial distance search
    Hwang, WJ
    Wen, KW
    ELECTRONICS LETTERS, 1998, 34 (21) : 2062 - 2063
  • [49] Classification algorithm based on Weighted SVMs and locally Tuning kNN
    Wang Shu-Bin
    Ling Ping
    You Xiang-Yang
    Xu Ming
    Rong Xiang-Sheng
    BMEI 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOL 1, 2008, : 240 - +
  • [50] Human performance modeling for manufacturing based on an improved KNN algorithm
    Li, Ni
    Kong, Haipeng
    Ma, Yaofei
    Gong, Guanghong
    Huai, Wenqing
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2016, 84 (1-4): : 473 - 483