A Serial Sample Selection Framework for Active Learning

Authors
Li, Chengchao [1]
Zhao, Pengpeng [1]
Wu, Jian [1]
Xu, Haihui [1]
Cui, Zhiming [1]
Affiliations
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
Keywords
Data Mining; Active Learning; Sampling Strategy; Uncertainty; Representativeness
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Active learning is a machine learning and data mining technique that selects the most informative samples for labeling and uses them as training data. It aims to obtain a high-performance classifier while labeling as few samples as possible from a large pool of unlabeled data, which makes the sampling strategy the core issue. Existing approaches either ignore the information in unlabeled data and are therefore prone to querying outliers or noisy samples, or evaluate large numbers of non-informative samples and incur significant computational cost. To address these problems, this paper proposes a serial active learning framework. It first measures the uncertainty of the unlabeled samples and selects the most uncertain sample set. From this set, it then generates the most representative sample set based on the mutual information criterion. Finally, the framework selects the most informative sample from the most representative sample set using an expected error reduction strategy. Experimental results on multiple datasets show that the approach outperforms random sampling and the state-of-the-art adaptive active learning method.
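The three serial stages described in the abstract can be sketched roughly as follows. This is an illustrative sketch under stated assumptions, not the authors' implementation: the nearest-centroid softmax classifier, the function names, and the parameters `k_uncertain`, `k_repr`, and `gamma` are all hypothetical, and the representativeness stage swaps in a simple RBF density score in place of the mutual information criterion the paper uses. It also assumes every class has at least one labeled sample.

```python
import numpy as np

def fit_centroids(X, y, n_classes):
    # Hypothetical stand-in classifier: one mean vector per class.
    return np.stack([X[y == c].mean(axis=0) for c in range(n_classes)])

def predict_proba(centroids, X):
    # Softmax over negative squared distances to the class centroids.
    d2 = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    logits = -d2
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    p = np.exp(logits)
    return p / p.sum(axis=1, keepdims=True)

def entropy(p):
    # Shannon entropy of each row of class probabilities.
    return -(p * np.log(p + 1e-12)).sum(axis=1)

def serial_select(X_lab, y_lab, X_pool, n_classes,
                  k_uncertain=20, k_repr=5, gamma=1.0):
    centroids = fit_centroids(X_lab, y_lab, n_classes)
    proba = predict_proba(centroids, X_pool)

    # Stage 1: keep the k_uncertain pool samples with the highest entropy.
    unc_idx = np.argsort(-entropy(proba))[:k_uncertain]

    # Stage 2: keep the k_repr most representative of those.
    # Assumption: mean RBF similarity to the pool is used here as a
    # simple proxy, NOT the mutual information criterion from the paper.
    d2 = ((X_pool[unc_idx, None, :] - X_pool[None, :, :]) ** 2).sum(axis=2)
    density = np.exp(-gamma * d2).mean(axis=1)
    rep_idx = unc_idx[np.argsort(-density)[:k_repr]]

    # Stage 3: expected error reduction. For each candidate and each
    # possible label, retrain with the candidate added and measure the
    # mean entropy over the pool, weighted by the current posterior.
    best_idx, best_risk = -1, np.inf
    for i in rep_idx:
        risk = 0.0
        for c in range(n_classes):
            X_new = np.vstack([X_lab, X_pool[i]])
            y_new = np.append(y_lab, c)
            cen = fit_centroids(X_new, y_new, n_classes)
            risk += proba[i, c] * entropy(predict_proba(cen, X_pool)).mean()
        if risk < best_risk:
            best_idx, best_risk = int(i), risk
    return best_idx  # index into X_pool of the sample to query next
```

Because each stage shrinks the candidate set, the costly retraining loop in the last stage runs only `k_repr * n_classes` times rather than once per pool sample and label, which is the computational motivation the abstract gives for ordering the stages serially.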
Pages: 435-446
Page count: 12