A Serial Sample Selection Framework for Active Learning

被引:0
|
作者
Li, Chengchao [1 ]
Zhao, Pengpeng [1 ]
Wu, Jian [1 ]
Xu, Haihui [1 ]
Cui, Zhiming [1 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
关键词
Data Mining; Active Learning; Sampling Strategy; Uncertainty; Representativeness;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Active Learning is a machine learning and data mining technique that selects the most informative samples for labeling and uses them as training data. It aims to obtain a high performance classifier by labeling as little data as possible from large amount of unlabeled samples, which means sampling strategy is the core issue. Existing approaches either tend to ignore information in unlabeled data and are prone to querying outliers or noise samples, or calculate large amounts of non-informative samples leading to significant computation cost. In order to solve above problems, this paper proposed a serial active learning framework. It first measures uncertainty of unlabeled samples and selects the most uncertain sample set. From which, it further generates the most representative sample set based on the mutual information criterion. Finally, the framework selects the most informative sample from the most representative sample set based on expected error reduction strategy. Experimental results on multiple datasets show that our approach outperforms Random Sampling and the state of the art adaptive active learning method.
引用
收藏
页码:435 / 446
页数:12
相关论文
共 50 条
  • [41] Serial entrepreneurship, learning by doing and self-selection
    Rocha, Vera
    Carneiro, Anabela
    Varum, Celeste Amorim
    INTERNATIONAL JOURNAL OF INDUSTRIAL ORGANIZATION, 2015, 40 : 91 - 106
  • [42] SiFT: A Serial Framework with Textual Guidance for Federated Learning
    Li, Xuyang
    Zhang, Weizhuo
    Yu, Yue
    Zheng, Wei-Shi
    Zhang, Tong
    Wang, Ruixuan
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT X, 2024, 15010 : 655 - 665
  • [43] An Augmented Sample Selection Framework for Prediction of Anticancer Peptides
    Tao, Huawei
    Shan, Shuai
    Fu, Hongliang
    Zhu, Chunhua
    Liu, Boye
    MOLECULES, 2023, 28 (18):
  • [44] Feature Selection in Meta Learning Framework
    Shilbayeh, Samar
    Vadera, Sunil
    2014 SCIENCE AND INFORMATION CONFERENCE (SAI), 2014, : 269 - 275
  • [45] Active Learning-Based Sample Selection for Label-Efficient Blind Image Quality Assessment
    Song, Tianshu
    Li, Leida
    Cheng, Deqiang
    Chen, Pengfei
    Wu, Jinjian
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5884 - 5896
  • [46] Efficient Active Learning for Image Classification and Segmentation Using a Sample Selection and Conditional Generative Adversarial Network
    Mahapatra, Dwarikanath
    Bozorgtabar, Behzad
    Thiran, Jean-Philippe
    Reyes, Mauricio
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2018, PT II, 2018, 11071 : 580 - 588
  • [47] Hierarchical Analog Circuit Reliability Analysis using Multivariate Nonlinear Regression and Active Learning Sample Selection
    Maricau, Elie
    De Jonghe, Dimitri
    Gielen, Georges
    DESIGN, AUTOMATION & TEST IN EUROPE (DATE 2012), 2012, : 745 - 750
  • [48] AN ACTIVE LEARNING METHOD USING CLUSTERING AND COMMITTEE-BASED SAMPLE SELECTION FOR SOUND EVENT CLASSIFICATION
    Zhao Shuyang
    Heittola, Toni
    Virtanen, Tuomas
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 116 - 120
  • [49] Release from active learning/model selection dilemma: Optimizing sample points and models at the same time
    Sugiyama, M
    Ogawa, H
    PROCEEDING OF THE 2002 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, 2002, : 2917 - 2922
  • [50] Double Machine Learning for Sample Selection Models
    Bia, Michela
    Huber, Martin
    Laffers, Lukas
    JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2024, 42 (03) : 958 - 969