Greedy is not Enough: An Efficient Batch Mode Active Learning Algorithm

Cited by: 2
Authors
Xu, Zuobing [1]
Hogan, Christopher [2]
Bauer, Robert [2]
Affiliations
[1] eBay Inc, San Jose, CA 95125 USA
[2] H5 Inc, San Francisco, CA 94105 USA
Keywords
large scale; active learning; greedy algorithm; submodular functions
DOI
10.1109/ICDMW.2009.38
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Active learning algorithms select the training examples for which labels are requested from domain experts, and they are very effective at reducing human labeling effort in supervised learning. To reduce training time and to provide a more convenient user interaction environment, it is necessary to select batches of new training examples rather than a single example at a time. Batch mode active learning algorithms incorporate a diversity measure to construct a batch of diverse candidate examples. Existing approaches use greedy algorithms, which are feasible at the scale of thousands of examples. Greedy algorithms, however, are not efficient enough to scale to larger real-world classification applications that contain millions of examples. In this paper, we present an extremely efficient active learning algorithm. The new algorithm achieves the same results as the traditional greedy algorithm while reducing run time by a factor of several hundred. We prove that the objective function of the algorithm is submodular, which guarantees that it finds the same solution as the greedy algorithm. We evaluate our approach on several large-scale real-world text classification problems and show that it achieves substantial speedups while obtaining the same classification accuracy.
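The abstract does not describe how the speedup is obtained. One well-known way in which submodularity lets a faster procedure return exactly the greedy solution is lazy evaluation of marginal gains, and the sketch below illustrates that general idea only. It is not claimed to be the authors' algorithm. The set-coverage objective, the function names, and the smallest-index tie-breaking rule are all assumptions made for illustration.

```python
import heapq

def coverage_gain(covered, item_set):
    # Marginal gain of a toy coverage objective (an assumed stand-in,
    # not the paper's objective): number of newly covered elements.
    return len(item_set - covered)

def naive_greedy(sets, k):
    # Re-evaluates every remaining candidate at each step.
    covered, chosen = set(), []
    remaining = set(range(len(sets)))
    for _ in range(min(k, len(sets))):
        # Largest gain wins; ties go to the smallest index.
        best = min(remaining, key=lambda i: (-coverage_gain(covered, sets[i]), i))
        chosen.append(best)
        covered |= sets[best]
        remaining.remove(best)
    return chosen

def lazy_greedy(sets, k):
    # Keeps stale gains in a heap. By submodularity a stale gain is an
    # upper bound on the current gain, so most candidates are never re-evaluated.
    covered, chosen = set(), []
    heap = [(-len(s), i) for i, s in enumerate(sets)]
    heapq.heapify(heap)
    while heap and len(chosen) < k:
        _, i = heapq.heappop(heap)
        fresh = (-coverage_gain(covered, sets[i]), i)
        # If the refreshed entry still beats the best stale bound, it is
        # the true greedy choice under the same tie-breaking rule.
        if not heap or fresh <= heap[0]:
            chosen.append(i)
            covered |= sets[i]
        else:
            heapq.heappush(heap, fresh)
    return chosen

if __name__ == "__main__":
    candidates = [{1, 2, 3}, {3, 4}, {4, 5, 6, 7}, {1, 7}, {8}]
    print(naive_greedy(candidates, 3))
    print(lazy_greedy(candidates, 3))
```

Both functions return the same batch on this toy input. The lazy variant only recomputes the gain of whichever candidate currently sits at the top of the heap, which is where the savings come from when the candidate pool is large.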
Pages: 326 / +
Page count: 2