Adaptive batch mode active learning with deep similarity

被引:1
|
作者
Zhang, Kaiyuan [1 ]
Qian, Buyue [2 ]
Wei, Jishang [3 ]
Yin, Changchang [1 ]
Cao, Shilei [1 ]
Li, Xiaoyu [1 ]
Cao, Yanjun [4 ]
Zheng, Qinghua [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect & Informat Engn, Xian 710049, Shaanxi, Peoples R China
[2] Capital Med Univ, Beijing Chaoyang Hosp, Beijing 100020, Peoples R China
[3] HP Labs, 1501 Page Mill Rd, Palo Alto, CA 94304 USA
[4] Northwest Univ, Biomed Key Lab Shaanxi Prov, Xian 710069, Peoples R China
基金
中国国家自然科学基金;
关键词
Active learning; Adaptive batch mode active learning; Classification model; Deep neural network; Deep learning; CLASSIFICATION;
D O I
10.1016/j.eij.2023.100412
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Active learning is usually used in scenarios where few labels are available and manual labeling is expensive. To improve model performance, it is necessary to find the most valuable instance among all instances and label it to maximize the benefits of labeling. In practical scenarios, it is often more efficient to query a group of instances instead of a individual instance during each iteration. To achieve this goal, we need to explore the similarities between instances to ensure the informativeness and diversity. Many ad-hoc algorithms are proposed for batch mode active learning, and there are generally two major issues. One is that similarity measurement among in-stances often only relies on the expression of features but it is not well integrated with the classification algo-rithm model. This will cut down the precise measurement of diversity. The other is that in order to explore the decision boundary, these algorithms often choose the instance near the boundary. It is difficult to get the true boundary when there are few labeled instances. As a large number of instances continue to be labeled, infor-mation between instances is less used, and the performance will be greatly improved if it is properly used. In our work, we propose an adaptive algorithm based on deep neural networks to solve the two problems mentioned above. During the training phase, we established a paired network to improve the accuracy of the classification model, and the network can project the instance to a new feature space for more accurate similarity measure-ment. When batch labeling instances, we use the adaptive algorithm to select the instance by balancing the maximum uncertainty (exploration) and diversity (exploitation). Our algorithm has been validated for heart failure prediction tasks in real-world EHR datasets. Due to the no public of EHR data, we also conducted vali-dation on two other classic classification tasks. Our algorithm is superior to the baseline method in both accuracy and convergence rate.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Batch-Mode Active Learning for Technology-Assisted Review
    Saha, Tanay Kumar
    Al Hasan, Mohammad
    Burgess, Chandler
    Habib, Md Ahsan
    Johnson, Jeff
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1134 - 1143
  • [32] Asymmetric propagation based batch mode active learning for image retrieval
    Niu, Biao
    Cheng, Jian
    Bai, Xiao
    Lu, Hanqing
    SIGNAL PROCESSING, 2013, 93 (06) : 1639 - 1650
  • [33] Cluster optimized batch mode active learning sample selection method
    He, Zhonghai
    Xia, Zhichao
    Du, Yinzhi
    Zhang, Xiaofang
    INFRARED PHYSICS & TECHNOLOGY, 2025, 145
  • [34] Batch Mode Active Learning for Node Classification in Assortative and Disassortative Networks
    Ping, Shuqiu
    Liu, Dayou
    Yang, Bo
    Zhu, Yungang
    Chen, Hechang
    Wang, Zheng
    IEEE ACCESS, 2018, 6 : 4750 - 4758
  • [35] A novel batch-mode active learning method for SVM classifier
    Liu, Kang
    Qian, Xu
    Journal of Information and Computational Science, 2012, 9 (16): : 5077 - 5084
  • [36] NimbleLearn: A Scalable and Fast Batch-mode Active Learning Approach
    Kong, Ruoyan
    Qiu, Zhanlong
    Liu, Yang
    Zhao, Qi
    21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 350 - 359
  • [37] A novel batch-mode active learning method for SVM classifier
    Liu, K. (liukang1112@gmail.com), 1600, Binary Information Press, Flat F 8th Floor, Block 3, Tanner Garden, 18 Tanner Road, Hong Kong (09):
  • [38] Batch Mode Active Learning with Applications to Text Categorization and Image Retrieval
    Hoi, Steven C. H.
    Jin, Rong
    Lyu, Michael R.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (09) : 1233 - 1248
  • [39] BatchRank: A Novel Batch Mode Active Learning Framework for Hierarchical Classification
    Chakraborty, Shayok
    Balasubramanian, Vineeth
    Sankar, Adepu Ravi
    Panchanathan, Sethuraman
    Ye, Jieping
    KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 99 - 108
  • [40] Efficient Transport Simulation With Restricted Batch-Mode Active Learning
    Antunes, Francisco
    Ribeiro, Bernardete
    Pereira, Francisco C.
    Gomes, Rui
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2018, 19 (11) : 3642 - 3651