Active learning sample selection - based on multicriteria

被引:0
|
作者
He, Zhonghai [1 ,2 ]
Shen, Kun [1 ,4 ]
Zhang, Xiaofang [3 ]
机构
[1] Northeastern Univ Qinhuangdao, Sch Control Engn, Qinhuangdao, Peoples R China
[2] Hebei Key Lab Micronano Precis Opt Sensing & Meas, Qinhuangdao, Peoples R China
[3] Beijing Inst Technol, Sch Opt & Photon, Beijing, Peoples R China
[4] Northeastern Univ Qinhuangdao, Sch Control Engn, Qinhuangdao 066000, Peoples R China
关键词
Multivariate calibration; multicriteria modeling; active learning; sample selection; CALIBRATION; REGRESSION; DENSITY; QUERY; SETS;
D O I
10.1177/09670335231211618
中图分类号
O69 [应用化学];
学科分类号
081704 ;
摘要
In multivariate calibration problems, model performance is affected significantly by the calibration samples used during model building. In recent years, active learning methods have become one of the best methods for sample selection. However, most active learning methods only select instances from prediction uncertainty or sample space distance, and these single-criteria methods tend to select undesired samples. In addition, sample density characterizes the spatial information carried by the sample, but few studies in quantitative analysis utilize sample density alone to select calibration samples. Considering these issues, based on the k-means clustering algorithm, this paper proposes an active learning sample selection method (DIDAL), which combines the three criteria of diversity, informativeness and sample density. The most representative sample is iteratively selected for - addition to the calibration set for modeling and estimating the chemical concentration of analytes. Soybean meal and soy sauce samples were analyzed by DIDAL and compared with existing sample selection methods. The prediction results show that the DIDAL algorithm significantly outperforms several existing algorithms and is close to the performance of full-sample modeling. A model with high prediction accuracy can be constructed by selecting only a few samples using the DIDAL method.
引用
收藏
页码:289 / 297
页数:9
相关论文
共 50 条
  • [11] Sample Selection Based on Active Learning for Short-Term Wind Speed Prediction
    Yang, Jian
    Zhao, Xin
    Wei, Haikun
    Zhang, Kanjian
    ENERGIES, 2019, 12 (03)
  • [12] Sample diversity selection strategy based on label distribution morphology for active label distribution learning
    Li, Weiwei
    Qian, Wei
    Chen, Lei
    Jia, Xiuyi
    PATTERN RECOGNITION, 2024, 150
  • [13] Batch Mode Active Learning for Semantic Segmentation Based on Multi-Clue Sample Selection
    Tan, Yao
    Yang, Liu
    Hu, Qinghua
    Du, Zhibin
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 831 - 840
  • [14] Contrastive Open-Set Active Learning-Based Sample Selection for Image Classification
    Yan, Zizheng
    Ruan, Delian
    Wu, Yushuang
    Huang, Junshi
    Chai, Zhenhua
    Han, Xiaoguang
    Cui, Shuguang
    Li, Guanbin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 5525 - 5537
  • [15] Multicriteria-Based Active Discriminative Dictionary Learning for Scene Recognition
    Zheng, Caixia
    Yi, Yugen
    Qi, Miao
    Liu, Fucong
    Bi, Chao
    Wang, Jianzhong
    Kong, Jun
    IEEE ACCESS, 2018, 6 : 4416 - 4426
  • [16] Cluster optimized batch mode active learning sample selection method
    He, Zhonghai
    Xia, Zhichao
    Du, Yinzhi
    Zhang, Xiaofang
    INFRARED PHYSICS & TECHNOLOGY, 2025, 145
  • [17] Improving Annotation Efficiency through Uncertain Sample Selection in Active Learning
    Kawano, Yasufumi
    Nota, Yoshiki
    Mochizuki, Rinpei
    Aoki, Yoshimitsu
    Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision Engineering, 2022, 88 (02): : 211 - 216
  • [18] Active Manifold Learning via Gershgorin Circle Guided Sample Selection
    Xu, Hongteng
    Zha, Hongyuan
    Li, Ren-Cang
    Davenport, Mark A.
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 3108 - 3114
  • [19] Active learning support vector machines for optimal sample selection in classification
    Zomer, S
    Sänchez, MDN
    Brereton, RG
    Pavón, JLP
    JOURNAL OF CHEMOMETRICS, 2004, 18 (06) : 294 - 305
  • [20] Active Learning-Based Sample Selection for Label-Efficient Blind Image Quality Assessment
    Song, Tianshu
    Li, Leida
    Cheng, Deqiang
    Chen, Pengfei
    Wu, Jinjian
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5884 - 5896