Learning Algorithm in Two-Stage Selective Prediction

被引:1
|
作者
Ye, Weicheng [1 ]
Chen, Dangxing [2 ]
Ramazanli, Ilqar [3 ]
机构
[1] Credit Suisse Secur, New York, NY 10010 USA
[2] Duke Kunshan Univ, Zu Chongzhi Ctr Math & Computat Sci, Kunshan, Jiangsu, Peoples R China
[3] Facebook, New York, NY USA
关键词
Deep Learning; Selective Prediction; Pattern Recognition; Analysis of Machine Learning Algorithms;
D O I
10.1109/CACML55074.2022.00093
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data gathered from real-world applications often suffer from corruption. The low-quality data will hinder the performance of the learning system in terms of classification accuracy, model building time, and interpretability of the classifier. Selective prediction, also known as prediction with a reject option, is to reduce the error rate by abstaining from prediction under uncertainty while keeping coverage as high as possible. Deep Neural Network (DNN) has a high capacity for fitting large-scale data. If DNNs can leverage the tradeoff coverage by selective prediction, then the performance can potentially be improved. However, the current DNN embedded with the reject option requires the knowledge of the rejection threshold, and the searching of threshold is inefficient in large-scale applications. Besides, the abstention of prediction on partial datasets increases the model bias and might not be optimal. To resolve these problems, we propose innovative threshold learning algorithms integrated with the selective prediction that can estimate the intrinsic rejection rate of the dataset. Correspondingly, we provide a rigorous framework to generalize the estimation of data corruption rate. To leverage the advantage of multiple learning algorithms, we extend our learning algorithms to a hierarchical two-stage system. Our methods have the advantage of being flexible with any neural network architecture. The empirical results show that our algorithms can achieve state-of-the-art performance in challenging real-world datasets in both classification and regression problems.
引用
收藏
页码:512 / 521
页数:10
相关论文
共 50 条
  • [1] A Two-Stage Learning Method for Response Prediction
    Chen, Kuan-Hsi
    Ting, Zih-Yun
    Shen, Jia-Ying
    Hu, Yuh-Jyh
    Liang, Tyne
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2015, : 1336 - 1341
  • [2] Heterogeneous defect prediction with two-stage ensemble learning
    Zhiqiang Li
    Xiao-Yuan Jing
    Xiaoke Zhu
    Hongyu Zhang
    Baowen Xu
    Shi Ying
    Automated Software Engineering, 2019, 26 : 599 - 651
  • [3] Heterogeneous defect prediction with two-stage ensemble learning
    Li, Zhiqiang
    Jing, Xiao-Yuan
    Zhu, Xiaoke
    Zhang, Hongyu
    Xu, Baowen
    Ying, Shi
    AUTOMATED SOFTWARE ENGINEERING, 2019, 26 (03) : 599 - 651
  • [5] Two-stage learning algorithm for fuzzy cognitive maps
    Papageorgiou, EI
    Groumpos, PP
    2004 2ND INTERNATIONAL IEEE CONFERENCE INTELLIGENT SYSTEMS, VOLS 1 AND 2, PROCEEDINGS, 2004, : 82 - 87
  • [6] A two-stage solution algorithm for paroxysmal atrial fibrillation prediction
    Lynn, KS
    Chiang, HD
    COMPUTERS IN CARDIOLOGY 2001, VOL 28, 2001, 28 : 405 - 407
  • [7] A prediction-based two-stage replica replacement algorithm
    Tian, Tian
    Luo, Junzhou
    PROCEEDINGS OF THE 2007 11TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, VOLS 1 AND 2, 2007, : 594 - +
  • [8] Two-stage learning algorithm for biomedical named entity recognition
    Che X.-J.
    Xu H.
    Pan M.-Y.
    Liu Q.-L.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2023, 53 (08): : 2380 - 2387
  • [9] A two-stage federated learning method for personalization via selective collaboration
    Xu, Jiuyun
    Zhou, Liang
    Zhao, Yingzhi
    Li, Xiaowen
    Zhu, Kongshang
    Xu, Xiangrui
    Duan, Qiang
    Zhang, Ruru
    COMPUTER COMMUNICATIONS, 2025, 232
  • [10] Two-Stage Metric Learning
    Wang, Jun
    Sun, Ke
    Sha, Fei
    Marchand-Maillet, Stephane
    Kalousis, Alexandros
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 370 - 378