Neyman-Pearson Multi-Class Classification via Cost-Sensitive Learning

被引:0
|
作者
Tian, Ye [1 ]
Feng, Yang [2 ]
机构
[1] Columbia Univ, Dept Stat, New York, NY USA
[2] NYU, Sch Global Publ Hlth, Dept Biostat, 708 Broadway, New York, NY 10003 USA
关键词
Cost-sensitive learning; Confusion matrix; Duality; Feasibility; Multi-class classification; Neyman-Pearson paradigm; MAXIMUM-LIKELIHOOD;
D O I
10.1080/01621459.2024.2402567
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Most existing classification methods aim to minimize the overall misclassification error rate. However, in applications such as loan default prediction, different types of errors can have varying consequences. To address this asymmetry issue, two popular paradigms have been developed: the Neyman-Pearson (NP) paradigm and the cost-sensitive (CS) paradigm. Previous studies on the NP paradigm have primarily focused on the binary case, while the multi-class NP problem poses a greater challenge due to its unknown feasibility. In this work, we tackle the multi-class NP problem by establishing a connection with the CS problem via strong duality and propose two algorithms. We extend the concept of NP oracle inequalities, crucial in binary classifications, to NP oracle properties in the multi-class context. Our algorithms satisfy these NP oracle properties under certain conditions. Furthermore, we develop practical algorithms to assess the feasibility and strong duality in multi-class NP problems, which can offer practitioners the landscape of a multi-class NP problem with various target error levels. Simulations and real data studies validate the effectiveness of our algorithms. To our knowledge, this is the first study to address the multi-class NP problem with theoretical guarantees. The proposed algorithms have been implemented in the R package npcs, which is available on CRAN. Supplementary materials for this article are available online, including a standardized description of the materials available for reproducing the work.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] A Decision-Theoretic Rough Set Approach to Multi-class Cost-Sensitive Classification
    Deng, Guojian
    Jia, Xiuyi
    ROUGH SETS, (IJCRS 2016), 2016, 9920 : 250 - 260
  • [22] Cost-sensitive probability for weighted voting in an ensemble model for multi-class classification problems
    Artittayapron Rojarath
    Wararat Songpan
    Applied Intelligence, 2021, 51 : 4908 - 4932
  • [23] Detecting financial misstatements with fraud intention using multi-class cost-sensitive learning
    Kim, Yeonkook J.
    Baik, Bok
    Cho, Sungzoon
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 62 : 32 - 43
  • [24] Sequential multi-class three-way decisions based on cost-sensitive learning
    Chen, Wenbin
    Zhang, Qinghua
    Dai, Yongyang
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2022, 146 : 47 - 61
  • [25] ANALYSIS TO NEYMAN-PEARSON CLASSIFICATION WITH CONVEX LOSS FUNCTION
    Min Han (Beijing University of Technology
    Analysis in Theory and Applications, 2008, (01) : 18 - 28
  • [26] Neyman-Pearson classification: parametrics and sample size requirement
    Tong, Xin
    Xia, Lucy
    Wang, Jiacheng
    Feng, Yang
    JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [27] Neyman-pearson classification: Parametrics and sample size requirement
    Tong, Xin
    Xia, Lucy
    Wang, Jiacheng
    Feng, Yang
    Journal of Machine Learning Research, 2020, 21
  • [28] A survey on Neyman-Pearson classification and suggestions for future research
    Tong, Xin
    Feng, Yang
    Zhao, Anqi
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2016, 8 (02): : 64 - 81
  • [29] Universal Neyman-Pearson classification with a partially known hypothesis
    Boroumand, Parham
    Fabregas, Albert Guillen i
    INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2024, 13 (02)
  • [30] Operational Patterns Recognition Using Multi-class Cost-sensitive Learning for Alumina Evaporation Process
    Tang Mingzhu
    Yang Chunhua
    Qing Jingjing
    Gui Weihua
    CHINESE JOURNAL OF ELECTRONICS, 2013, 22 (02): : 282 - 286