F-measure maximizing logistic regression

被引:0
|
作者
Okabe, Masaaki [1 ]
Tsuchida, Jun [1 ]
Yadohisa, Hiroshi [1 ]
机构
[1] Doshisha Univ, Grad Sch Culture & Informat Sci, Kyoto, Japan
关键词
Density ratio; Discriminant analysis; Imbalanced data; Weighted importance; ROC;
D O I
10.1080/03610918.2022.2081706
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Logistic regression is a widely used method in several fields. When applying logistic regression to imbalanced data, wherein the majority classes dominate the minority classes, all class labels are estimated as "majority class." In this study, we use an F-measure optimization method to improve the performance of logistic regression applied to imbalanced data. Although many F-measure optimization methods adopt a ratio of the estimators to approximate the F-measure, the ratio of the estimators tends to exhibit more bias than when the ratio is directly approximated. Therefore, we employ an approximate F-measure to estimate the relative density ratio. In addition, we define and approximate a relative F-measure. We present an algorithm for a logistic regression weighted approximation relative to the F-measure. The results of an experiment using real world data demonstrate that our proposed algorithm can efficiently improve the performance of logistic regression applied to imbalanced data.
引用
收藏
页码:2554 / 2564
页数:11
相关论文
共 50 条
  • [21] Implication Intensity: Randomized F-measure for Cluster Evaluation
    Li, Limin
    Wu, Junjie
    Zhu, Shiwei
    2009 6TH INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT, VOLS 1 AND 2, 2009, : 724 - 729
  • [22] Adjusted F-measure and kernel scaling for imbalanced data learning
    Maratea, Antonio
    Petrosino, Alfredo
    Manzo, Mario
    INFORMATION SCIENCES, 2014, 257 : 331 - 341
  • [23] F-Measure Curves for Visualizing Classifier Performance with Imbalanced Data
    Soleymani, Roghayeh
    Granger, Eric
    Fumera, Giorgio
    ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, ANNPR 2018, 2018, 11081 : 165 - 177
  • [24] F-measure curves: A tool to visualize classifier performance under imbalance
    Soleymani, Roghayeh
    Granger, Eric
    Fumera, Giorgio
    PATTERN RECOGNITION, 2020, 100
  • [25] A note on using the F-measure for evaluating record linkage algorithms
    Hand, David
    Christen, Peter
    STATISTICS AND COMPUTING, 2018, 28 (03) : 539 - 547
  • [26] Extreme F-Measure Maximization using Sparse Probability Estimates
    Jasinska, Kalina
    Dembczynski, Krzysztof
    Busa-Fekete, Robert
    Pfannschmidt, Karlson
    Klerx, Timo
    Huellermeier, Eyke
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [27] Cost-Sensitive Hypergraph Learning With F-Measure Optimization
    Wang, Nan
    Liang, Ruozhou
    Zhao, Xibin
    Gao, Yue
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (05) : 2767 - 2778
  • [28] Efficient Optimization of F-Measure with Cost-Sensitive SVM
    Cheng, Fan
    Zhou, Yuan
    Gao, Jian
    Zheng, Shuangqiu
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2016, 2016
  • [29] A Review of the F-Measure: Its History, Properties, Criticism, and Alternatives
    Christen, Peter
    Hand, David J.
    Kirielle, Nishadi
    ACM COMPUTING SURVEYS, 2024, 56 (03)
  • [30] PROFIT MAXIMIZING LOGISTIC REGRESSION MODELING FOR CREDIT SCORING
    Devos, Arnout
    Dhondt, Jakob
    Stripling, Eugen
    Baesens, Bart
    vanden Broucke, Seppe Klm
    Sukhatme, Gaurav
    2018 IEEE DATA SCIENCE WORKSHOP (DSW), 2018, : 125 - 129