AUC Optimization from Multiple Unlabeled Datasets

被引:0
|
作者
Xie, Zheng [1 ]
Liu, Yu [1 ,2 ]
Li, Ming [1 ,2 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Nanjing Univ, Sch Artificial Intelligence, Nanjing, Peoples R China
关键词
AREA;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly supervised learning aims to make machine learning more powerful when the perfect supervision is unavailable, and has attracted much attention from researchers. Among the various scenarios of weak supervision, one of the most challenging cases is learning from multiple unlabeled (U) datasets with only a little knowledge of the class priors, or Um learning for short. In this paper, we study the problem of building an AUC (area under ROC curve) optimal model from multiple unlabeled datasets, which maximizes the pairwise ranking ability of the classifier. We propose U-m-AUC, an AUC optimization approach that converts the U-m data into a multi-label AUC optimization problem, and can be trained efficiently. We show that the proposed U-m-AUC is effective theoretically and empirically.
引用
收藏
页码:16058 / 16066
页数:9
相关论文
共 50 条
  • [1] Binary Classification from Multiple Unlabeled Datasets via Surrogate Set Classification
    Lu, Nan
    Lei, Shida
    Niu, Gang
    Sato, Issei
    Sugiyama, Masashi
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [2] Semi-supervised AUC optimization based on positive-unlabeled learning
    Sakai, Tomoya
    Niu, Gang
    Sugiyama, Masashi
    MACHINE LEARNING, 2018, 107 (04) : 767 - 794
  • [3] Online AUC Optimization for Sparse High-Dimensional Datasets
    Zhou, Baojian
    Ying, Yiming
    Skiena, Steven
    20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020, : 881 - 890
  • [4] Semi-Supervised AUC Optimization without Guessing Labels of Unlabeled Data
    Xie, Zheng
    Li, Ming
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 4310 - 4317
  • [5] Correction to: Semi-supervised AUC optimization based on positive-unlabeled learning
    Tomoya Sakai
    Gang Niu
    Masashi Sugiyama
    Machine Learning, 2018, 107 : 795 - 795
  • [6] Predicting Classification Accuracy of Unlabeled Datasets Using Multiple Deep Neural Networks
    You, Shingchern D.
    Liu, Hsiao-Chung
    Liu, Chien-Hung
    IEEE ACCESS, 2022, 10 : 44627 - 44637
  • [7] Automatic Evaluation of Cluster in Unlabeled Datasets
    Krishnamoorthi, M.
    INFORMATION AND NETWORK TECHNOLOGY, 2011, 4 : 120 - 124
  • [8] CLAMI: Defect Prediction on Unlabeled Datasets
    Nam, Jaechang
    Kim, Sunghun
    2015 30TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE), 2015, : 452 - 463
  • [9] Data Removal from an AUC Optimization Model
    Li, Jie
    Guo, Jun-Qi
    Gao, Wei
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT I, 2022, 13280 : 221 - 235
  • [10] Efficient Label Collection for Unlabeled Image Datasets
    Wigness, Maggie
    Draper, Bruce A.
    Beveridge, J. Ross
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 4594 - 4602