Learning Classifiers on Positive and Unlabeled Data with Policy Gradient

被引:5
|
作者
Li, Tianyu [1 ]
Wang, Chien-Chih [1 ]
Ma, Yukun [2 ]
Ortal, Patricia [1 ]
Zhao, Qifang [1 ]
Stenger, Bjorn [1 ]
Hirate, Yu [1 ]
机构
[1] Rakuten Inst Technol, Tokyo, Japan
[2] Continental Automot Grp, AIR Labs, Singapore, Singapore
关键词
Classification; Semi-supervised Learning; Reinforcement Learning; Deep Learning;
D O I
10.1109/ICDM.2019.00050
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing algorithms aiming to learn a binary classifier from positive (P) and unlabeled (U) data require estimating the class prior or label noise ahead of building a classification model. However, the estimation and classifier learning are normally conducted in a pipeline instead of being jointly optimized. In this paper, we propose to alternatively train the two steps using reinforcement learning. Our proposal adopts a policy network to adaptively make assumptions on the labels of unlabeled data, while a classifier is built upon the output of the policy network and provides rewards to learn a better policy. The dynamic and interactive training between the policy maker and the classifier can exploit the unlabeled data in a more effective manner and yield a significant improvement in terms of classification performance. Furthermore, we present two different approaches to represent the actions taken by the policy. The first approach considers continuous actions as soft labels, while the other uses discrete actions as hard assignment of labels for unlabeled examples. We validate the effectiveness of the proposed method on two public benchmark datasets as well as one e-commerce dataset. The results show that the proposed method is able to consistently outperform state-of-the-art methods in various settings.
引用
收藏
页码:399 / 408
页数:10
相关论文
共 50 条
  • [1] Bayesian Classifiers for Positive Unlabeled Learning
    He, Jiazhen
    Zhang, Yang
    Li, Xue
    Wang, Yong
    WEB-AGE INFORMATION MANAGEMENT, 2011, 6897 : 81 - +
  • [2] Learning Bayesian classifiers from positive and unlabeled examples
    Calvo, Boria
    Larranaga, Pedro
    Lozano, Jose A.
    PATTERN RECOGNITION LETTERS, 2007, 28 (16) : 2375 - 2384
  • [3] Federated Learning with Positive and Unlabeled Data
    Lin, Xinyang
    Chen, Hanting
    Xu, Yixing
    Xu, Chao
    Gui, Xiaolin
    Deng, Yiping
    Wang, Yunhe
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [4] Positive and unlabeled learning in categorical data
    Ienco, Dino
    Pensa, Ruggero G.
    NEUROCOMPUTING, 2016, 196 : 113 - 124
  • [5] Learning Balanced Bayesian Classifiers From Labeled and Unlabeled Data
    Guo, Lu
    Wang, Limin
    Li, Qilong
    Li, Kuo
    IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (04) : 330 - 342
  • [6] Regularization of Unlabeled Data for Learning of Classifiers based on Mixture Models
    Iswanto, Bambang Heru
    ICICI-BME: 2009 INTERNATIONAL CONFERENCE ON INSTRUMENTATION, COMMUNICATION, INFORMATION TECHNOLOGY, AND BIOMEDICAL ENGINEERING, 2009, : 345 - 349
  • [7] Measuring the Significance of Policy Outputs with Positive Unlabeled Learning
    Zubek, Radoslaw
    Dasgupta, Abhishek
    Doyle, David
    AMERICAN POLITICAL SCIENCE REVIEW, 2021, 115 (01) : 339 - 346
  • [8] Analysis of Learning from Positive and Unlabeled Data
    du Plessis, Marthinus C.
    Niu, Gang
    Sugiyama, Masashi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [9] Learning from positive and unlabeled data: a survey
    Jessa Bekker
    Jesse Davis
    Machine Learning, 2020, 109 : 719 - 760
  • [10] Learning from positive and unlabeled data: a survey
    Bekker, Jessa
    Davis, Jesse
    MACHINE LEARNING, 2020, 109 (04) : 719 - 760