Learning Classifiers on Positive and Unlabeled Data with Policy Gradient

被引:5
|
作者
Li, Tianyu [1 ]
Wang, Chien-Chih [1 ]
Ma, Yukun [2 ]
Ortal, Patricia [1 ]
Zhao, Qifang [1 ]
Stenger, Bjorn [1 ]
Hirate, Yu [1 ]
机构
[1] Rakuten Inst Technol, Tokyo, Japan
[2] Continental Automot Grp, AIR Labs, Singapore, Singapore
关键词
Classification; Semi-supervised Learning; Reinforcement Learning; Deep Learning;
D O I
10.1109/ICDM.2019.00050
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing algorithms aiming to learn a binary classifier from positive (P) and unlabeled (U) data require estimating the class prior or label noise ahead of building a classification model. However, the estimation and classifier learning are normally conducted in a pipeline instead of being jointly optimized. In this paper, we propose to alternatively train the two steps using reinforcement learning. Our proposal adopts a policy network to adaptively make assumptions on the labels of unlabeled data, while a classifier is built upon the output of the policy network and provides rewards to learn a better policy. The dynamic and interactive training between the policy maker and the classifier can exploit the unlabeled data in a more effective manner and yield a significant improvement in terms of classification performance. Furthermore, we present two different approaches to represent the actions taken by the policy. The first approach considers continuous actions as soft labels, while the other uses discrete actions as hard assignment of labels for unlabeled examples. We validate the effectiveness of the proposed method on two public benchmark datasets as well as one e-commerce dataset. The results show that the proposed method is able to consistently outperform state-of-the-art methods in various settings.
引用
收藏
页码:399 / 408
页数:10
相关论文
共 50 条
  • [31] Class-prior estimation for learning from positive and unlabeled data
    du Plessis, Marthinus C.
    Niu, Gang
    Sugiyama, Masashi
    MACHINE LEARNING, 2017, 106 (04) : 463 - 492
  • [32] Learning gene regulatory networks from only positive and unlabeled data
    Luigi Cerulo
    Charles Elkan
    Michele Ceccarelli
    BMC Bioinformatics, 11
  • [33] A Quantum-Inspired Direct Learning Strategy for Positive and Unlabeled Data
    Chenguang Zhang
    Xuejiao Du
    Yan Zhang
    International Journal of Computational Intelligence Systems, 16
  • [34] A Quantum-Inspired Direct Learning Strategy for Positive and Unlabeled Data
    Zhang, Chenguang
    Du, Xuejiao
    Zhang, Yan
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)
  • [35] Tri-training based learning from positive and unlabeled data
    Zhang, Bangzuo
    Zuo, Wanli
    2008 INTERNATIONAL SYMPOSIUM ON INFORMATION PROCESSING AND 2008 INTERNATIONAL PACIFIC WORKSHOP ON WEB MINING AND WEB-BASED APPLICATION, 2008, : 640 - 644
  • [36] Learning gene regulatory networks from only positive and unlabeled data
    Cerulo, Luigi
    Elkan, Charles
    Ceccarelli, Michele
    BMC BIOINFORMATICS, 2010, 11
  • [37] An Active Learning Based on Uncertainty and Density Method for Positive and Unlabeled Data
    Luo, Jun
    Zhou, Wenan
    Du, Yu
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT I, 2018, 11334 : 229 - 241
  • [38] Active Learning for Multivariate Time Series Classification with Positive Unlabeled Data
    He, Guoliang
    Duan, Yong
    Li, Yifei
    Qian, Tieyun
    He, Jinrong
    Jia, Xiangyang
    2015 IEEE 27TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2015), 2015, : 178 - 185
  • [39] Learning Bayesian network classifiers for facial expression recognition using both labeled and unlabeled data
    Cohen, I
    Sebe, N
    Cozman, FG
    Cirelo, MC
    Huang, TS
    2003 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2003, : 595 - 601
  • [40] Conditional generative positive and unlabeled learning
    Papic, Ales
    Kononenko, Igor
    Bosnic, Zoran
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 224