Learning Classifiers on Positive and Unlabeled Data with Policy Gradient

Cited by: 5
Authors
Li, Tianyu [1]
Wang, Chien-Chih [1]
Ma, Yukun [2]
Ortal, Patricia [1]
Zhao, Qifang [1]
Stenger, Bjorn [1]
Hirate, Yu [1]
Affiliations
[1] Rakuten Inst Technol, Tokyo, Japan
[2] Continental Automot Grp, AIR Labs, Singapore, Singapore
Keywords
Classification; Semi-supervised Learning; Reinforcement Learning; Deep Learning
DOI
10.1109/ICDM.2019.00050
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Existing algorithms aiming to learn a binary classifier from positive (P) and unlabeled (U) data require estimating the class prior or label noise ahead of building a classification model. However, the estimation and classifier learning are normally conducted in a pipeline instead of being jointly optimized. In this paper, we propose to train the two steps alternately using reinforcement learning. Our proposal adopts a policy network to adaptively make assumptions on the labels of unlabeled data, while a classifier is built upon the output of the policy network and provides rewards to learn a better policy. The dynamic and interactive training between the policy maker and the classifier can exploit the unlabeled data in a more effective manner and yield a significant improvement in terms of classification performance. Furthermore, we present two different approaches to represent the actions taken by the policy. The first approach considers continuous actions as soft labels, while the other uses discrete actions as hard label assignments for unlabeled examples. We validate the effectiveness of the proposed method on two public benchmark datasets as well as one e-commerce dataset. The results show that the proposed method is able to consistently outperform state-of-the-art methods in various settings.
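The alternating scheme described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the record gives no architecture, reward, or hyperparameters, so everything below is an assumption made for illustration. The policy and the classifier are both plain logistic regressions in NumPy, the policy takes discrete actions (hard labels) and is updated with the REINFORCE estimator, and the reward (mean classifier score on held-out positives minus mean score on the unlabeled set) is a hypothetical stand-in. The names `fit_classifier`, `predict`, and `train_pu_policy` are likewise invented here.

```python
import numpy as np

def sigmoid(z):
    # Clip logits so np.exp never overflows.
    return 1.0 / (1.0 + np.exp(-np.clip(z, -30.0, 30.0)))

def add_bias(X):
    # Append a constant column so the last weight acts as a bias term.
    return np.hstack([X, np.ones((len(X), 1))])

def fit_classifier(X, y, steps=100, lr=0.5):
    """Logistic regression fitted by batch gradient descent (assumed classifier)."""
    Xb = add_bias(X)
    w = np.zeros(Xb.shape[1])
    for _ in range(steps):
        w -= lr * Xb.T @ (sigmoid(Xb @ w) - y) / len(y)
    return w

def predict(w, X):
    """Predicted probability of the positive class."""
    return sigmoid(add_bias(X) @ w)

def train_pu_policy(X_pos, X_unl, X_val_pos, rounds=30, lr=0.1, seed=0):
    """Alternate between sampling labels from the policy and refitting the classifier."""
    rng = np.random.default_rng(seed)
    Xu = add_bias(X_unl)
    theta = np.zeros(Xu.shape[1])   # policy parameters
    w = np.zeros(Xu.shape[1])       # classifier parameters
    baseline = 0.0                  # moving-average baseline to reduce variance
    X_train = np.vstack([X_pos, X_unl])
    for _ in range(rounds):
        # Policy step: probability of labeling each unlabeled example positive,
        # then one sampled hard label (discrete action) per example.
        p = sigmoid(Xu @ theta)
        actions = (rng.random(len(p)) < p).astype(float)

        # Classifier step: train on known positives plus the assumed labels.
        y_train = np.concatenate([np.ones(len(X_pos)), actions])
        w = fit_classifier(X_train, y_train)

        # Hypothetical reward: how well the classifier separates held-out
        # positives from the unlabeled pool. The paper's reward may differ.
        reward = predict(w, X_val_pos).mean() - predict(w, X_unl).mean()

        # REINFORCE: for a Bernoulli policy with logit z, d log pi(a) / dz = a - p.
        grad_log_pi = Xu.T @ (actions - p) / len(p)
        theta += lr * (reward - baseline) * grad_log_pi
        baseline = 0.9 * baseline + 0.1 * reward
    return theta, w
```

The soft-label variant mentioned in the abstract would replace the sampled hard labels with continuous actions; that variant is not sketched here. Note that the stand-in reward above tends to push the policy toward labeling unlabeled examples as negative, so it demonstrates the training mechanics only and says nothing about the reported results.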
Pages: 399-408
Page count: 10