Learning Classifiers on Positive and Unlabeled Data with Policy Gradient

Cited by: 5

Authors:

Li, Tianyu [1]

Wang, Chien-Chih [1]

Ma, Yukun [2]

Ortal, Patricia [1]

Zhao, Qifang [1]

Stenger, Bjorn [1]

Hirate, Yu [1]

Affiliations:

[1] Rakuten Inst Technol, Tokyo, Japan

[2] Continental Automot Grp, AIR Labs, Singapore, Singapore

Source:

2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019) | 2019

Keywords:

Classification; Semi-supervised Learning; Reinforcement Learning; Deep Learning;

DOI:

10.1109/ICDM.2019.00050

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Existing algorithms aiming to learn a binary classifier from positive (P) and unlabeled (U) data require estimating the class prior or label noise ahead of building a classification model. However, the estimation and classifier learning are normally conducted in a pipeline instead of being jointly optimized. In this paper, we propose to train the two steps alternately using reinforcement learning. Our proposal adopts a policy network to adaptively make assumptions on the labels of unlabeled data, while a classifier is built upon the output of the policy network and provides rewards to learn a better policy. The dynamic and interactive training between the policy maker and the classifier can exploit the unlabeled data in a more effective manner and yield a significant improvement in terms of classification performance. Furthermore, we present two different approaches to represent the actions taken by the policy. The first approach considers continuous actions as soft labels, while the other uses discrete actions as hard assignment of labels for unlabeled examples. We validate the effectiveness of the proposed method on two public benchmark datasets as well as one e-commerce dataset. The results show that the proposed method is able to consistently outperform state-of-the-art methods in various settings.
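
The alternating loop described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the logistic policy, the logistic-regression classifier, the REINFORCE-style update with a moving-average baseline, and in particular the reward (mean score on held-out positives minus mean score on unlabeled data) are all assumptions made for illustration, and the names `train_pu`, `fit_logreg`, and `predict` are hypothetical. It uses the discrete-action variant, where each sampled action is a hard label for one unlabeled example.

```python
import numpy as np

def sigmoid(z):
    # Clip to avoid overflow warnings in exp for extreme inputs.
    return 1.0 / (1.0 + np.exp(-np.clip(z, -30, 30)))

def add_bias(X):
    return np.hstack([X, np.ones((len(X), 1))])

def fit_logreg(X, y, steps=200, lr=0.5):
    """Gradient-descent logistic regression; returns weights with the bias last."""
    Xb = add_bias(X)
    w = np.zeros(Xb.shape[1])
    for _ in range(steps):
        w -= lr * Xb.T @ (sigmoid(Xb @ w) - y) / len(y)
    return w

def predict(w, X):
    return sigmoid(add_bias(X) @ w)

def train_pu(X_pos, X_unl, X_val_pos, rounds=30, lr=0.1, seed=0):
    """Alternate between a labeling policy and a classifier (illustrative only)."""
    rng = np.random.default_rng(seed)
    Xu = add_bias(X_unl)
    theta = np.zeros(Xu.shape[1])   # policy parameters (assumed logistic policy)
    baseline = 0.0                  # moving-average baseline to reduce variance
    X_train = np.vstack([X_pos, X_unl])
    for _ in range(rounds):
        # Policy step: sample a hard label (discrete action) per unlabeled example.
        p = sigmoid(Xu @ theta)
        actions = (rng.random(len(p)) < p).astype(float)
        # Classifier step: train on positives plus the policy-assigned labels.
        y_train = np.concatenate([np.ones(len(X_pos)), actions])
        w = fit_logreg(X_train, y_train)
        # Hypothetical reward: held-out positives should score higher than
        # unlabeled examples on average. The paper's reward may differ.
        reward = predict(w, X_val_pos).mean() - predict(w, X_unl).mean()
        # REINFORCE: for a Bernoulli-logistic policy, grad log pi(a|x) = (a - p) x.
        grad = Xu.T @ (actions - p) / len(p)
        theta += lr * (reward - baseline) * grad
        baseline = 0.9 * baseline + 0.1 * reward
    return theta, w
```

Calling `train_pu(X_pos, X_unl, X_val_pos)` with NumPy arrays of shape `(n, d)` returns the policy parameters and the last classifier's weights. The soft-label variant mentioned in the abstract would replace the sampled hard labels with continuous values, which this sketch does not cover.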


Pages: 399-408

Page count: 10

50 items in total

[41] Efficient Training for Positive Unlabeled Learning
Sansone, Emanuele
De Natale, Francesco G. B.
Zhou, Zhi-Hua
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (11) : 2584 - 2598
[42] On Positive and Unlabeled Learning for Text Classification
Nagy, Istvan T.
Farkas, Richard
Csirik, Janos
TEXT, SPEECH AND DIALOGUE, TSD 2011, 2011, 6836 : 219 - 226
[43] Positive and unlabeled examples help learning
De Comité, F
Denis, F
Gilleron, R
Letouzey, F
ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 1999, 1720 : 219 - 230
[44] Learning from positive and unlabeled examples
Denis, F
Gilleron, R
Letouzey, F
THEORETICAL COMPUTER SCIENCE, 2005, 348 (01) : 70 - 83
[45] Positive and Unlabeled Learning with Label Disambiguation
Zhang, Chuang
Ren, Dexin
Liu, Tongliang
Yang, Jian
Gong, Chen
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4250 - 4256
[46] Robust and unbiased positive and unlabeled learning
Liu, Yinjie
Zhao, Jie
Xu, Yitian
KNOWLEDGE-BASED SYSTEMS, 2023, 277
[47] Multi-Positive and Unlabeled Learning
Xu, Yixing
Xu, Chang
Xu, Chao
Tao, Dacheng
PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3182 - 3188
[48] Learning from positive and unlabeled examples
Letouzey, F
Denis, F
Gilleron, R
ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2000, 1968 : 71 - 85
[49] Positive unlabeled learning with tensor networks
Zunkovic, Bojan
NEUROCOMPUTING, 2023, 552
[50] False positive rate control for positive unlabeled learning
Kong, Shuchen
Shen, Weiwei
Zheng, Yingbin
Zhang, Ao
Pu, Jian
Wang, Jun
NEUROCOMPUTING, 2019, 367 : 13 - 19
