Learning Classifiers on Positive and Unlabeled Data with Policy Gradient

Cited by: 5

Authors:

Li, Tianyu [1]

Wang, Chien-Chih [1]

Ma, Yukun [2]

Ortal, Patricia [1]

Zhao, Qifang [1]

Stenger, Bjorn [1]

Hirate, Yu [1]

Affiliations:

[1] Rakuten Inst Technol, Tokyo, Japan

[2] Continental Automot Grp, AIR Labs, Singapore, Singapore

Source:

2019 19th IEEE International Conference on Data Mining (ICDM 2019) | 2019

Keywords:

Classification; Semi-supervised Learning; Reinforcement Learning; Deep Learning;

DOI:

10.1109/ICDM.2019.00050

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Existing algorithms aiming to learn a binary classifier from positive (P) and unlabeled (U) data require estimating the class prior or label noise ahead of building a classification model. However, the estimation and classifier learning are normally conducted in a pipeline instead of being jointly optimized. In this paper, we propose to alternately train the two steps using reinforcement learning. Our proposal adopts a policy network to adaptively make assumptions on the labels of unlabeled data, while a classifier is built upon the output of the policy network and provides rewards to learn a better policy. The dynamic and interactive training between the policy maker and the classifier can exploit the unlabeled data more effectively and yield a significant improvement in classification performance. Furthermore, we present two different approaches to represent the actions taken by the policy. The first approach considers continuous actions as soft labels, while the other uses discrete actions as hard assignments of labels to unlabeled examples. We validate the effectiveness of the proposed method on two public benchmark datasets as well as one e-commerce dataset. The results show that the proposed method consistently outperforms state-of-the-art methods in various settings.
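The alternating scheme described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the network architectures, the reward, or the update rule, so everything below is an assumption made for illustration. The policy is a one-parameter-pair Bernoulli model updated with plain REINFORCE, the classifier is 1-D logistic regression, and the reward is the PU-friendly score recall² / Pr(predicted positive) computed on held-out data. All function names (`make_toy_data`, `fit_classifier`, `pu_reward`, `train`) are hypothetical. Only the discrete (hard-label) variant is sketched.

```python
import math
import random

def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def make_toy_data(seed=0, n_pos=40, n_unl=100, prior=0.4):
    """Synthetic 1-D PU data (assumption): positives ~ N(2,1), negatives ~ N(-2,1)."""
    rng = random.Random(seed)
    draw = lambda positive: rng.gauss(2.0 if positive else -2.0, 1.0)
    pos = [draw(True) for _ in range(n_pos)]
    unl = [draw(rng.random() < prior) for _ in range(n_unl)]
    val_pos = [draw(True) for _ in range(n_pos)]
    val_unl = [draw(rng.random() < prior) for _ in range(n_unl)]
    return pos, unl, val_pos, val_unl

def fit_classifier(xs, ys, steps=50, lr=0.5):
    """Fit 1-D logistic regression by batch gradient descent."""
    w = b = 0.0
    for _ in range(steps):
        gw = gb = 0.0
        for x, y in zip(xs, ys):
            err = sigmoid(w * x + b) - y
            gw += err * x
            gb += err
        w -= lr * gw / len(xs)
        b -= lr * gb / len(xs)
    return w, b

def pu_reward(clf, val_pos, val_unl):
    """Assumed reward: recall^2 / fraction of unlabeled data predicted positive."""
    w, b = clf
    recall = sum(w * x + b > 0 for x in val_pos) / len(val_pos)
    pred_pos = sum(w * x + b > 0 for x in val_unl) / len(val_unl)
    return recall * recall / pred_pos if pred_pos > 0 else 0.0

def train(pos, unl, val_pos, val_unl, episodes=60, lr=0.05, seed=0):
    """Alternate between sampling labels from the policy and refitting the classifier."""
    rng = random.Random(seed)
    tw = tb = 0.0      # policy parameters: P(label=1 | x) = sigmoid(tw*x + tb)
    baseline = 0.0     # moving-average reward baseline to reduce variance
    clf = (0.0, 0.0)
    for _ in range(episodes):
        probs = [sigmoid(tw * x + tb) for x in unl]
        # Discrete actions: a hard label for each unlabeled example.
        actions = [1.0 if rng.random() < p else 0.0 for p in probs]
        clf = fit_classifier(pos + unl, [1.0] * len(pos) + actions)
        r = pu_reward(clf, val_pos, val_unl)
        adv = r - baseline
        baseline = 0.9 * baseline + 0.1 * r
        # REINFORCE: gradient of log Bernoulli likelihood is (a - p) * feature.
        gw = sum((a - p) * x for a, p, x in zip(actions, probs, unl)) / len(unl)
        gb = sum(a - p for a, p in zip(actions, probs)) / len(unl)
        tw += lr * adv * gw
        tb += lr * adv * gb
    return (tw, tb), clf

if __name__ == "__main__":
    pos, unl, val_pos, val_unl = make_toy_data()
    policy, clf = train(pos, unl, val_pos, val_unl)
    print("policy:", policy, "classifier:", clf)
```

The point of the sketch is the interaction loop: the policy proposes labels, the classifier trained on them produces a reward, and that reward updates the policy. The specific reward and models are placeholders and would differ in any faithful reproduction of the paper.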


Pages: 399-408

Page count: 10
