Learning Classifiers on Positive and Unlabeled Data with Policy Gradient

被引：5

作者：

Li, Tianyu ^{[1
]}

Wang, Chien-Chih ^{[1
]}

Ma, Yukun ^{[2
]}

Ortal, Patricia ^{[1
]}

Zhao, Qifang ^{[1
]}

Stenger, Bjorn ^{[1
]}

Hirate, Yu ^{[1
]}

机构：

[1] Rakuten Inst Technol, Tokyo, Japan

[2] Continental Automot Grp, AIR Labs, Singapore, Singapore

来源：

2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019) | 2019年

关键词：

Classification; Semi-supervised Learning; Reinforcement Learning; Deep Learning;

D O I：

10.1109/ICDM.2019.00050

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Existing algorithms aiming to learn a binary classifier from positive (P) and unlabeled (U) data require estimating the class prior or label noise ahead of building a classification model. However, the estimation and classifier learning are normally conducted in a pipeline instead of being jointly optimized. In this paper, we propose to alternatively train the two steps using reinforcement learning. Our proposal adopts a policy network to adaptively make assumptions on the labels of unlabeled data, while a classifier is built upon the output of the policy network and provides rewards to learn a better policy. The dynamic and interactive training between the policy maker and the classifier can exploit the unlabeled data in a more effective manner and yield a significant improvement in terms of classification performance. Furthermore, we present two different approaches to represent the actions taken by the policy. The first approach considers continuous actions as soft labels, while the other uses discrete actions as hard assignment of labels for unlabeled examples. We validate the effectiveness of the proposed method on two public benchmark datasets as well as one e-commerce dataset. The results show that the proposed method is able to consistently outperform state-of-the-art methods in various settings.

引用

页码：399 / 408

页数：10

共 50 条

[31] Class-prior estimation for learning from positive and unlabeled data
du Plessis, Marthinus C.
Niu, Gang
Sugiyama, Masashi
MACHINE LEARNING, 2017, 106 (04) : 463 - 492
[32] Learning gene regulatory networks from only positive and unlabeled data
Luigi Cerulo
Charles Elkan
Michele Ceccarelli
BMC Bioinformatics, 11
[33] A Quantum-Inspired Direct Learning Strategy for Positive and Unlabeled Data
Chenguang Zhang
Xuejiao Du
Yan Zhang
International Journal of Computational Intelligence Systems, 16
[34] A Quantum-Inspired Direct Learning Strategy for Positive and Unlabeled Data
Zhang, Chenguang
Du, Xuejiao
Zhang, Yan
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)
[35] Tri-training based learning from positive and unlabeled data
Zhang, Bangzuo
Zuo, Wanli
2008 INTERNATIONAL SYMPOSIUM ON INFORMATION PROCESSING AND 2008 INTERNATIONAL PACIFIC WORKSHOP ON WEB MINING AND WEB-BASED APPLICATION, 2008, : 640 - 644
[36] Learning gene regulatory networks from only positive and unlabeled data
Cerulo, Luigi
Elkan, Charles
Ceccarelli, Michele
BMC BIOINFORMATICS, 2010, 11
[37] An Active Learning Based on Uncertainty and Density Method for Positive and Unlabeled Data
Luo, Jun
Zhou, Wenan
Du, Yu
ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT I, 2018, 11334 : 229 - 241
[38] Active Learning for Multivariate Time Series Classification with Positive Unlabeled Data
He, Guoliang
Duan, Yong
Li, Yifei
Qian, Tieyun
He, Jinrong
Jia, Xiangyang
2015 IEEE 27TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2015), 2015, : 178 - 185
[39] Learning Bayesian network classifiers for facial expression recognition using both labeled and unlabeled data
Cohen, I
Sebe, N
Cozman, FG
Cirelo, MC
Huang, TS
2003 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2003, : 595 - 601
[40] Conditional generative positive and unlabeled learning
Papic, Ales
Kononenko, Igor
Bosnic, Zoran
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 224

← 1 2 3 4 5 →