A novel observation points-based positive-unlabeled learning algorithm

被引：6

作者：

He, Yulin ^{[1
,2
]}

Li, Xu ^{[2
]}

Zhang, Manjing ^{[1
]}

Fournier-Viger, Philippe ^{[2
]}

Huang, Joshua Zhexue ^{[1
,2
,4
]}

Salloum, Salman ^{[3
]}

机构：

[1] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen, Peoples R China

[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China

[3] Natl Univ Singapore, Sch Comp, Singapore, Singapore

[4] Shenzhen Univ, Collegeof Comp Sci & Software Engn, Shenzhen 518060, Peoples R China

来源：

CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY | 2023年 / 8卷 / 04期

基金：

中国国家自然科学基金;

关键词：

artificial intelligence; datamining; machine learning; CLASSIFIERS; ENSEMBLE;

D O I：

10.1049/cit2.12152

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this study, an observation points-based positive-unlabeled learning algorithm (hence called OP-PUL) is proposed to deal with positive-unlabeled learning (PUL) tasks by judiciously assigning highly credible labels to unlabeled samples. The proposed OP-PUL algorithm has three components. First, an observation point classifier ensemble (OPCE) algorithm is constructed to divide unlabeled samples into two categories, which are temporary positive and permanent negative samples. Second, a temporary OPC (TOPC) is trained based on the combination of original positive samples and permanent negative samples and then the permanent positive samples that are correctly classified with TOPC are retained from the temporary positive samples. Third, a permanent OPC (POPC) is finally trained based on the combination of original positive samples, permanent positive samples and permanent negative samples. An exhaustive experimental evaluation is conducted to validate the feasibility, rationality and effectiveness of the OP-PUL algorithm, using 30 benchmark PU data sets. Results show that (1) the OP-PUL algorithm is stable and robust as unlabeled samples and positive samples are increased in unlabeled data sets and (2) the permanent positive samples have a consistent probability distribution with the original positive samples. Moreover, a statistical analysis reveals that POPC in the OP-PUL algorithm can yield better PUL performances on the 30 data sets in comparison with four well-known PUL algorithms. This demonstrates that OP-PUL is a viable algorithm to deal with PUL tasks.

引用

页码：1425 / 1443

页数：19

共 50 条

[21] Positive-Unlabeled Learning for Pupylation Sites Prediction
Jiang, Ming
Cao, Jun-Zhe
BIOMED RESEARCH INTERNATIONAL, 2016, 2016
[22] Positive-Unlabeled Learning for inferring drug interactions based on heterogeneous attributes
Hameed, Pathima Nusrath
Verspoor, Karin
Kusljic, Snezana
Halgamuge, Saman
BMC BIOINFORMATICS, 2017, 18
[23] Positive-Unlabeled Learning for inferring drug interactions based on heterogeneous attributes
Pathima Nusrath Hameed
Karin Verspoor
Snezana Kusljic
Saman Halgamuge
BMC Bioinformatics, 18
[24] Intrusion Detection based on Non-negative Positive-unlabeled Learning
Lv, Sicai
Liu, Yang
Liu, Zhiyao
Chao, Wang
Wu, Chenrui
Wang, Bailing
PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), 2020, : 1015 - 1020
[25] Estimating Visited Stores Through Positive-Unlabeled Learning
Shirai, Ryo
Imai, Ryo
Liew, Seng Pei
Amagata, Daichi
Takahashi, Tsubasa
Hara, Takahiro
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT VII, DASFAA 2024, 2024, 14856 : 377 - 389
[26] Construction of Fatigue Criteria Through Positive-Unlabeled Learning
Coudray, Olivier
Bristiel, Philippe
Dinis, Miguel
Keribin, Christine
Pamphile, Patrick
FATIGUE & FRACTURE OF ENGINEERING MATERIALS & STRUCTURES, 2025, 48 (01) : 101 - 117
[27] Positive-unlabeled learning for open set domain adaptation
Loghmani, Mohammad Reza
Vincze, Markus
Tommasi, Tatiana
PATTERN RECOGNITION LETTERS, 2020, 136 : 198 - 204
[28] Partial Optimal Transport with Applications on Positive-Unlabeled Learning
Chapel, Laetitia
Alaya, Mokhtar Z.
Gasso, Gilles
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[29] Recovering True Classifier Performance in Positive-Unlabeled Learning
Jain, Shantanu
White, Martha
Radivojac, Predrag
THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2066 - 2072
[30] Spotting Fake Reviews using Positive-Unlabeled Learning
Li, Huayi
Liu, Bing
Mukherjee, Arjun
Shao, Jidong
COMPUTACION Y SISTEMAS, 2014, 18 (03): : 467 - 475

← 1 2 3 4 5 →