Positive-Unlabeled Learning for Network Link Prediction

Cited by: 5
Authors
Gan, Shengfeng [1]
Alshahrani, Mohammed [2]
Liu, Shichao [3]
Affiliations
[1] Hubei Univ Educ, Coll Comp, Wuhan 430205, Peoples R China
[2] Albaha Univ, Coll Comp Sci & IT, Albaha 65515, Saudi Arabia
[3] Huazhong Agr Univ, Coll Informat, Wuhan 430070, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
network link prediction; positive-unlabeled learning; network representation learning; supervised classification; CLASSIFICATION; SVM;
DOI
10.3390/math10183345
Chinese Library Classification (CLC) number
O1 [Mathematics]
Subject classification code
0701; 070101
Abstract
Link prediction is an important problem in network data mining: it aims to predict potential relationships between nodes in a network. Link prediction based on supervised classification is normally trained on a dataset consisting of a set of positive samples and a set of negative samples. However, well-labeled training datasets with both positive and negative annotations are often scarce in real-world scenarios, and the datasets contain a large number of unlabeled samples that may hinder model performance. To address this problem, we propose a positive-unlabeled learning framework with network representation for link prediction that uses only positive and unlabeled samples. We first learn representation vectors of nodes using a network representation method. Next, we concatenate the representation vectors of each node pair and feed them into different classifiers to predict whether the link exists. To alleviate data imbalance and improve prediction precision, we adopt three types of positive-unlabeled (PU) learning strategies: traditional classifier estimation, bagging, and reliable negative sampling. We conduct experiments on three datasets to compare the PU learning methods and discuss their influence on the prediction results. The experimental results demonstrate that PU learning improves predictive performance and that the size of the improvement varies with network structure.
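The pipeline in the abstract (node vectors, concatenated pair features, a classifier trained with a PU strategy) can be illustrated with a minimal sketch of the bagging strategy. This is not the authors' implementation. The one-dimensional toy embeddings, the hand-picked unlabeled pool, the small logistic regression used as a stand-in classifier, and all function names (`pair_features`, `train_logreg`, `pu_bagging_scores`) are assumptions made for illustration. Each round treats a random sample of unlabeled pairs as provisional negatives, trains on positives versus that sample, and scores only the pairs left out of the sample.

```python
import math
import random
from itertools import combinations


def pair_features(emb, pair):
    """Concatenate the two node vectors into one feature vector."""
    u, v = pair
    return emb[u] + emb[v]


def predict(w, b, x):
    """Logistic-regression probability that the pair is a link."""
    z = b + sum(wj * xj for wj, xj in zip(w, x))
    return 1.0 / (1.0 + math.exp(-z))


def train_logreg(X, y, epochs=200, lr=0.5):
    """Fit a small logistic regression by batch gradient descent."""
    w, b = [0.0] * len(X[0]), 0.0
    n = len(X)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * len(w), 0.0
        for x, label in zip(X, y):
            err = predict(w, b, x) - label
            grad_b += err
            for j, xj in enumerate(x):
                grad_w[j] += err * xj
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def pu_bagging_scores(emb, positives, unlabeled, rounds=30, seed=0):
    """Average out-of-bag link scores for unlabeled pairs (PU bagging)."""
    rng = random.Random(seed)
    # Sample as many provisional negatives as positives, but always
    # leave at least one unlabeled pair out of the bag.
    k = min(len(positives), len(unlabeled) - 1)
    X_pos = [pair_features(emb, p) for p in positives]
    totals = {p: 0.0 for p in unlabeled}
    counts = {p: 0 for p in unlabeled}
    for _ in range(rounds):
        bag = rng.sample(unlabeled, k)
        in_bag = set(bag)
        X = X_pos + [pair_features(emb, p) for p in bag]
        y = [1] * len(X_pos) + [0] * len(bag)
        w, b = train_logreg(X, y)
        for p in unlabeled:
            if p not in in_bag:
                totals[p] += predict(w, b, pair_features(emb, p))
                counts[p] += 1
    # Pairs that were never out of bag receive no score.
    return {p: totals[p] / counts[p] for p in unlabeled if counts[p]}


# Toy data (assumed): "h" nodes are densely linked, "l" nodes are not.
emb = {"h0": [0.9], "h1": [1.0], "h2": [1.1], "h3": [1.0],
       "l0": [0.0], "l1": [0.1], "l2": [-0.1], "l3": [0.05], "l4": [-0.05]}
positives = [("h0", "h1"), ("h0", "h2"), ("h1", "h2"),
             ("h1", "h3"), ("h2", "h3")]
hidden = ("h0", "h3")  # a true link deliberately left unlabeled
low_pairs = list(combinations(["l0", "l1", "l2", "l3", "l4"], 2))
unlabeled = [hidden] + low_pairs

scores = pu_bagging_scores(emb, positives, unlabeled)
for pair, score in sorted(scores.items(), key=lambda kv: -kv[1])[:3]:
    print(pair, round(score, 3))
```

In this toy setting the hidden link ranks above the low-activity pairs because it is scored only in rounds where it was not mislabeled as a negative. Real node embeddings would have many more dimensions, and a linear model on concatenated vectors may be too weak for them, which is one reason to compare several classifiers.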
Pages: 13
Related papers
50 in total
  • [41] Positive-unlabeled learning in bioinformatics and computational biology: a brief review
    Li, Fuyi
    Dong, Shuangyu
    Leier, Andre
    Han, Meiya
    Guo, Xudong
    Xu, Jing
    Wang, Xiaoyu
    Pan, Shirui
    Jia, Cangzhi
    Zhang, Yang
    Webb, Geoffrey I.
    Coin, Lachlan J. M.
    Li, Chen
    Song, Jiangning
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
  • [42] Deep Generative Positive-Unlabeled Learning under Selection Bias
    Na, Byeonghu
    Kim, Hyemi
    Song, Kyungwoo
    Joo, Weonyoung
    Kim, Yoon-Yeong
    Moon, Il-Chul
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 1155 - 1164
  • [43] Screening drug-target interactions with positive-unlabeled learning
    Peng, Lihong
    Zhu, Wen
    Liao, Bo
    Duan, Yu
    Chen, Min
    Chen, Yi
    Yang, Jialiang
    SCIENTIFIC REPORTS, 2017, 7
  • [44] Spotting Fake Reviews via Collective Positive-Unlabeled Learning
    Li, Huayi
    Chen, Zhiyuan
    Liu, Bing
    Wei, Xiaokai
    Shao, Jidong
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, : 899 - 904
  • [45] AdaSampling for Positive-Unlabeled and Label Noise Learning With Bioinformatics Applications
    Yang, Pengyi
    Ormerod, John T.
    Liu, Wei
    Ma, Chendong
    Zomaya, Albert Y.
    Yang, Jean Y. H.
    IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (05) : 1932 - 1943
  • [46] Biometric identity recognition based on contrastive positive-unlabeled learning
    Sun, Le
    Hua, Yiwen
    Muhammad, Ghulam
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2024, 83
  • [47] Positive-unlabeled learning for coronary artery segmentation in CCTA images
    Chen, Fei
    Li, Sulei
    Wei, Chen
    Zhang, Yue
    Guo, Kaitai
    Zheng, Yang
    Cao, Feng
    Liang, Jimin
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 87
  • [48] A flexible procedure for mixture proportion estimation in positive-unlabeled learning
    Lin, Zhenfeng
    Long, James P.
    STATISTICAL ANALYSIS AND DATA MINING, 2020, 13 (02) : 178 - 187
  • [49] Positive-Unlabeled Learning with Non-Negative Risk Estimator
    Kiryo, Ryuichi
    Niu, Gang
    du Plessis, Marthinus C.
    Sugiyama, Masashi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [50] Information-Theoretic Representation Learning for Positive-Unlabeled Classification
    Sakai, Tomoya
    Niu, Gang
    Sugiyama, Masashi
    NEURAL COMPUTATION, 2021, 33 (01) : 244 - 268