NIAPU: network-informed adaptive positive-unlabeled learning for disease gene identification

被引:4
|
作者
Stolfi, Paola [1 ]
Mastropietro, Andrea [2 ]
Pasculli, Giuseppe [2 ]
Tieri, Paolo [1 ]
Vergni, Davide [1 ]
机构
[1] Natl Res Council Italy CNR, Inst Appl Comp IAC Mauro Picone, I-00185 Rome, Italy
[2] Sapienza Univ Rome, Dept Comp Control & Management Engn DIAG Antonio, I-00185 Rome, Italy
关键词
PREDICTION; PRIORITIZATION; WALKING;
D O I
10.1093/bioinformatics/btac848
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Gene-disease associations are fundamental for understanding disease etiology and developing effective interventions and treatments. Identifying genes not yet associated with a disease due to a lack of studies is a challenging task in which prioritization based on prior knowledge is an important element. The computational search for new candidate disease genes may be eased by positive-unlabeled learning, the machine learning (ML) setting in which only a subset of instances are labeled as positive while the rest of the dataset is unlabeled. In this work, we propose a set of effective network-based features to be used in a novel Markov diffusion-based multi-class labeling strategy for putative disease gene discovery. Results: The performances of the new labeling algorithm and the effectiveness of the proposed features have been tested on 10 different disease datasets using three ML algorithms. The new features have been compared against classical topological and functional/ontological features and a set of network- and biological-derived features already used in gene discovery tasks. The predictive power of the integrated methodology in searching for new disease genes has been found to be competitive against state-of-the-art algorithms. Availability and implementation: The source code of NIAPU can be accessed at https://github.com/AndMastro/ NIAPU. The source data used in this study are available online on the respective websites. Contact: mastropietro@diag.uniroma1.it or davide.vergni@cnr.it Supplementary information: Supplementary data are available at Bioinformatics online.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Biometric identity recognition based on contrastive positive-unlabeled learning
    Sun, Le
    Hua, Yiwen
    Muhammad, Ghulam
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2024, 83
  • [42] Leveraging Positive-Unlabeled Learning for Enhanced Black Spot Accident Identification on Greek Road Networks
    Sevetlidis, Vasileios
    Pavlidis, George
    Mouroutsos, Spyridon G.
    Gasteratos, Antonios
    COMPUTERS, 2024, 13 (02)
  • [43] Positive-unlabeled learning for coronary artery segmentation in CCTA images
    Chen, Fei
    Li, Sulei
    Wei, Chen
    Zhang, Yue
    Guo, Kaitai
    Zheng, Yang
    Cao, Feng
    Liang, Jimin
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 87
  • [44] A flexible procedure for mixture proportion estimation in positive-unlabeled learning
    Lin, Zhenfeng
    Long, James P.
    STATISTICAL ANALYSIS AND DATA MINING, 2020, 13 (02) : 178 - 187
  • [45] Positive-Unlabeled Learning with Non-Negative Risk Estimator
    Kiryo, Ryuichi
    Niu, Gang
    du Plessis, Marthinus C.
    Sugiyama, Masashi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [46] Information-Theoretic Representation Learning for Positive-Unlabeled Classification
    Sakai, Tomoya
    Niu, Gang
    Sugiyama, Masashi
    NEURAL COMPUTATION, 2021, 33 (01) : 244 - 268
  • [47] Prioritization of disease genes from GWAS using ensemble-based positive-unlabeled learning
    Nikita Kolosov
    Mark J. Daly
    Mykyta Artomov
    European Journal of Human Genetics, 2021, 29 : 1527 - 1535
  • [48] Unsupervised Body Hair Detection by Positive-Unlabeled Learning in Photoacoustic Image
    Kikkawa, Ryo
    Kajita, Hiroki
    Imanishi, Nobuaki
    Aiso, Sadakazu
    Bise, Ryoma
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 3349 - 3352
  • [49] EPuL: An Enhanced Positive-Unlabeled Learning Algorithm for the Prediction of Pupylation Sites
    Nan, Xuanguo
    Bao, Lingling
    Zhao, Xiaosa
    Zhao, Xiaowei
    Sangaiah, Arun Kumar
    Wang, Gai-Ge
    Ma, Zhiqiang
    MOLECULES, 2017, 22 (09):
  • [50] Positive-unlabeled learning for the prediction of conformational B-cell epitopes
    Jing Ren
    Qian Liu
    John Ellis
    Jinyan Li
    BMC Bioinformatics, 16