MuCoMiD: A <bold>Mu</bold>ltitask Graph <bold>Co</bold>nvolutional Learning Framework for <bold>mi</bold>RNA-<bold>D</bold>isease Association Prediction

被引:5
|
作者
Dong, Ngan [1 ]
Muecke, Stefanie [2 ]
Khosla, Megha [3 ]
机构
[1] Leibniz Univ Hann, L3S Res Ctr, D-30167 Hannover, Germany
[2] TRAIN Omics, Translat Alliance Lower Saxony, D-37081 Hannover, Germany
[3] Delft Univ Technol TU Delft, NL-2628 CD Delft, Netherlands
关键词
Data integration; disease; graph representation learning; MiRNA; multitask; MICRORNAS; TUMORIGENESIS; SIMILARITY;
D O I
10.1109/TCBB.2022.3176456
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Growing evidence from recent studies implies that microRNAs or miRNAs could serve as biomarkers in various complex human diseases. Since wet-lab experiments for detecting miRNAs associated with a disease are expensive and time-consuming, machine learning techniques for miRNA-disease association prediction have attracted much attention in recent years. A big challenge in building reliable machine learning models is that of data scarcity. In particular, existing approaches trained on the available small datasets, even when combined with precalculated handcrafted input features, often suffer from bad generalization and data leakage problems. We overcome the limitations of existing works by proposing a novel multitask graph convolution-based approach, which we refer to as MuCoMiD. MuCoMiD allows automatic feature extraction while incorporating knowledge from five heterogeneous biological information sources (associations between miRNAs/diseases and protein-coding genes (PCGs), interactions between protein-coding genes, miRNA family information, and disease ontology) in a multitask setting which is a novel perspective and has not been studied before. To effectively test the generalization capability of our model, we conduct large-scale experiments on the standard benchmark datasets as well as on our proposed large independent testing sets and case studies. MuCoMiD obtains significantly higher Average Precision (AP) scores than all benchmarked models on three large independent testing sets, especially those with many new miRNAs, as well as in the detection of false positives. Thanks to its capability of learning directly from raw input information, MuCoMiD is easier to maintain and update than handcrafted feature-based methods, which would require recomputation of features every time there is a change in the original information sources (e.g., disease ontology, miRNA/disease-PCG associations, etc.). We share our code for reproducibility and future research at https://git.l3s.uni-hannover.de/dong/cmtt.
引用
收藏
页码:3081 / 3092
页数:12
相关论文
共 50 条
  • [1] Coupled pyroelectric-photovoltaic effect in 2D ferroelectric α-<bold>In</bold><bold>2</bold><bold>Se</bold><bold>3</bold>
    Uzhansky, Michael
    Rakshit, Abhishek
    Kalcheim, Yoav
    Koren, Elad
    NPJ 2D MATERIALS AND APPLICATIONS, 2025, 9 (01)
  • [2] Unravelling tRNA fragments in DENV pathogenesis: Insights from RNA sequencing( Vol <bold> </bold>14<bold>, </bold>18357<bold>, </bold>2024<bold>)</bold>
    Madhry, Deeksha
    Kumari, Kiran
    Meena, Varsha
    Roy, Riya
    Verma, Bhupendra
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [3] Graph Unlearning<bold> </bold>
    Chen, Min
    Zhang, Zhikun
    Wang, Tianhao
    Backes, Michael
    Humbert, Mathias
    Zhang, Yang
    PROCEEDINGS OF THE 2022 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, CCS 2022, 2022, : 499 - 513
  • [4] <bold>Leaf scale quantification of the effect of photosynthetic gas exchange on</bold> Δ47<bold>of CO</bold><bold>2</bold>
    Adnew, Getachew Agmuas
    Hofmann, Magdalena E. G.
    Pons, Thijs L.
    Koren, Gerbrand
    Ziegler, Martin
    Lourens, Lucas J.
    Rockmann, Thomas
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [5] Some characterizations of <bold>RP</bold><bold>[d]</bold> and sensitivity for abelian group actions
    Liu, Zhuowei
    DYNAMICAL SYSTEMS-AN INTERNATIONAL JOURNAL, 2025,
  • [6] A Framework for Haptic Interpersonal Communication<bold> </bold>
    Mukta, Marufa Ycasmin
    Hassanein, Hossam S.
    2024 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CCECE 2024, 2024, : 164 - 165
  • [7] Study The "Interval" of Liberal Learning<bold> </bold>
    Mackler, Stephanie
    RECONCEPTUALIZING STUDY IN EDUCATIONAL DISCOURSE AND PRACTICE, 2017, : 54 - 67
  • [9] <bold>Lifetime prediction and fracture behavior of shear cycled Cu/Sn</bold>-<bold>3.0Ag</bold>-<bold>0.5Cu/Cu joints under current stressing</bold>
    Li, Wangyun
    Liu, Longgen
    Chen, Feng
    Xu, Yiqin
    Qin, Hongbo
    Gong, Yubing
    JOURNAL OF MATERIALS SCIENCE-MATERIALS IN ELECTRONICS, 2024, 35 (30)
  • [10] <bold>A short-term wind speed prediction method based on the BLS</bold>-<bold>RVM hybrid model</bold>
    Geng, Jianchun
    Wen, Lili
    INTERNATIONAL JOURNAL OF LOW-CARBON TECHNOLOGIES, 2024, 19 : 613 - 618