CapsCarcino: A novel sparse data deep learning tool for predicting carcinogens

Cited by: 29
Authors
Wang, Yi-Wei [1, 2, 3]
Huang, Lei [4, 5]
Jiang, Si-Wen [4]
Li, Kan [1, 2]
Zou, Jun [1, 2]
Yang, Sheng-Yong [1, 2]
Affiliations
[1] Sichuan Univ, West China Hosp, State Key Lab Biotherapy, Chengdu 610041, Sichuan, Peoples R China
[2] Sichuan Univ, West China Hosp, Canc Ctr, Chengdu 610041, Sichuan, Peoples R China
[3] Southwest Med Univ, Coll Preclin Med, Luzhou 646000, Sichuan, Peoples R China
[4] Univ Elect Sci & Technol China, Sch Comp Sci & Engineer, Chengdu 611731, Sichuan, Peoples R China
[5] Sichuan Coll Architectural Technol, Basic Teaching Dept, Deyang 61800, Sichuan, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Deep learning; Carcinogenicity; Predictive classifier; Capsule network; Computational toxicology; IN-SILICO PREDICTION; NONCONGENERIC CHEMICALS; MODELS; MUTAGENICITY; TOXICITY; BIOASSAY;
DOI
10.1016/j.fct.2019.110921
Chinese Library Classification number
TS2 [Food Industry];
Discipline classification code
0832;
Abstract
Determining chemical carcinogenicity in the early stages of drug discovery is fundamentally important to prevent the adverse effects of carcinogens on human health. There has been a recent surge of interest in developing computational approaches to predict chemical carcinogenicity. However, the predictive power of many existing approaches is limited, and there is plenty of room for improvement. Here, we develop a new deep learning architecture, termed CapsCarcino, to distinguish between carcinogens and noncarcinogens. CapsCarcino is built on a dynamic routing algorithm that requires less data, extracts more comprehensive information, and does not require feature selection. We find that CapsCarcino provides significantly better predictive and generalization ability than five other machine learning models. Specifically, the best CapsCarcino model achieves an accuracy of 85.0% on an external validation dataset. In addition, we find that the advantage of CapsCarcino over the other methods is robust and holds on sparse datasets: trained on merely 20% of the dataset, CapsCarcino performs comparably to the other methods trained on the full training dataset. Further mechanism analysis indicates that CapsCarcino can efficiently learn the characteristics of carcinogens even when structural alerts are insufficiently represented. The results indicate that CapsCarcino should be helpful for carcinogen risk assessment.
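The abstract names a dynamic routing algorithm without describing it. As a rough illustration only, the sketch below implements generic routing-by-agreement between two capsule layers in plain Python. It is not the CapsCarcino implementation: the function names, the layer sizes (8 input capsules, 2 output capsules of dimension 4), the three routing iterations, and the random prediction vectors are all assumptions made for the example. One might read the two output capsules as "carcinogen" and "noncarcinogen", but that mapping is likewise an assumption.

```python
import math
import random


def squash(s):
    """Scale vector s so its length lies in [0, 1) while keeping its direction."""
    sq_norm = sum(x * x for x in s)
    if sq_norm == 0.0:
        return [0.0 for _ in s]
    scale = sq_norm / (1.0 + sq_norm) / math.sqrt(sq_norm)
    return [scale * x for x in s]


def softmax(row):
    """Numerically stable softmax over one list of logits."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def dynamic_routing(u_hat, num_iters=3):
    """Generic routing-by-agreement (illustrative, hypothetical helper).

    u_hat[i][j] is the vector that input capsule i predicts for output capsule j.
    Returns the output capsule vectors v and the coupling coefficients c
    that were used to compute them.
    """
    num_in, num_out, dim = len(u_hat), len(u_hat[0]), len(u_hat[0][0])
    b = [[0.0] * num_out for _ in range(num_in)]  # routing logits start at zero
    for _ in range(num_iters):
        # Each input capsule distributes its vote across the output capsules.
        c = [softmax(row) for row in b]
        v = []
        for j in range(num_out):
            # Weighted sum of the predictions for output capsule j, then squash.
            s_j = [sum(c[i][j] * u_hat[i][j][d] for i in range(num_in))
                   for d in range(dim)]
            v.append(squash(s_j))
        # Raise the logit where a prediction agrees with the resulting output.
        for i in range(num_in):
            for j in range(num_out):
                b[i][j] += sum(u_hat[i][j][d] * v[j][d] for d in range(dim))
    return v, c


# Toy input: random prediction vectors stand in for learned molecular features.
random.seed(0)
u_hat = [[[random.gauss(0.0, 1.0) for _ in range(4)] for _ in range(2)]
         for _ in range(8)]
v, c = dynamic_routing(u_hat)
lengths = [math.sqrt(sum(x * x for x in vec)) for vec in v]
print(lengths)  # one length per output capsule, each in [0, 1)
```

In capsule networks of this kind, the length of each output vector is usually interpreted as the confidence that the corresponding class is present.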
Pages: 11