Similarity maps - a visualization strategy for molecular fingerprints and machine-learning methods

被引:128
|
作者
Riniker, Sereina [1 ]
Landrum, Gregory A. [1 ]
机构
[1] Novartis Inst BioMed Res, Basel, Switzerland
来源
JOURNAL OF CHEMINFORMATICS | 2013年 / 5卷
关键词
Visualization; Machine-learning; Similarity; Fingerprints; DOPAMINE D3 RECEPTOR; LIGANDS; DESIGN; MODELS;
D O I
10.1186/1758-2946-5-43
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Fingerprint similarity is a common method for comparing chemical structures. Similarity is an appealing approach because, with many fingerprint types, it provides intuitive results: a chemist looking at two molecules can understand why they have been determined to be similar. This transparency is partially lost with the fuzzier similarity methods that are often used for scaffold hopping and tends to vanish completely when molecular fingerprints are used as inputs to machine-learning (ML) models. Here we present similarity maps, a straightforward and general strategy to visualize the atomic contributions to the similarity between two molecules or the predicted probability of a ML model. We show the application of similarity maps to a set of dopamine D3 receptor ligands using atom-pair and circular fingerprints as well as two popular ML methods: random forests and naive Bayes. An open-source implementation of the method is provided.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Modeling the Vibrational Relaxation Rate Using Machine-Learning Methods
    Bushmakova, M. A.
    Kustova, E. V.
    VESTNIK ST PETERSBURG UNIVERSITY-MATHEMATICS, 2022, 55 (01) : 87 - 95
  • [42] Application of Machine-Learning Methods to Understand Gene Expression Regulation
    Cheng, Chao
    Worzel, William P.
    GENETIC PROGRAMMING THEORY AND PRACTICE XII, 2015, : 1 - 15
  • [43] Machine-Learning Methods for Earthquake Ground Motion Analysis and Simulation
    Alimoradi, Arzhang
    Beck, James L.
    JOURNAL OF ENGINEERING MECHANICS, 2015, 141 (04)
  • [44] Advanced Machine-Learning Methods for Brain-Computer Interfacing
    Lv, Zhihan
    Qiao, Liang
    Wang, Qingjun
    Piccialli, Francesco
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (05) : 1688 - 1698
  • [45] Prediction of Settling Velocity of Microplastics by Multiple Machine-Learning Methods
    Leng, Zequan
    Cao, Lu
    Gao, Yun
    Hou, Yadong
    Wu, Di
    Huo, Zhongyan
    Zhao, Xizeng
    WATER, 2024, 16 (13)
  • [46] Reliable and explainable machine-learning methods for accelerated material discovery
    Kailkhura, Bhavya
    Gallagher, Brian
    Kim, Sookyung
    Hiszpanski, Anna
    Han, T. Yong-Jin
    NPJ COMPUTATIONAL MATERIALS, 2019, 5 (1)
  • [47] Reliable and explainable machine-learning methods for accelerated material discovery
    Bhavya Kailkhura
    Brian Gallagher
    Sookyung Kim
    Anna Hiszpanski
    T. Yong-Jin Han
    npj Computational Materials, 5
  • [48] Can machine-learning methods really help predict suicide?
    McHugh, Catherine M.
    Large, Matthew M.
    CURRENT OPINION IN PSYCHIATRY, 2020, 33 (04) : 369 - 374
  • [49] Ensemble of Machine-Learning Methods for Predicting Gully Erosion Susceptibility
    Pal, Subodh Chandra
    Arabameri, Alireza
    Blaschke, Thomas
    Chowdhuri, Indrajit
    Saha, Asish
    Chakrabortty, Rabin
    Lee, Saro
    Band, Shahab. S.
    REMOTE SENSING, 2020, 12 (22) : 1 - 25
  • [50] Risk estimation and risk prediction using machine-learning methods
    Kruppa, Jochen
    Ziegler, Andreas
    Koenig, Inke R.
    HUMAN GENETICS, 2012, 131 (10) : 1639 - 1654