Latent semantic structure indexing (LaSSI) for defining chemical similarity

被引:30
|
作者
Hull, RD [1 ]
Singh, SB [1 ]
Nachbar, RB [1 ]
Sheridan, RP [1 ]
Kearsley, SK [1 ]
Fluder, EM [1 ]
机构
[1] Merck Res Labs, Dept Mol Syst, Rahway, NJ 07065 USA
关键词
D O I
10.1021/jm000393c
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
A novel method for computing chemical similarity from chemical substructure descriptors is described. This new method, called LaSSI, uses the singular value decomposition (SVD) of a chemical descriptor-molecule matrix to create a low-dimensional representation of the original descriptor space. Ranking molecules by similarity to a probe molecule in the reduced-dimensional space has several advantages over analogous ranking in the original descriptor space: matching latent structures is more robust than matching discrete descriptors, choosing the number of singular values provides a rational way to vary the "fuzziness" of the search, and the reduction in the dimensionality of the chemical space increases-searching speed. LaSSI also allows the calculation of the similarity between two descriptors and between a descriptor and a molecule.
引用
收藏
页码:1177 / 1184
页数:8
相关论文
共 50 条
  • [1] Chemical similarity searches using latent semantic structural indexing (LaSSI) and comparison to TOPOSIM
    Hull, RD
    Fluder, EM
    Singh, SB
    Nachbar, RB
    Kearsley, SK
    Sheridan, RP
    JOURNAL OF MEDICINAL CHEMISTRY, 2001, 44 (08) : 1185 - 1191
  • [2] Mining the chemical quarry with joint chemical probes: An application of latent semantic structure indexing (LaSSI) and TOPOSIM (dice) to chemical database mining
    Singh, SB
    Sheridan, RP
    Fluder, EM
    Hull, RD
    JOURNAL OF MEDICINAL CHEMISTRY, 2001, 44 (10) : 1564 - 1575
  • [3] A similarity-based probability model for latent semantic indexing
    Ding, CHQ
    SIGIR'99: PROCEEDINGS OF 22ND INTERNATIONAL CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1999, : 58 - 65
  • [4] Similarity-based Matrix Completion Algorithm for Latent Semantic Indexing
    Mirzal, Andri
    2013 IEEE INTERNATIONAL CONFERENCE ON CONTROL SYSTEM, COMPUTING AND ENGINEERING (ICCSCE 2013), 2013, : 79 - 84
  • [5] Text structure analysis based on latent semantic indexing
    Lin, Hongfei
    Zhan, Xuegang
    Yao, Tianshun
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2000, 13 (01): : 47 - 51
  • [6] Enhanced Latent Semantic Indexing Using Cosine Similarity Measures for Medical Application
    Al-Anzi, Fawaz
    AbuZeina, Dia
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2020, 17 (05) : 742 - 749
  • [7] Probabilistic latent semantic indexing
    Hofmann, T
    SIGIR'99: PROCEEDINGS OF 22ND INTERNATIONAL CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 1999, : 50 - 57
  • [8] Regularized Latent Semantic Indexing
    Wang, Quan
    Xu, Jun
    Li, Hang
    Craswell, Nick
    PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 685 - 694
  • [9] INDEXING BY LATENT SEMANTIC ANALYSIS
    DEERWESTER, S
    DUMAIS, ST
    FURNAS, GW
    LANDAUER, TK
    HARSHMAN, R
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1990, 41 (06): : 391 - 407
  • [10] Application of Latent Semantic Indexing for Hindi-English CLIR Irrespective of Context Similarity
    SivaKumar, A. P.
    Premchand, P.
    Govardhan, A.
    TRENDS IN NETWORKS AND COMMUNICATIONS, 2011, 197 : 711 - +