Latent semantic structure indexing (LaSSI) for defining chemical similarity

被引:30
|
作者
Hull, RD [1 ]
Singh, SB [1 ]
Nachbar, RB [1 ]
Sheridan, RP [1 ]
Kearsley, SK [1 ]
Fluder, EM [1 ]
机构
[1] Merck Res Labs, Dept Mol Syst, Rahway, NJ 07065 USA
关键词
D O I
10.1021/jm000393c
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
A novel method for computing chemical similarity from chemical substructure descriptors is described. This new method, called LaSSI, uses the singular value decomposition (SVD) of a chemical descriptor-molecule matrix to create a low-dimensional representation of the original descriptor space. Ranking molecules by similarity to a probe molecule in the reduced-dimensional space has several advantages over analogous ranking in the original descriptor space: matching latent structures is more robust than matching discrete descriptors, choosing the number of singular values provides a rational way to vary the "fuzziness" of the search, and the reduction in the dimensionality of the chemical space increases-searching speed. LaSSI also allows the calculation of the similarity between two descriptors and between a descriptor and a molecule.
引用
收藏
页码:1177 / 1184
页数:8
相关论文
共 50 条
  • [41] Gene clustering by Latent Semantic Indexing of MEDLINE abstracts
    Homayouni, R
    Heinrich, K
    Wei, L
    Berry, MW
    BIOINFORMATICS, 2005, 21 (01) : 104 - 115
  • [42] IPSILON: Incremental Parsing for Semantic Indexing of Latent Concepts
    Bae, Soo Hyun
    Juang, Biing-Hwang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2010, 19 (07) : 1933 - 1947
  • [43] Categorization of Malay Documents using Latent Semantic Indexing
    Ab Samat, Nordianah
    Murad, Masrah Azrifah Azmi
    Atan, Rodziah
    Abdullah, Muhammad Taufik
    KMICE 2008 - KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE, 2008 - TRANSFERRING, MANAGING AND MAINTAINING KNOWLEDGE FOR NATION CAPACITY DEVELOPMENT, 2008, : 87 - 91
  • [44] A Weakly Supervised Optimize Method in Latent Semantic Indexing
    Ji, Duo
    Guo, Dongbo
    Cai, Dongfeng
    Bai, Yu
    IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2009, : 308 - 314
  • [45] Document Classification Method based on Latent Semantic Indexing
    Kim, Jeong-Joon
    Lee, Yong-Soo
    Moon, Jin-Yong
    Park, Jeong-Min
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2018, 11 (04): : 97 - 112
  • [46] Intelligent information management using latent semantic indexing
    Berry, MW
    Witter, DI
    MINING AND MODELING MASSIVE DATA SETS IN SCIENCE, ENGINEERING, AND BUSINESS WITH A SUBTHEME IN ENVIRONMENTAL STATISTICS, 1997, 29 (01): : 273 - 280
  • [47] Anomalous Android Application Detection with Latent Semantic Indexing
    Shahriar, Hossain
    Clincy, Victor
    PROCEEDINGS 2016 IEEE 40TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS (COMPSAC), VOL 2, 2016, : 624 - 625
  • [48] THE APPLICATION OF LATENT SEMANTIC INDEXING AND ONTOLOGY IN TEXT CLASSIFICATION
    Yang, Xi-Quan
    Sun, Na
    Sun, Tie-Li
    Cao, Xue-Ya
    Zheng, Xiao-Juan
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2009, 5 (12A): : 4491 - 4499
  • [49] Using latent semantic indexing for multilanguage information retrieval
    Berry, MW
    Young, PG
    COMPUTERS AND THE HUMANITIES, 1995, 29 (06): : 413 - 429
  • [50] Application of latent semantic indexing to processing of noisy text
    Price, RJ
    Zukas, AE
    INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2005, 3495 : 602 - 603