ChemDB update - full-text search and virtual chemical space

被引:92
|
作者
Chen, Jonathan H. [1 ]
Linstead, Erik [1 ]
Swamidass, S. Joshua [1 ]
Wang, Dennis [1 ]
Baldi, Pierre [1 ]
机构
[1] Univ Calif Irvine, Sch Informat & Comp Sci, Inst Genom & Bioinformat, Irvine, CA 92623 USA
关键词
D O I
10.1093/bioinformatics/btm341
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
ChemDB is a chemical database containing nearly 5M commercially available small molecules, important for use as synthetic building blocks, probes in systems biology and as leads for the discovery of drugs and other useful compounds. The data is publicly available over the web for download and for targeted searches using a variety of powerful methods. The chemical data includes predicted or experimentally determined physicochemical properties, such as 3D structure, melting temperature and solubility. Recent developments include optimization of chemical structure ( and substructure) retrieval algorithms, enabling full database searches in less than a second. A text-based search engine allows efficient searching of compounds based on over 65M annotations from over 150 vendors. When searching for chemicals by name, fuzzy text matching capabilities yield productive results even when the correct spelling of a chemical name is unknown, taking advantage of both systematic and common names. Finally, built in reaction models enable searches through virtual chemical space, consisting of hypothetical products readily synthesizable from the building blocks in ChemDB. Availability: ChemDB and Supplementary Materials are available at http://cdb.ics.uci.edu. Contact: pfbaldi@ics.uci.edu Supplementary information: Supplementary data are available at Bioinformatics online.
引用
收藏
页码:2348 / 2351
页数:4
相关论文
共 50 条
  • [1] Semantic Full-text Search with Broccoli
    Bast, Hannah
    Baurle, Florian
    Buchhold, Bjoern
    Haussmann, Elmar
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 1265 - 1266
  • [2] Fast and Exact Nearest Neighbor Search in Hamming Space on Full-Text Search Engines
    Mu, Cun
    Zhao, Jun
    Yang, Guang
    Yang, Binwei
    Yan, Zheng
    SIMILARITY SEARCH AND APPLICATIONS (SISAP 2019), 2019, 11807 : 49 - 56
  • [3] Full-text Search Using Database Index
    Chaitanya, B. Sri Sai Krishna
    Reddy, D. Ajay Kumar
    Chandra, B. Pavan Sai Eshwar
    Krishna, A. Bala
    Menon, Remya R. K.
    2019 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2019,
  • [4] Preparing heterogeneous XML for full-text search
    Lehtonen, Miro
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2006, 24 (04) : 455 - 474
  • [5] Expressiveness and performance of full-text search languages
    Botev, Chavdar
    Amer-Yahia, Sihem
    Shanmugasundaram, Jayavel
    ADVANCES IN DATABASE TECHNOLOGY - EDBT 2006, 2006, 3896 : 349 - 367
  • [6] An Index for Efficient Semantic Full-Text Search
    Bast, Hannah
    Buchhold, Bjoern
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 369 - 378
  • [7] Full-Text Search Engine using MySQL
    Gyorodi, C.
    Gyorodi, R.
    Pecherle, G.
    Cornea, G. M.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2010, 5 (05) : 735 - 743
  • [8] Study and implementation of the full-text search for hypertext
    Zheng, Q.H.
    Zhang, W.
    2001, Xi'an Jiaotong University (35):
  • [9] Addition of chemical search capabilities to PATENTSCOPE: Turning a full-text search system into a chemistry database
    Eiblmaier, Josef
    Mazenc, Christophe
    Geppert, Dorothee
    Isenko, Larisa
    Saller, Heinz
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 253
  • [10] Bitlist: New Full-text Index for Low Space Cost and Efficient Keyword Search
    Rao, Weixiong
    Chen, Lei
    Hui, Pan
    Tarkoma, Sasu
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (13): : 1522 - 1533