ChemDB update - full-text search and virtual chemical space

被引:92
|
作者
Chen, Jonathan H. [1 ]
Linstead, Erik [1 ]
Swamidass, S. Joshua [1 ]
Wang, Dennis [1 ]
Baldi, Pierre [1 ]
机构
[1] Univ Calif Irvine, Sch Informat & Comp Sci, Inst Genom & Bioinformat, Irvine, CA 92623 USA
关键词
D O I
10.1093/bioinformatics/btm341
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
ChemDB is a chemical database containing nearly 5M commercially available small molecules, important for use as synthetic building blocks, probes in systems biology and as leads for the discovery of drugs and other useful compounds. The data is publicly available over the web for download and for targeted searches using a variety of powerful methods. The chemical data includes predicted or experimentally determined physicochemical properties, such as 3D structure, melting temperature and solubility. Recent developments include optimization of chemical structure ( and substructure) retrieval algorithms, enabling full database searches in less than a second. A text-based search engine allows efficient searching of compounds based on over 65M annotations from over 150 vendors. When searching for chemicals by name, fuzzy text matching capabilities yield productive results even when the correct spelling of a chemical name is unknown, taking advantage of both systematic and common names. Finally, built in reaction models enable searches through virtual chemical space, consisting of hypothetical products readily synthesizable from the building blocks in ChemDB. Availability: ChemDB and Supplementary Materials are available at http://cdb.ics.uci.edu. Contact: pfbaldi@ics.uci.edu Supplementary information: Supplementary data are available at Bioinformatics online.
引用
收藏
页码:2348 / 2351
页数:4
相关论文
共 50 条
  • [21] Full-text Search for Verifiable Credential Metadata on Distributed Ledgers
    Lux, Zoltan Andras
    Beierle, Felix
    Zickau, Sebastian
    Goendoer, Sebastian
    2019 SIXTH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS: SYSTEMS, MANAGEMENT AND SECURITY (IOTSMS), 2019, : 519 - 528
  • [22] Efficient fuzzy full-text type-ahead search
    Guoliang Li
    Shengyue Ji
    Chen Li
    Jianhua Feng
    The VLDB Journal, 2011, 20 : 617 - 640
  • [23] ACCESS RIGHTS IN ENTERPRISE FULL-TEXT SEARCH Searching Large Intranets Effectively using Virtual Terms
    Kasprzak, Jan
    Brandejs, Michal
    Cuhel, Matej
    Obsivac, Tomas
    ICEIS 2010: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 1: DATABASES AND INFORMATION SYSTEMS INTEGRATION, 2010, : 32 - 39
  • [24] RepoVis: Visual Overviews and Full-Text Search in Software Repositories
    Feiner, Johannes
    Andrews, Keith
    2018 SIXTH IEEE WORKING CONFERENCE ON SOFTWARE VISUALIZATION (VISSOFT), 2018, : 1 - 11
  • [25] FULL-TEXT DATABASES
    TENOPIR, C
    ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 1984, 19 : 215 - 246
  • [26] Humanities full-text
    Williams, H
    LIBRARY JOURNAL, 2003, 128 (05) : 124 - 124
  • [27] Efficient fuzzy full-text type-ahead search
    Li, Guoliang
    Ji, Shengyue
    Li, Chen
    Feng, Jianhua
    VLDB JOURNAL, 2011, 20 (04): : 617 - 640
  • [28] Project ISLA: A space/time/full-text/format search and retrieval system designed by end users
    Hunt, LL
    Wegner, LS
    JOURNAL OF ACADEMIC LIBRARIANSHIP, 1996, 22 (06): : 440 - 449
  • [29] Full-text chemical identification with improved generalizability and tagging consistency
    Kim, Hyunjae
    Sung, Mujeen
    Yoon, Wonjin
    Park, Sungjoon
    Kang, Jaewoo
    DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2022, 2022
  • [30] Proposal of a lightweight, offline, full-text search engine for an mHealth app
    Lopes, Carla Teixeira
    Azevedo, David
    Monteiro, Joao M.
    2022 17TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2022,