MolGrapher: Graph-based Visual Recognition of Chemical Structures

被引:3
|
作者
Morin, Lucas [1 ,2 ]
Danelljan, Martin [2 ]
Agea, Maria Isabel [1 ]
Nassar, Ahmed [1 ]
Weber, Valery [1 ]
Meijer, Ingmar [1 ]
Staar, Peter [1 ]
Yu, Fisher [2 ]
机构
[1] IBM Res, Zurich, Switzerland
[2] Swiss Fed Inst Technol, Zurich, Switzerland
关键词
EXTRACTION;
D O I
10.1109/ICCV51070.2023.01791
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The automatic analysis of chemical literature has immense potential to accelerate the discovery of new materials and drugs. Much of the critical information in patent documents and scientific articles is contained in figures, depicting the molecule structures. However, automatically parsing the exact chemical structure is a formidable challenge, due to the amount of detailed information, the diversity of drawing styles, and the need for training data. In this work, we introduce MolGrapher to recognize chemical structures visually. First, a deep keypoint detector detects the atoms. Second, we treat all candidate atoms and bonds as nodes and put them in a graph. This construct allows a natural graph representation of the molecule. Last, we classify atom and bond nodes in the graph with a Graph Neural Network. To address the lack of real training data, we propose a synthetic data generation pipeline producing diverse and realistic results. In addition, we introduce a large-scale benchmark of annotated real molecule images, USPTO-30K, to spur research on this critical topic. Extensive experiments on five datasets show that our approach significantly outperforms classical and learning-based methods in most settings. Code, models, and datasets are available (1).
引用
收藏
页码:19495 / 19504
页数:10
相关论文
共 50 条
  • [1] Visual language and graph-based structures in conceptual design
    Slusarczyk, Grazyna
    ADVANCED ENGINEERING INFORMATICS, 2012, 26 (02) : 267 - 279
  • [2] Graph-Based Object Semantic Refinement for Visual Emotion Recognition
    Zhang, Jing
    Liu, Xinyu
    Wang, Zhe
    Yang, Hai
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 3036 - 3049
  • [3] Graph-based Kinship Recognition
    Guo, Yuanhao
    Dibeklioglu, Hamdi
    van der Maaten, Laurens
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 4287 - 4292
  • [4] kGCN: a graph-based deep learning framework for chemical structures
    Kojima, Ryosuke
    Ishida, Shoichi
    Ohta, Masateru
    Iwata, Hiroaki
    Honma, Teruki
    Okuno, Yasushi
    JOURNAL OF CHEMINFORMATICS, 2020, 12 (01)
  • [5] kGCN: a graph-based deep learning framework for chemical structures
    Ryosuke Kojima
    Shoichi Ishida
    Masateru Ohta
    Hiroaki Iwata
    Teruki Honma
    Yasushi Okuno
    Journal of Cheminformatics, 12
  • [6] Graph-Based Consensus Clustering for Combining Multiple Clusterings of Chemical Structures
    Saeed, Faisal
    Salim, Naomie
    Abdo, Ammar
    Hentabli, Hamza
    MOLECULAR INFORMATICS, 2013, 32 (02) : 165 - 178
  • [7] Graph-based visual odometry for VSLAM
    Xu, Shaoyan
    Wang, Tao
    Lang, Congyan
    Feng, Songhe
    Jin, Yi
    INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2018, 45 (05): : 679 - 687
  • [8] GRAPH-BASED RECOGNITION OF MORPHOLOGICAL FEATURES
    GAVANKAR, P
    JOURNAL OF INTELLIGENT MANUFACTURING, 1993, 4 (03) : 209 - 218
  • [9] Graph-based model for object recognition
    Ton, Pham Trong
    Lux, Augustin
    Hai, Tran Thi Thanh
    ICTACS 2006: First International Conference on Theories and Applications of Computer Science 2006, 2007, : 65 - 78
  • [10] Graph-Based Evolution of Visual Languages
    Machado, Penousal
    Nunes, Henrique
    Romero, Juan
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, PT II, PROCEEDINGS, 2010, 6025 : 271 - +