Adversarial Modality Alignment Network for Cross-Modal Molecule Retrieval

被引:7
|
作者
Zhao W. [1 ,2 ]
Zhou D. [3 ]
Cao B. [1 ]
Zhang K. [2 ]
Chen J. [2 ]
机构
[1] Hunan University of Science and Technology, School of Computer Science and Engineering, Xiangtan
[2] Swinburne University of Technology, Department of Computing Technologies, Melbourne, 3122, VIC
[3] Guangdong University of Foreign Studies, School of Information Science and Technology, Guangzhou
来源
关键词
Cross-modal molecule retrieval (Text2Mol); graph transformer network (GTN); modality alignment; molecule representation;
D O I
10.1109/TAI.2023.3254518
中图分类号
学科分类号
摘要
The cross-modal molecule retrieval (Text2Mol) task aims to bridge the semantic gap between molecules and natural language descriptions. A solution to this nontrivial problem relies on a graph convolutional network (GCN) and cross-modal attention with contrastive learning for reasonable results. However, there exist the following issues. First, the cross-modal attention mechanism is only in favor of text representations and cannot provide helpful information for molecule representations. Second, the GCN-based molecule encoder ignores edge features and the importance of various substructures of a molecule. Finally, the retrieval learning loss function is rather simplistic. This article further investigates the Text2Mol problem and proposes a novel adversarial modality alignment network (AMAN) based method to sufficiently learn both description and molecule information. Our method utilizes a SciBERT as a text encoder and a graph transformer network as a molecule encoder to generate multimodal representations. Then, an adversarial network is used to align these modalities interactively. Meanwhile, a triplet loss function is leveraged to perform retrieval learning and further enhance the modality alignment. Experiments on the ChEBI-20 dataset show the effectiveness of our AMAN method compared with baselines. © 2020 IEEE.
引用
收藏
页码:278 / 289
页数:11
相关论文
共 50 条
  • [1] Modality-specific and shared generative adversarial network for cross-modal retrieval
    Wu, Fei
    Jing, Xiao-Yuan
    Wu, Zhiyong
    Ji, Yimu
    Dong, Xiwei
    Luo, Xiaokai
    Huang, Qinghua
    Wang, Ruchuan
    PATTERN RECOGNITION, 2020, 104
  • [2] Category Alignment Adversarial Learning for Cross-Modal Retrieval
    He, Shiyuan
    Wang, Weiyang
    Wang, Zheng
    Xu, Xing
    Yang, Yang
    Wang, Xiaoming
    Shen, Heng Tao
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (05) : 4527 - 4538
  • [3] Multimodal adversarial network for cross-modal retrieval
    Hu, Peng
    Peng, Dezhong
    Wang, Xu
    Xiang, Yong
    KNOWLEDGE-BASED SYSTEMS, 2019, 180 : 38 - 50
  • [4] Annotation Efficient Cross-Modal Retrieval with Adversarial Attentive Alignment
    Huang, Po-Yao
    Kang, Guoliang
    Liu, Wenhe
    Chang, Xiaojun
    Hauptmann, Alexander G.
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1758 - 1767
  • [5] Adversarial Cross-Modal Retrieval
    Wang, Bokun
    Yang, Yang
    Xu, Xing
    Hanjalic, Alan
    Shen, Heng Tao
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 154 - 162
  • [6] DEEP ADVERSARIAL QUANTIZATION NETWORK FOR CROSS-MODAL RETRIEVAL
    Zhou, Yu
    Feng, Yong
    Zhou, Mingliang
    Qiang, Baohua
    Hou, Leong U.
    Zhu, Jiajie
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4325 - 4329
  • [7] Adversarial Graph Convolutional Network for Cross-Modal Retrieval
    Dong, Xinfeng
    Liu, Li
    Zhu, Lei
    Nie, Liqiang
    Zhang, Huaxiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (03) : 1634 - 1645
  • [8] Modality-Fused Graph Network for Cross-Modal Retrieval
    Wu, Fei
    LI, Shuaishuai
    Peng, Guangchuan
    Ma, Yongheng
    Jing, Xiao-Yuan
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2023, E106D (05) : 1094 - 1097
  • [9] Information Aggregation Semantic Adversarial Network for Cross-Modal Retrieval
    Wang, Hongfei
    Feng, Aimin
    Liu, Xuejun
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [10] MHTN: Modal-Adversarial Hybrid Transfer Network for Cross-Modal Retrieval
    Huang, Xin
    Peng, Yuxin
    Yuan, Mingkuan
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (03) : 1047 - 1059