Unveiling the power of language models in chemical research question answering

被引:0
|
作者
Chen, Xiuying [1 ,2 ]
Wang, Tairan [2 ]
Guo, Taicheng [3 ]
Guo, Kehan [3 ]
Zhou, Juexiao [2 ]
Li, Haoyang [2 ]
Song, Zirui [1 ]
Gao, Xin [2 ]
Zhang, Xiangliang [2 ,3 ]
机构
[1] Mohamed Bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab Emirates
[2] King Abdullah Univ Sci & Technol, Jeddah, Saudi Arabia
[3] Univ Notre Dame, Notre Dame, IN USA
来源
COMMUNICATIONS CHEMISTRY | 2025年 / 8卷 / 01期
关键词
D O I
10.1038/s42004-024-01394-x
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
While the abilities of language models are thoroughly evaluated in areas like general domains and biomedicine, academic chemistry remains less explored. Chemical QA tools also play a crucial role in both education and research by effectively translating complex chemical information into an understandable format. Addressing this gap, we introduce ScholarChemQA, a large-scale QA dataset constructed from chemical papers. Specifically, the questions are from paper titles with a question mark, and the multi-choice answers are reasoned out based on the corresponding abstracts. This dataset reflects typical real-world challenges, including an imbalanced data distribution and a substantial amount of unlabeled data that can be potentially useful. Correspondingly, we introduce a ChemMatch model, specifically designed to effectively answer chemical questions by fully leveraging our collected data. Experiments show that Large Language Models (LLMs) still have significant room for improvement in the field of chemistry. Moreover, ChemMatch significantly outperforms recent similar-scale baselines: https://github.com/iriscxy/chemmatch.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Evaluating the Adaptability of Large Language Models for Knowledge-aware Question and Answering
    Thakkar, Jay
    Kolekar, Suresh
    Gite, Shilpa
    Pradhan, Biswajeet
    Alamri, Abdullah
    INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2024, 17 (01):
  • [22] A Comparative Empirical Evaluation of Neural Language Models for Thai Question-Answering
    Zhu, Fangyi
    Laosen, Nasith
    Laosen, Kanjana
    Paripremkul, Kannikar
    Nanthaamornphong, Aziz
    Ng, See-Kiong
    Bressan, Stephane
    2022 37TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2022), 2022, : 120 - 123
  • [23] QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering
    Yasunaga, Michihiro
    Ren, Hongyu
    Bosselut, Antoine
    Liang, Percy
    Leskovec, Jure
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 535 - 546
  • [24] Evaluating Open-Domain Question Answering in the Era of Large Language Models
    Kamalloo, Ehsan
    Dziri, Nouha
    Clarke, Charles L. A.
    Rafiei, Davood
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 5591 - 5606
  • [25] Research on Question Classification for Automatic Question Answering
    Xu, Shihua
    Cheng, Gang
    Kong, Fang
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 218 - 221
  • [26] A medical question answering system using large language models and knowledge graphs
    Guo, Quan
    Cao, Shuai
    Yi, Zhang
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (11) : 8548 - 8564
  • [27] JointLK: Joint Reasoning with Language Models and Knowledge Graphs for Commonsense Question Answering
    Sun, Yueqing
    Shi, Qi
    Qi, Le
    Zhang, Yu
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 5049 - 5060
  • [28] Leveraging Text-to-Text Pretrained Language Models for Question Answering in Chemistry
    Tran, Dan
    Pascazio, Laura
    Akroyd, Jethro
    Mosbach, Sebastian
    Kraft, Markus
    ACS OMEGA, 2024, 9 (12): : 13883 - 13896
  • [29] Incorporating Domain Knowledge and Semantic Information into Language Models for Commonsense Question Answering
    Zhou, Ruiying
    Tian, Keke
    Lai, Hanjiang
    Yin, Jian
    PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 1160 - 1165
  • [30] Toward expert-level medical question answering with large language models
    Singhal, Karan
    Tu, Tao
    Gottweis, Juraj
    Sayres, Rory
    Wulczyn, Ellery
    Amin, Mohamed
    Hou, Le
    Clark, Kevin
    Pfohl, Stephen R.
    Cole-Lewis, Heather
    Neal, Darlene
    Rashid, Qazi Mamunur
    Schaekermann, Mike
    Wang, Amy
    Dash, Dev
    Chen, Jonathan H.
    Shah, Nigam H.
    Lachgar, Sami
    Mansfield, Philip Andrew
    Prakash, Sushant
    Green, Bradley
    Dominowska, Ewa
    Aguera y Arcas, Blaise
    Tomasev, Nenad
    Liu, Yun
    Wong, Renee
    Semturs, Christopher
    Mahdavi, S. Sara
    Barral, Joelle K.
    Webster, Dale R.
    Corrado, Greg S.
    Matias, Yossi
    Azizi, Shekoofeh
    Karthikesalingam, Alan
    Natarajan, Vivek
    NATURE MEDICINE, 2025, : 943 - 950