TransPolymer: a Transformer-based language model for polymer property predictions

被引:0
|
作者
Changwen Xu
Yuyang Wang
Amir Barati Farimani
机构
[1] Carnegie Mellon University,Department of Materials Science and Engineering
[2] Carnegie Mellon University,Department of Mechanical Engineering
[3] Carnegie Mellon University,Machine Learning Department
[4] Carnegie Mellon University,Department of Chemical Engineering
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Accurate and efficient prediction of polymer properties is of great significance in polymer design. Conventionally, expensive and time-consuming experiments or simulations are required to evaluate polymer functions. Recently, Transformer models, equipped with self-attention mechanisms, have exhibited superior performance in natural language processing. However, such methods have not been investigated in polymer sciences. Herein, we report TransPolymer, a Transformer-based language model for polymer property prediction. Our proposed polymer tokenizer with chemical awareness enables learning representations from polymer sequences. Rigorous experiments on ten polymer property prediction benchmarks demonstrate the superior performance of TransPolymer. Moreover, we show that TransPolymer benefits from pretraining on large unlabeled dataset via Masked Language Modeling. Experimental results further manifest the important role of self-attention in modeling polymer sequences. We highlight this model as a promising computational tool for promoting rational polymer design and understanding structure-property relationships from a data science view.
引用
收藏
相关论文
共 50 条
  • [41] A Transformer-based Multi-modal Joint Attention Fusion Model for Molecular Property Prediction
    Wang, Ke
    Zhang, Wei
    Liu, Yong
    Proceedings - 2023 2023 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2023, 2023, : 4972 - 4974
  • [42] AMMU: A survey of transformer-based biomedical pretrained language models
    Kalyan, Katikapalli Subramanyam
    Rajasekharan, Ajit
    Sangeetha, Sivanesan
    JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 126
  • [43] A Transformer-based Approach for Translating Natural Language to Bash Commands
    Fu, Quchen
    Teng, Zhongwei
    White, Jules
    Schmidt, Douglas C.
    20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 1245 - 1248
  • [44] Smart Home Notifications in Croatian Language: A Transformer-Based Approach
    Simunec, Magdalena
    Soic, Renato
    2023 17TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS, CONTEL, 2023,
  • [45] Transformer-based language models for mental health issues: A survey
    Greco, Candida M.
    Simeri, Andrea
    Tagarelli, Andrea
    Zumpano, Ester
    PATTERN RECOGNITION LETTERS, 2023, 167 : 204 - 211
  • [46] Pre-trained transformer-based language models for Sundanese
    Wilson Wongso
    Henry Lucky
    Derwin Suhartono
    Journal of Big Data, 9
  • [47] Vision transformer-based visual language understanding of the construction process
    Yang, Bin
    Zhang, Binghan
    Han, Yilong
    Liu, Boda
    Hu, Jiniming
    Jin, Yiming
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 99 : 242 - 256
  • [48] CardioBERTpt: Transformer-based Models for Cardiology Language Representation in Portuguese
    Rubel Schneider, Elisa Terumi
    Gumiel, Yohan Bonescki
    Andrioli de Souza, Joao Vitor
    Mukai, Lilian Mie
    Silva e Oliveira, Lucas Emanuel
    Rebelo, Marina de Sa
    Gutierrez, Marco Antonio
    Krieger, Jose Eduardo
    Teodoro, Douglas
    Moro, Claudia
    Paraiso, Emerson Cabrera
    2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 378 - 381
  • [49] Transformer-Based Composite Language Models for Text Evaluation and Classification
    Skoric, Mihailo
    Utvic, Milos
    Stankovic, Ranka
    MATHEMATICS, 2023, 11 (22)
  • [50] Automatic text summarization using transformer-based language models
    Rao, Ritika
    Sharma, Sourabh
    Malik, Nitin
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2024, 15 (06) : 2599 - 2605