Semantics-Aware BERT for Language Understanding

被引:0
|
作者
Zhang, Zhuosheng [1 ,2 ,3 ]
Wu, Yuwei [1 ,2 ,3 ,4 ]
Zhao, Hai [1 ,2 ,3 ]
Li, Zuchao [1 ,2 ,3 ]
Zhang, Shuailiang [1 ,2 ,3 ]
Zhou, Xi [5 ]
Zhou, Xiang [5 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, Key Lab Shanghai Educ Commiss Intelligent Interac, Shanghai, Peoples R China
[3] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai, Peoples R China
[4] Shanghai Jiao Tong Univ, Coll Zhiyuan, Shanghai, Peoples R China
[5] CloudWalk Technol, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The latest work on language representations carefully integrates contextualized features into language model training, which enables a series of success especially in various machine reading comprehension and natural language inference tasks. However, the existing language representation models including ELMo, GPT and BERT only exploit plain context-sensitive features such as character or word embeddings. They rarely consider incorporating structured semantic information which can provide rich semantics for language representation. To promote natural language understanding, we propose to incorporate explicit contextual semantics from pre-trained semantic role labeling, and introduce an improved language representation model, Semantics-aware BERT (SemBERT), which is capable of explicitly absorbing contextual semantics over a BERT backbone. SemBERT keeps the convenient usability of its BERT precursor in a light fine-tuning way without substantial task-specific modifications. Compared with BERT, semantics-aware BERT is as simple in concept but more powerful. It obtains new state-of-the-art or substantially improves results on ten reading comprehension and language inference tasks.
引用
收藏
页码:9628 / 9635
页数:8
相关论文
共 50 条
  • [1] Semantics-Aware Inferential Network for Natural Language Understanding
    Zhang, Shuailiang
    Zhao, Hai
    Zhou, Junru
    Zhou, Xi
    Zhou, Xiang
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14437 - 14445
  • [2] A semantics-aware approach for multilingual natural language inference
    Phuong Le-Hong
    Erik Cambria
    Language Resources and Evaluation, 2023, 57 : 611 - 639
  • [3] A semantics-aware approach for multilingual natural language inference
    Le-Hong, Phuong
    Cambria, Erik
    LANGUAGE RESOURCES AND EVALUATION, 2023, 57 (02) : 611 - 639
  • [4] Semantics-Aware Autoencoder
    Bellini, Vito
    Di Noia, Tommaso
    Di Sciascio, Eugenio
    Schiavone, Angelo
    IEEE ACCESS, 2019, 7 : 166122 - 166137
  • [5] LAIR: A Language for Automated Semantics-Aware Text Sanitization based on Frame Semantics
    Hedegaard, Steffen
    Houen, Soren
    Simonsen, Jakob Grue
    2009 IEEE THIRD INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2009), 2009, : 47 - 52
  • [6] Semantics-aware Motion Retargeting with Vision-Language Models
    Zhang, Haodong
    Chen, Zhike
    Xu, Haocheng
    Hao, Lei
    Wu, Xiaofei
    Xu, Songcen
    Zhang, Zhensong
    Wang, Yue
    Xiong, Rong
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, : 2155 - 2164
  • [7] Semantics-aware malware detection
    Christodorescu, M
    Jha, S
    Seshia, SA
    Song, D
    Bryant, RE
    2005 IEEE SYMPOSIUM ON SECURITY AND PRIVACY, PROCEEDINGS, 2005, : 32 - 46
  • [8] Semantics-aware perimeter protection
    Cremonini, M
    Damiani, E
    Samarati, P
    DATA AND APPLICATIONS SECURITY XVII: STATUS AND PROSPECTS, 2004, 142 : 229 - 242
  • [9] Semantics-Aware Trace Analysis
    Hoffman, Kevin
    Eugster, Patrick
    Jagannathan, Suresh
    PLDI'09 PROCEEDINGS OF THE 2009 ACM SIGPLAN CONFERENCE ON PROGRAMMING LANGUAGE DESIGN AND IMPLEMENTATION, 2009, : 453 - 464
  • [10] Semantics-Aware Trace Analysis
    Hoffman, Kevin
    Eugster, Patrick
    Jagannathan, Suresh
    ACM SIGPLAN NOTICES, 2009, 44 (06) : 453 - 464