Semantics-Aware BERT for Language Understanding

被引:0
|
作者
Zhang, Zhuosheng [1 ,2 ,3 ]
Wu, Yuwei [1 ,2 ,3 ,4 ]
Zhao, Hai [1 ,2 ,3 ]
Li, Zuchao [1 ,2 ,3 ]
Zhang, Shuailiang [1 ,2 ,3 ]
Zhou, Xi [5 ]
Zhou, Xiang [5 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, Key Lab Shanghai Educ Commiss Intelligent Interac, Shanghai, Peoples R China
[3] Shanghai Jiao Tong Univ, AI Inst, MoE Key Lab Artificial Intelligence, Shanghai, Peoples R China
[4] Shanghai Jiao Tong Univ, Coll Zhiyuan, Shanghai, Peoples R China
[5] CloudWalk Technol, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The latest work on language representations carefully integrates contextualized features into language model training, which enables a series of success especially in various machine reading comprehension and natural language inference tasks. However, the existing language representation models including ELMo, GPT and BERT only exploit plain context-sensitive features such as character or word embeddings. They rarely consider incorporating structured semantic information which can provide rich semantics for language representation. To promote natural language understanding, we propose to incorporate explicit contextual semantics from pre-trained semantic role labeling, and introduce an improved language representation model, Semantics-aware BERT (SemBERT), which is capable of explicitly absorbing contextual semantics over a BERT backbone. SemBERT keeps the convenient usability of its BERT precursor in a light fine-tuning way without substantial task-specific modifications. Compared with BERT, semantics-aware BERT is as simple in concept but more powerful. It obtains new state-of-the-art or substantially improves results on ten reading comprehension and language inference tasks.
引用
收藏
页码:9628 / 9635
页数:8
相关论文
共 50 条
  • [41] SemSwap: Semantics-aware Swapping in Memory Disaggregated Datacenters
    Cui, Siwei
    Jin, Liuyi
    Khanh Nguyen
    Wang, Chenxi
    PROCEEDINGS OF THE 13TH ACM SIGOPS ASIA-PACIFIC WORKSHOP ON SYSTEMS, APSYS 2022, 2022, : 9 - 17
  • [42] A Semantics-Aware Classification Approach for Data Leakage Prevention
    Alneyadi, Sultan
    Sithirasenan, Elankayer
    Muthukkumarasamy, Vallipuram
    INFORMATION SECURITY AND PRIVACY, ACISP 2014, 2014, 8544 : 413 - 421
  • [43] Toward semantics-aware management of intellectual property rights
    Damiani, Ernesto
    Fugazza, Cristiano
    ONLINE INFORMATION REVIEW, 2007, 31 (01) : 59 - 72
  • [44] Invited talk - Towards semantics-aware access control
    Damiani, E
    di Vimercati, SD
    RESEARCH DIRECTIONS IN DATA AND APPLICATIONS SECURITY XVIII, 2004, 144 : 177 - 188
  • [45] Toward semantics-aware annotation and retrieval of spatial data
    Cristiano Fugazza
    Earth Science Informatics, 2011, 4 : 225 - 239
  • [46] A Semantics-Aware Approach to the Automated Network Protocol Identification
    Yun, Xiaochun
    Wang, Yipeng
    Zhang, Yongzheng
    Zhou, Yu
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2016, 24 (01) : 583 - 595
  • [47] A SEMANTICS-AWARE NORMALIZING FLOW MODEL FOR ANOMALY DETECTION
    Ma, Wei
    Lan, Shiyong
    Huang, Weikang
    Wang, Wenwu
    Yang, Hongyu
    Ma, Yitong
    Ma, Yongjie
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2207 - 2212
  • [48] Semantics-Aware Source Coding in Status Update Systems
    Agheli, Pouya
    Pappas, Nikolaos
    Kountouris, Marios
    2022 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2022, : 169 - 174
  • [49] Toward semantics-aware annotation and retrieval of spatial data
    Fugazza, Cristiano
    EARTH SCIENCE INFORMATICS, 2011, 4 (04) : 225 - 239
  • [50] Incorporating BERT With Probability-Aware Gate for Spoken Language Understanding
    Mei, Jie
    Wang, Yufan
    Tu, Xinhui
    Dong, Ming
    He, Tingting
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 826 - 834