Interpretable Entity Representations through Large-Scale Typing

Cited by: 0
Authors
Onoe, Yasumasa [1]
Durrett, Greg [1]
Affiliations
[1] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In standard methodology for natural language processing, entities in text are typically embedded in dense vector spaces with pre-trained models. The embeddings produced this way are effective when fed into downstream models, but they require end-task fine-tuning and are fundamentally difficult to interpret. In this paper, we present an approach to creating entity representations that are human readable and achieve high performance on entity-related tasks out of the box. Our representations are vectors whose values correspond to posterior probabilities over fine-grained entity types, indicating the confidence of a typing model's decision that the entity belongs to the corresponding type. We obtain these representations using a fine-grained entity typing model, trained either on supervised ultra-fine entity typing data (Choi et al., 2018) or distantly-supervised examples from Wikipedia. On entity probing tasks involving recognizing entity identity, our embeddings used in parameter-free downstream models achieve competitive performance with ELMo- and BERT-based embeddings in trained models. We also show that it is possible to reduce the size of our type set in a learning-based way for particular domains. Finally, we show that these embeddings can be post-hoc modified through a small number of rules to incorporate domain knowledge and improve performance.
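The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The five-item type list, the logits, and the helper names (`type_embedding`, `cosine`, `top_types`, `apply_rule`) are all hypothetical, and the use of an independent sigmoid per type is an assumption made here for simplicity. The sketch shows a vector of per-type probabilities being read directly as types, compared with a parameter-free similarity, and edited by a rule after the fact.

```python
import math

# Hypothetical type vocabulary; the type sets described in the abstract are far larger.
TYPES = ["person", "politician", "athlete", "organization", "city"]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def type_embedding(logits):
    """Map per-type scores to independent probabilities in [0, 1] (assumed form)."""
    return [sigmoid(z) for z in logits]


def cosine(u, v):
    """Parameter-free similarity between two type-probability vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def top_types(embedding, k=2):
    """Read the embedding directly: return the k most probable type names."""
    ranked = sorted(zip(TYPES, embedding), key=lambda pair: pair[1], reverse=True)
    return [name for name, _ in ranked[:k]]


def apply_rule(embedding, type_name, value):
    """Post-hoc rule (illustrative): overwrite one type's probability."""
    edited = list(embedding)
    edited[TYPES.index(type_name)] = value
    return edited


# Made-up logits standing in for a typing model's outputs on three mentions.
mention_a = type_embedding([4.0, 3.0, -3.0, -4.0, -5.0])
mention_b = type_embedding([3.5, 2.5, -2.0, -4.0, -4.5])
mention_c = type_embedding([-4.0, -5.0, -5.0, 1.0, 4.0])
```

With these made-up inputs, `mention_a` and `mention_b` are closer to each other under `cosine` than either is to `mention_c`, and `top_types` exposes why: each dimension has a named meaning, so no trained downstream model is needed to interpret the comparison.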
Pages: 612-624
Page count: 13