Collective Human Opinions in Semantic Textual Similarity

被引:0
|
作者
Wang, Yuxia [1 ]
Tao, Shimin [2 ]
Xie, Ning
Yang, Hao
Baldwin, Timothy [1 ,3 ]
Verspoor, Karin [1 ,4 ]
机构
[1] Univ Melbourne, Melbourne, Vic, Australia
[2] Huawei TSC, Beijing, Peoples R China
[3] MBZUAI, Abu Dhabi, U Arab Emirates
[4] RMIT Univ, Melbourne, Vic, Australia
关键词
D O I
10.1162/tacl_a_00584
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the subjective nature of semantic textual similarity (STS) and pervasive disagreements in STS annotation, existing benchmarks have used averaged human ratings as gold standard. Averaging masks the true distribution of human opinions on examples of low agreement, and prevents models from capturing the semantic vagueness that the individual ratings represent. In this work, we introduce USTS, the first Uncertainty-aware STS dataset with & SIM;15,000 Chinese sentence pairs and 150,000 labels, to study collective human opinions in STS. Analysis reveals that neither a scalar nor a single Gaussian fits a set of observed judgments adequately. We further show that current STS models cannot capture the variance caused by human disagreement on individual instances, but rather reflect the predictive confidence over the aggregate dataset.
引用
收藏
页码:997 / 1013
页数:17
相关论文
共 50 条
  • [31] Exploiting Syntactic and Semantic Information for Textual Similarity Estimation
    Luo, Jiajia
    Shan, Hongtao
    Zhang, Gaoyu
    Yuan, George
    Zhang, Shuyi
    Yan, Fengting
    Li, Zhiwei
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2021, 2021
  • [32] UESTS: An Unsupervised Ensemble Semantic Textual Similarity Method
    Hassan, Basma
    Abdelrahman, Samir E.
    Bahgat, Reem
    Farag, Ibrahim
    IEEE ACCESS, 2019, 7 : 85462 - 85482
  • [33] A Combination of Enhanced WordNet and BERT for Semantic Textual Similarity
    Ramaiah Institute of Technology, India
    不详
    ACM Int. Conf. Proc. Ser., (191-198):
  • [34] Fine-grained Semantic Textual Similarity for Serbian
    Batanovic, Vuk
    Cvetanovic, Milos
    Nikolic, Bosko
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 1370 - 1378
  • [35] A semantic textual similarity measurement model based on the syntactic-semantic representation
    Tang, Zhuo
    Xiao, Qi
    Zhu, Li
    Li, Kenli
    Li, Keqin
    INTELLIGENT DATA ANALYSIS, 2019, 23 (04) : 933 - 950
  • [36] Spectral Learning of Semantic Units in a Sentence Pair to Evaluate Semantic Textual Similarity
    Mehndiratta, Akanksha
    Asawa, Krishna
    8TH INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS, BDA 2020, 2020, 12581 : 49 - 59
  • [37] A Semantic Logic-Based Approach to Determine Textual Similarity
    Blanco, Eduardo
    Moldovan, Dan
    IEEE Transactions on Audio, Speech and Language Processing, 2015, 23 (04): : 683 - 693
  • [38] Enhancing inter-sentence attention for Semantic Textual Similarity
    Zhao, Ying
    Xia, Tingyu
    Jiang, Yunqi
    Tian, Yuan
    INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (01)
  • [39] Crosslinguistic Semantic Textual Similarity of Buddhist Chinese and Classical Tibetan
    Felbur, Rafal
    Meelen, Marieke
    Vierthaler, Paul
    JOURNAL OF OPEN HUMANITIES DATA, 2022, 8
  • [40] Benchmarking Natural Language Inference and Semantic Textual Similarity for Portuguese
    Fialho, Pedro
    Coheur, Luisa
    Quaresma, Paulo
    INFORMATION, 2020, 11 (10) : 1 - 20