Semantic textual similarity between sentences using bilingual word semantics

Cited by: 21
Authors
Shajalal, Md [1]
Aono, Masaki [2]
Affiliations
[1] Bangladesh Agr Univ, Dept Comp Sci & Math, Mymensingh 2202, Bangladesh
[2] Toyohashi Univ Technol, Dept Comp Sci & Engn, Toyohashi, Aichi, Japan
Funding
Japan Society for the Promotion of Science
Keywords
Semantic similarity; Word semantics; Word-embedding; Textual similarity; Bilingual semantics;
DOI
10.1007/s13748-019-00180-4
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Semantic textual similarity between sentences is indispensable for many information retrieval tasks. Traditional lexical similarity measures cannot compute similarity beyond a trivial level. Moreover, they can capture only textual similarity, not semantic similarity. In this paper, we propose a method for semantic textual similarity that leverages bilingual word-level semantics to compute the semantic similarity between sentences. To capture word-level semantics, we employ distributed representations of words in two different languages. A similarity function based on the concept-to-concept relationships corresponding to the words is also used for the same purpose. Multiple new semantic similarity measures are introduced based on word-embedding models trained on two different corpora in two different languages. In addition, another new semantic similarity measure is introduced using word sense comparison. The similarity score between the sentences is then computed by applying a linear ranking approach to all proposed measures, with their importance scores estimated by a supervised feature selection technique. We conducted experiments on the SemEval Semantic Textual Similarity (STS-2017) test collections. The experimental results demonstrate that our method is effective for measuring semantic textual similarity and outperforms some known related methods.
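The general idea of combining several embedding-based similarity measures with importance weights can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the individual measures, so averaging word vectors and taking their cosine is an assumption made here for illustration. The function names, the toy embedding tables, and the weights are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is empty or all-zero."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def sentence_vector(tokens, embeddings):
    """Average the vectors of the tokens found in the embedding table (assumed measure)."""
    vectors = [embeddings[t] for t in tokens if t in embeddings]
    if not vectors:
        return []
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def embedding_similarity(tokens_a, tokens_b, embeddings):
    """One similarity measure: cosine between the averaged sentence vectors."""
    return cosine(sentence_vector(tokens_a, embeddings),
                  sentence_vector(tokens_b, embeddings))


def combined_score(measures, weights):
    """Weighted linear combination of measures, normalized by the sum of the weights."""
    return sum(w * m for w, m in zip(weights, measures)) / sum(weights)


# Hypothetical toy embeddings for two languages (not real trained models).
emb_lang1 = {"cat": [1.0, 0.0], "dog": [0.0, 1.0]}
emb_lang2 = {"gato": [1.0, 0.0], "perro": [0.0, 1.0]}

# The same sentence pair, tokenized in each language.
sim_lang1 = embedding_similarity(["cat"], ["cat"], emb_lang1)
sim_lang2 = embedding_similarity(["gato"], ["perro"], emb_lang2)

# Hypothetical importance weights; the paper estimates these with
# supervised feature selection, which this sketch does not reproduce.
score = combined_score([sim_lang1, sim_lang2], [3.0, 1.0])
```

In this sketch each language contributes one measure, and the weights decide how much each measure influences the final score.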
Pages: 263-272
Number of pages: 10