A novel model for semantic similarity measurement based on wordnet and word embedding

Cited by: 5
Authors
Zhao, Fuqiang [1]
Zhu, Zhengyu [1]
Han, Ping [2]
Affiliations
[1] Chongqing Univ, Coll Comp Sci, Chongqing, Peoples R China
[2] Chongqing Univ, Sch Foreign Languages & Cultures, Chongqing, Peoples R China
Keywords
Semantic similarity; WordNet; word embedding; POS; synset
DOI
10.3233/JIFS-202337
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
To measure semantic similarity between words, this paper presents DFRVec, a novel model that encodes multiple kinds of semantic information about a word in WordNet into a vector space. First, three sub-models are proposed: 1) DefVec, which encodes the definitions of a word in WordNet; 2) FormVec, which encodes the part of speech (POS) of a word in WordNet; and 3) RelVec, which encodes the relations of a word in WordNet. Then, by combining the three sub-models with an existing word embedding, a new model for generating the vector of a word is obtained. Finally, based on DFRVec and the path information in WordNet, a new method, DFRVec+Path, is presented for measuring semantic similarity between words. Experiments on ten benchmark datasets show that DFRVec+Path can outperform many existing methods on semantic similarity measurement.
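The abstract does not say how the sub-model vectors are combined or how the path information enters the final score, so the following is only a rough sketch under stated assumptions, not the authors' method. It assumes concatenation of L2-normalized sub-vectors, cosine similarity between the combined vectors, a path score of 1 / (1 + shortest path length), and a linear blend controlled by a hypothetical weight `alpha`. All function names and the toy vectors are illustrative.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def combine(*parts):
    """ASSUMPTION: combine sub-vectors by concatenating their normalized forms.

    The abstract only says the sub-models are "combined" with a word embedding.
    """
    combined = []
    for part in parts:
        combined.extend(l2_normalize(part))
    return combined


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u > 0 and norm_v > 0 else 0.0


def path_similarity(path_length):
    """ASSUMPTION: path score 1 / (1 + number of edges on the shortest path)."""
    return 1.0 / (1.0 + path_length)


def blended_similarity(vec_a, vec_b, path_length, alpha=0.5):
    """ASSUMPTION: linear blend of vector and path scores; alpha is hypothetical."""
    return alpha * cosine(vec_a, vec_b) + (1 - alpha) * path_similarity(path_length)


# Toy 2-dimensional stand-ins for the definition, POS, relation, and
# pre-trained embedding vectors of two words (made-up numbers).
word_a = combine([0.9, 0.1], [1.0, 0.0], [0.2, 0.8], [0.5, 0.5])
word_b = combine([0.8, 0.3], [1.0, 0.0], [0.1, 0.9], [0.4, 0.6])

# Suppose the two words are 2 edges apart in the WordNet hierarchy.
score = blended_similarity(word_a, word_b, path_length=2)
print(round(score, 3))
```

In practice the sub-vectors would come from trained encoders and the path length from a WordNet lookup; the sketch only illustrates the overall shape of a vector-plus-path similarity score.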
Pages: 9831-9842
Number of pages: 12
Related papers
50 in total
  • [31] Short Text Clustering based on Word Semantic Graph with Word Embedding Model
    Jinarat, Supakpong
    Manaskasemsak, Bundit
    Rungsawang, Arnon
    2018 JOINT 10TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 19TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2018, : 1427 - 1432
  • [32] The Semantic Similarity Relation of Entities Discovery: Using Word Embedding
    Ruan, Dong-ru
    Mao, Yu-xin
    Pan, Hong-yan
    Gao, Kai
    2017 9TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION AND CONTROL (ICMIC 2017), 2017, : 845 - 850
  • [33] A survey on word embedding techniques and semantic similarity for paraphrase identification
    Kubal, Divesh R.
    Nimkar, Anant V.
    International Journal of Computational Systems Engineering, 2019, 5 (01) : 36 - 52
  • [34] Automated Short-Answer Grading using Semantic Similarity based on Word Embedding
    Lubis, Fetty Fitriyanti
    Mutaqin
    Putri, Atina
    Waskita, Dana
    Sulistyaningtyas, Tri
    Arman, Arry Akhmad
    Rosmansyah, Yusep
    INTERNATIONAL JOURNAL OF TECHNOLOGY, 2021, 12 (03) : 571 - 581
  • [35] Exploring Semantic Similarity Measure Based on Word Embedding Representation for Arabic Passages Retrieval
    Lahbari, Imane
    El Alaoui, Said Ouatik
    ADVANCED INTELLIGENT SYSTEMS FOR SUSTAINABLE DEVELOPMENT (AI2SD'2020), VOL 2, 2022, 1418 : 978 - 989
  • [36] Concept vector for semantic similarity and relatedness based on WordNet structure
    Liu, Hongzhe
    Bao, Hong
    Xu, De
    JOURNAL OF SYSTEMS AND SOFTWARE, 2012, 85 (02) : 370 - 381
  • [37] Incorporating Prior Knowledge into Word Embedding for Chinese Word Similarity Measurement
    Huang, Degen
    Pei, Jiahuan
    Zhang, Cong
    Huang, Kaiyu
    Ma, Jianjun
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2018, 17 (03)
  • [38] Semantic similarity based food entities recognition using WordNet
    Butt, Sahrish
    Bakhtyar, Maheen
    Noor, Waheed
    Baber, Junaid
    Ullah, Ihsan
    Ahmed, Atiq
    Basit, Abdul
    Kakar, M. Saeed H.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (02) : 2069 - 2078
  • [39] PageRank based Semantic Similarity Measure on a Graph based Turkish WordNet
    Tulu, Cagatay
    Orhan, Umut
    2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2017, : 468 - 473
  • [40] Improving Vietnamese WordNet using word embedding
    Khang Nhut Lam
    Tuan Huynh To
    Thong Tri Tran
    Kalita, Jugal
    NLPIR 2019: 2019 3RD INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, 2019, : 110 - 114