Semantic similarity is not enough: A novel NLP-based semantic similarity measure in context

Authors
Abbasi, Omid Reza [1 ]
Alesheikh, Ali Asghar [1 ]
Lotfata, Aynaz [2 ]
Affiliations
[1] KN Toosi Univ Technol, Dept Geospatial Informat Syst, Tehran, Iran
[2] Univ Calif Davis, Sch Vet Med, Dept Pathol Microbiol & Immunol, Davis, CA 95616 USA
Keywords
Computer science; Geographical information science; Machine learning
DOI
10.1016/j.isci.2024.109883
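Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09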
Abstract
In this study, we addressed two primary challenges: first, domain shift, that is, changes in data characteristics or context that can degrade model performance; and second, the discrepancy between semantic similarity and geographical distance. We employed topic modeling in conjunction with the BERT architecture. Our model was designed to improve similarity computations on geospatial text by integrating both semantic similarity and geographical proximity. We tested the model on two datasets: Persian Wikipedia articles and rental property advertisements. The findings demonstrate that the model improved the correlation between semantic similarity and geographical distance. Furthermore, evaluation by real-world users within a recommender system context revealed a notable increase in user satisfaction of approximately 22% for Wikipedia articles and 56% for advertisements.
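The abstract does not specify how semantic similarity and geographical proximity are integrated, so the following is only a minimal sketch of one generic way to blend the two signals; it is not the authors' model. It assumes text embeddings are already available as plain lists of floats (for example from a BERT-style encoder), and the names `combined_similarity`, `alpha`, and `scale_km`, the exponential distance decay, and the linear weighting are all hypothetical choices made for illustration.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two (lat, lon) points in degrees."""
    earth_radius_km = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2
    # Clamp to 1.0 to guard against floating-point overshoot before asin.
    return 2 * earth_radius_km * math.asin(math.sqrt(min(1.0, a)))


def geo_proximity(distance_km, scale_km=10.0):
    """Map a distance to a proximity score in (0, 1]; the exponential decay is an assumption."""
    return math.exp(-distance_km / scale_km)


def combined_similarity(emb_a, emb_b, loc_a, loc_b, alpha=0.5, scale_km=10.0):
    """Hypothetical blend: alpha weights the semantic part, (1 - alpha) the geographic part."""
    semantic = cosine_similarity(emb_a, emb_b)
    distance = haversine_km(loc_a[0], loc_a[1], loc_b[0], loc_b[1])
    return alpha * semantic + (1 - alpha) * geo_proximity(distance, scale_km)
```

With this sketch, two texts with identical embeddings score lower when their locations are farther apart, which is the qualitative behavior the abstract describes. The weight `alpha` and the decay scale `scale_km` would need to be tuned for any real dataset.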
Pages: 15