Exploiting Syntactic and Semantic Information for Textual Similarity Estimation

被引:3
|
作者
Luo, Jiajia [1 ]
Shan, Hongtao [1 ]
Zhang, Gaoyu [2 ]
Yuan, George [3 ]
Zhang, Shuyi [4 ]
Yan, Fengting [1 ]
Li, Zhiwei [1 ]
机构
[1] Shanghai Univ Engn Sci, Sch Elect & Elect Engn, Shanghai 201620, Peoples R China
[2] Shanghai Lixin Univ Accounting & Finance, Sch Informat Management, Shanghai 201209, Peoples R China
[3] Shanghai Lixin Univ Accounting & Finance, Sch Financial Technol, Shanghai 201209, Peoples R China
[4] Shanghai Lixin Univ Accounting & Finance, Lixin Res Inst, Shanghai 201209, Peoples R China
基金
中国国家自然科学基金;
关键词
Dependency trees - NAtural language processing - Semantic features - Semantic information - Sentence similarity - Syntactic features - Syntactic information - Textual similarities;
D O I
10.1155/2021/4186750
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The textual similarity task, which measures the similarity between two text pieces, has recently received much attention in the natural language processing (NLP) domain. However, due to the vagueness and diversity of language expression, only considering semantic or syntactic features, respectively, may cause the loss of critical textual knowledge. This paper proposes a new type of structure tree for sentence representation, which exploits both syntactic (structural) and semantic information known as the weight vector dependency tree (WVD-tree). WVD-tree comprises structure trees with syntactic information along with word vectors representing semantic information of the sentences. Further, Gaussian attention weight is proposed for better capturing important semantic features of sentences. Meanwhile, we design an enhanced tree kernel to calculate the common parts between two structures for similarity judgment. Finally, WVD-tree is tested on widely used semantic textual similarity tasks. The experimental results prove that WVD-tree can effectively improve the accuracy of sentence similarity judgments.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Semantic Textual Similarity in Quality Estimation
    Bechara, Hanna
    Parra Escartin, Carla
    Orasan, Constantin
    Specia, Lucia
    BALTIC JOURNAL OF MODERN COMPUTING, 2016, 4 (02): : 256 - 268
  • [2] A semantic textual similarity measurement model based on the syntactic-semantic representation
    Tang, Zhuo
    Xiao, Qi
    Zhu, Li
    Li, Kenli
    Li, Keqin
    INTELLIGENT DATA ANALYSIS, 2019, 23 (04) : 933 - 950
  • [3] Textual entailment beyond semantic similarity information
    Vazquez, Sonia
    Kozareva, Zornitsa
    Montoyo, Andres
    MICAI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4293 : 900 - +
  • [4] Exploiting Syntactic and Semantic Information in Coarse Chinese Question Classification
    Kang, Xin
    Wang, Xiaojie
    Ren, Fuji
    IEEE NLP-KE 2008: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2008, : 174 - +
  • [5] Similarity Computation Exploiting the Semantic and Syntactic Inherent Structure Among Job Titles
    Ahuja, Sarthak
    Mondal, Joydeep
    Singh, Sudhanshu Shekhar
    George, David Glenn
    SERVICE-ORIENTED COMPUTING, ICSOC 2017, 2017, 10601 : 3 - 18
  • [7] Measuring the short text similarity based on semantic and syntactic information
    Yang, Jiaqi
    Li, Yongjun
    Gao, Congjie
    Zhang, Yinyin
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 114 : 169 - 180
  • [8] SegStereo: Exploiting Semantic Information for Disparity Estimation
    Yang, Guorun
    Zhao, Hengshuang
    Shi, Jianping
    Deng, Zhidong
    Jia, Jiaya
    COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 660 - 676
  • [9] A Short-Text Similarity Model Combining Semantic and Syntactic Information
    Zhou, Ya
    Li, Cheng
    Huang, Guimin
    Guo, Qingkai
    Li, Hui
    Wei, Xiong
    ELECTRONICS, 2023, 12 (14)
  • [10] Exploiting Rich Syntactic Information for Semantic Parsing with Graph-to-Sequence Model
    Xu, Kun
    Wu, Lingfei
    Wang, Zhiguo
    Yu, Mo
    Chen, Liwei
    Sheinin, Vadim
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 918 - 924