A text similarity measurement method based on singular value decomposition and semantic relevance

被引:7
|
作者
Li X. [1 ]
Yao C. [1 ]
Fan F. [1 ]
Yu X. [1 ]
机构
[1] School of Information Science and Engineering, Dalian Polytechnic University, Dalian
来源
Li, Xu (lixu102@aliyun.com) | 1600年 / Korea Information Processing Society卷 / 13期
关键词
Natural language processing; Semantic relevance; Singular value decomposition; Text representation; Text similarity measurement;
D O I
10.3745/JIPS.02.0067
中图分类号
学科分类号
摘要
The traditional text similarity measurement methods based on word frequency vector ignore the semantic relationships between words, which has become the obstacle to text similarity calculation, together with the high-dimensionality and sparsity of document vector. To address the problems, the improved singular value decomposition is used to reduce dimensionality and remove noises of the text representation model. The optimal number of singular values is analyzed and the semantic relevance between words can be calculated in constructed semantic space. An inverted index construction algorithm and the similarity definitions between vectors are proposed to calculate the similarity between two documents on the semantic level. The experimental results on benchmark corpus demonstrate that the proposed method promotes the evaluation metrics of F-measure. © 2017 KIPS.
引用
收藏
页码:863 / 875
页数:12
相关论文
共 50 条
  • [21] Is singular value decomposition useful for word similarity extraction?
    Gamallo, Pablo
    Bordag, Stefan
    LANGUAGE RESOURCES AND EVALUATION, 2011, 45 (02) : 95 - 119
  • [22] Is singular value decomposition useful for word similarity extraction?
    Pablo Gamallo
    Stefan Bordag
    Language Resources and Evaluation, 2011, 45 : 95 - 119
  • [23] A Text Similarity Measurement Employs Semantic Dictionary-based Sentiment Analysis
    Tohti, Turdi
    Li, Shasha
    Hamdulla, Askar
    2021 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2021, : 358 - 362
  • [24] A novel singular value decomposition-based similarity measure method for non-local means denoising
    Wang, Yi
    Song, Xiao
    Chen, Kai
    Zhang, Xing
    Tie, Ming
    Fang, Hong
    SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (02) : 403 - 410
  • [25] A novel singular value decomposition-based similarity measure method for non-local means denoising
    Yi Wang
    Xiao Song
    Kai Chen
    Xing Zhang
    Ming Tie
    Hong Fang
    Signal, Image and Video Processing, 2022, 16 : 403 - 410
  • [26] A text similarity measurement combining word semantic information with TF-IDF method
    Huang C.-H.
    Yin J.
    Hou F.
    Jisuanji Xuebao/Chinese Journal of Computers, 2011, 34 (05): : 856 - 864
  • [27] The QIF singular value decomposition method
    Evans, DJ
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2002, 79 (05) : 637 - 645
  • [28] A noisy speech recognition method based on singular value decomposition
    Xu, J.
    Wei, G.
    Leung, S.
    Huanan Ligong Daxue Xuebao/Journal of South China University of Technology (Natural Science), 2001, 29 (01): : 91 - 93
  • [29] An Image Fusion Method Based on Quotient Singular Value Decomposition
    Sun, Yu-Qiu
    Koh, Min-sung
    Rodriguez-Marek, Esteban
    2011 IEEE 54TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2011,
  • [30] A general method for remontaging based on a singular value decomposition algorithm
    Lagerlund, TD
    Sharbrough, FW
    Busacker, NE
    JOURNAL OF CLINICAL NEUROPHYSIOLOGY, 2003, 20 (03) : 179 - 187