A text similarity measurement method based on singular value decomposition and semantic relevance

被引:7
|
作者
Li X. [1 ]
Yao C. [1 ]
Fan F. [1 ]
Yu X. [1 ]
机构
[1] School of Information Science and Engineering, Dalian Polytechnic University, Dalian
来源
Li, Xu (lixu102@aliyun.com) | 1600年 / Korea Information Processing Society卷 / 13期
关键词
Natural language processing; Semantic relevance; Singular value decomposition; Text representation; Text similarity measurement;
D O I
10.3745/JIPS.02.0067
中图分类号
学科分类号
摘要
The traditional text similarity measurement methods based on word frequency vector ignore the semantic relationships between words, which has become the obstacle to text similarity calculation, together with the high-dimensionality and sparsity of document vector. To address the problems, the improved singular value decomposition is used to reduce dimensionality and remove noises of the text representation model. The optimal number of singular values is analyzed and the semantic relevance between words can be calculated in constructed semantic space. An inverted index construction algorithm and the similarity definitions between vectors are proposed to calculate the similarity between two documents on the semantic level. The experimental results on benchmark corpus demonstrate that the proposed method promotes the evaluation metrics of F-measure. © 2017 KIPS.
引用
收藏
页码:863 / 875
页数:12
相关论文
共 50 条
  • [41] Measurement of Turkish Word Semantic Similarity and Text Categorization Application
    Amasyah, M. Fatih
    Beken, Aytunc
    2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 1 - 4
  • [42] Updating the partial singular value decomposition in latent semantic indexing
    Tougas, Jane E.
    Spiteri, Raymond J.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) : 174 - 183
  • [43] Visual support for text information retrieval based on matrix's singular value decomposition
    Hou, JY
    Zhang, YC
    Cao, JL
    Lai, W
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, VOL I, 2000, : 344 - 351
  • [44] Text encryption by image encryption key with based on Generalized Singular Value Decomposition (GSVD)
    Abdul-Hameed, Mohammed
    JOURNAL OF INTERDISCIPLINARY MATHEMATICS, 2023, 26 (06) : 1319 - 1327
  • [45] Hybrid singular value decomposition: A model of human text classification
    Noorinaeini, Amirali
    Lehto, Mark R.
    Wu, Sze-Jung
    HUMAN INTERFACE AND THE MANAGEMENT OF INFORMATION: METHODS, TECHNIQUES AND TOOLS IN INFORMATION DESIGN, PT 1, PROCEEDINGS, 2007, 4557 : 517 - 525
  • [46] A novel method based on symbolic regression for interpretable semantic similarity measurement
    Martinez-Gil, Jorge
    Chaves-Gonzalez, Jose M.
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 160 (160)
  • [47] Watermark Based on Singular Value Decomposition
    Qazzaz, Ali Abdulazeez Mohammed Baqer
    Kadhim, Neamah Enad
    BAGHDAD SCIENCE JOURNAL, 2023, 20 (05) : 1797 - 1807
  • [48] A damage localization method based on the singular value decomposition (SVD) for plates
    Yang, Zhi-Bo
    Yu, Jin-Tao
    Tian, Shao-Hua
    Chen, Xue-Feng
    Xu, Guan-Ji
    SMART STRUCTURES AND SYSTEMS, 2018, 22 (05) : 621 - 630
  • [49] A New Blind Adaptive Watermarking Method Based on Singular Value Decomposition
    Zolotavkin, Yevhen
    Juhola, Martti
    2013 INTERNATIONAL CONFERENCE ON SENSOR NETWORK SECURITY TECHNOLOGY AND PRIVACY COMMUNICATION SYSTEM (SNS & PCS), 2013, : 184 - 192
  • [50] A New Support Vector Compression Method Based on Singular Value Decomposition
    Yoon, Sang-Hun
    Lyuh, Chun-Gi
    Chun, Ik-Jae
    Suk, Jung-Hee
    Roh, Tae Moon
    ETRI JOURNAL, 2011, 33 (04) : 652 - 655