Measuring Semantic Similarity by Latent Relational Analysis

被引:0
|
作者
Turney, Peter D. [1 ]
机构
[1] Natl Res Council Canada, Inst Informat Technol, Ottawa, ON K1A 0R6, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces Latent Relational Analysis (LRA), a method for measuring semantic similarity. LRA measures similarity in the semantic relations between two pairs of words. When two pairs have a high degree of relational similarity, they are analogous. For example, the pair cat: meow is analogous to the pair dog: bark. There is evidence from cognitive science that relational similarity is fundamental to many cognitive and linguistic tasks (e. g., analogical reasoning). In the Vector Space Model (VSM) approach to measuring relational similarity, the similarity between two pairs is calculated by the cosine of the angle between the vectors that represent the two pairs. The elements in the vectors are based on the frequencies of manually constructed patterns in a large corpus. LRA extends the VSM approach in three ways: (1) patterns are derived automatically from the corpus, (2) Singular Value Decomposition is used to smooth the frequency data, and (3) synonyms are used to reformulate word pairs. This paper describes the LRA algorithm and experimentally compares LRA to VSM on two tasks, answering college-level multiple-choice word analogy questions and classifying semantic relations in noun-modifier expressions. LRA achieves state-of-the-art results, reaching human-level performance on the analogy questions and significantly exceeding VSM performance on both tasks.
引用
收藏
页码:1136 / 1141
页数:6
相关论文
共 50 条
  • [21] New Model of Semantic Similarity Measuring in WordNet
    Zhou, Zili
    Wang, Yanna
    Gu, Junzhong
    2008 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2008, : 256 - +
  • [22] On Measuring Semantic Similarity of Business Process models
    Gao Juntao
    Zhang Li
    I-ESA 2009: INTERNATIONAL CONFERENCE ON INTEROPERABILITY FOR ENTERPRISE SOFTWARE AND APPLICATIONS CHINA, PROCEEDINGS, 2009, : 289 - 293
  • [23] COMPARISON OF LATENT SEMANTIC ANALYSIS AND PROBABILISTIC LATENT SEMANTIC ANALYSIS FOR DOCUMENTS CLUSTERING
    Kuta, Marcin
    Kitowski, Jacek
    COMPUTING AND INFORMATICS, 2014, 33 (03) : 652 - 666
  • [24] WWW sits the SAT: Measuring Relational Similarity on the Web
    Bollegala, Danushka
    Matsuo, Yutaka
    Ishizuka, Mitsuru
    ECAI 2008, PROCEEDINGS, 2008, 178 : 333 - +
  • [25] Upgrading Domain Ontology Based on Latent Semantic Analysis and Group Center Similarity Calculation
    Chen, Rung-Ching
    Lee, I-Yan
    Lee, Ya-Ching
    Lo, Yu-lung
    2008 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), VOLS 1-6, 2008, : 1494 - 1499
  • [26] Discriminatively trained spoken document similarity models and their application to probabilistic latent semantic analysis
    Thambiratnam, K.
    Seide, F.
    Yu, P.
    2006 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, 2006, : 42 - +
  • [27] Latent semantic analysis cosines as a cognitive similarity measure: Evidence from priming studies
    Guenther, Fritz
    Dudschig, Carolin
    Kaup, Barbara
    QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 2016, 69 (04): : 626 - 653
  • [28] Learning Spoken Document Similarity and Recommendation using Supervised Probabilistic Latent Semantic Analysis
    Thambiratnam, K.
    Seide, F.
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2840 - 2843
  • [29] Measuring discourse-level processes with verbal protocols and latent semantic analysis
    Millis, Keith
    Magliano, Joseph
    Todaro, Stacey
    SCIENTIFIC STUDIES OF READING, 2006, 10 (03) : 225 - 240
  • [30] Methodology for Similarity Assessment of Relational Data Models and Semantic Ontologies
    Zarembo, Imants
    Teilans, Artis
    Barghorn, Knut
    Merkuryev, Yuri
    Berina, Gundega
    2ND INTERNATIONAL CONFERENCE ON SYSTEMS INFORMATICS, MODELLING AND SIMULATION (SIMS 2016), 2016, : 119 - 123