Extracting semantic representations from word co-occurrence statistics: A computational study

被引:385
|
作者
Bullinaria, John A. [1 ]
Levy, Joseph P.
机构
[1] Univ Birmingham, Sch Comp Sci, Birmingham B15 2TT, W Midlands, England
[2] Roehampton Univ, London, England
关键词
D O I
10.3758/BF03193020
中图分类号
B841 [心理学研究方法];
学科分类号
040201 ;
摘要
The idea that at least some aspects of word meaning can be induced from patterns of word co-occurrence is becoming increasingly popular. However, there is less agreement about the precise computations involved, and the appropriate tests to distinguish between the various possibilities. It is important that the effect of the relevant design choices and parameter values are understood if psychological models using these methods are to be reliably evaluated and compared. In this article, we present a systematic exploration of the principal computational possibilities for formulating and validating representations of word meanings from word co-occurrence statistics. We find that, once we have identified the best procedures, a very simple approach is surprisingly successful and robust over a range of psychologically relevant evaluation measures.
引用
收藏
页码:510 / 526
页数:17
相关论文
共 50 条
  • [31] Inference Methods for CRFs with Co-occurrence Statistics
    Ladicky, L'ubor
    Russell, Chris
    Kohli, Pushmeet
    Torr, Philip H. S.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2013, 103 (02) : 213 - 225
  • [32] Co-occurrence Networks for Word Sense Induction
    Humonen, Innokentiy S.
    Makarov, Ilya
    2023 IEEE 21ST WORLD SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS, SAMI, 2023, : 97 - 102
  • [33] Word co-occurrence features for text classification
    Figueiredo, Fabio
    Rocha, Leonardo
    Couto, Thierson
    Salles, Thiago
    Goncalves, Marcos Andre
    Meira, Wagner, Jr.
    INFORMATION SYSTEMS, 2011, 36 (05) : 843 - 858
  • [34] Conceptual grouping in word co-occurrence networks
    Veling, A
    van der Weerd, P
    IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, 1999, : 694 - 699
  • [35] The structure of word co-occurrence network for microblogs
    Garg, Muskan
    Kumar, Mukesh
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2018, 512 : 698 - 720
  • [36] Constraining Weighted Word Co-occurrence Frequencies in Word Embeddings
    Lauren, Paula
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5193 - 5198
  • [37] Extracting Topics with SimultaneousWord Co-occurrence and Semantic Correlation Graphs: Neural Topic Modeling for Short Texts
    Wang, Yiming
    Li, Ximing
    Zhou, Xiaotang
    Ouyang, Jihong
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 18 - 27
  • [38] Key word extraction from a document using word co-occurrence statistical information
    Matsuo, Yutaka
    Ishizuka, Mitsuru
    Transactions of the Japanese Society for Artificial Intelligence, 2002, 17 (03) : 217 - 223
  • [39] A comparison of methods for extracting information from the co-occurrence matrix for subcellular classification
    Nanni, Loris
    Brahnam, Sheryl
    Ghidoni, Stefano
    Menegatti, Emanuele
    Barrier, Tonya
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (18) : 7457 - 7467
  • [40] Distributed representations of diseases based on co-occurrence relationship
    Wang, Haoqing
    Mai, Huiyu
    Deng, Zhi-hong
    Yang, Chao
    Zhang, Luxia
    Wang, Huai-yu
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 183