Using k-Way Co-Occurrences for Learning Word Embeddings

被引:0
|
作者
Bollegala, Danushka [1 ]
Yoshida, Yuichi [2 ]
Kawarabayashi, Ken-ichi [2 ,3 ]
机构
[1] Univ Liverpool, Liverpool L69 3BX, Merseyside, England
[2] Natl Inst Informat, Chiyoda Ku, 2-1-2 Hitotsubashi, Tokyo 1018430, Japan
[3] Japan Sci & Technol Agcy, ERATO, Kawarabayashi Large Graph Project, Kawaguchi, Saitama, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Co-occurrences between two words provide useful insights into the semantics of those words. Consequently, numerous prior work on word embedding learning has used co-occurrences between two words as the training signal for learning word embeddings. Flowever, in natural language texts it is common for multiple words to be related and cooccurring in the same context. We extend the notion of co-occurrences to cover k(>= 2)-way co-occurrences among a set of k-words. Specifically, we prove a theoretical relationship between the joint probability of k(>= 2) words, and the sum of l(2) norms of their embeddings. Next, we propose a learning objective motivated by our theoretical result that utilises k-way Co-occurrences for learning word embeddings. Our experimental results show that the derived theoretical relationship does indeed hold empirically, and despite data sparsity, for some smaller k(<= 5) values, k-way embeddings perform comparably or better than 2-way embeddings in a range of tasks.
引用
收藏
页码:5037 / 5044
页数:8
相关论文
共 50 条
  • [11] Statistical learning of distractor co-occurrences facilitates visual search
    Thorat, Sushrut
    Quek, Genevieve L.
    Peelen, Marius, V
    JOURNAL OF VISION, 2022, 22 (10):
  • [12] Plant Texture Classification Using Gabor Co-occurrences
    Cope, James S.
    Remagnino, Paolo
    Barman, Sarah
    Wilkin, Paul
    ADVANCES IN VISUAL COMPUTING, PT II, 2010, 6454 : 669 - +
  • [13] Using concept co-occurrences for a biomedical facts acquisition
    Minarro-Gimenez, Jose A.
    Costa, Catalina M.
    Schulz, Stefan
    E-HEALTH - FOR CONTINUITY OF CARE, 2014, 205 : 1200 - 1200
  • [14] Retrieving domain-specific collocations by co-occurrences and word order constraints
    Shimohata, S
    Sugio, T
    Nagata, J
    COMPUTATIONAL INTELLIGENCE, 1999, 15 (02) : 92 - 100
  • [15] Clustering of a Health Dataset Using Diagnosis Co-Occurrences
    Wartelle, Adrien
    Mourad-Chehade, Farah
    Yalaoui, Farouk
    Chrusciel, Jan
    Laplanche, David
    Sanchez, Stephane
    APPLIED SCIENCES-BASEL, 2021, 11 (05): : 1 - 20
  • [16] Holistic Context Modeling using Semantic Co-occurrences
    Rasiwasia, Nikhil
    Vasconcelos, Nuno
    CVPR: 2009 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-4, 2009, : 1889 - 1895
  • [17] Learning Visual Clothing Style with Heterogeneous Dyadic Co-occurrences
    Veit, Andreas
    Kovacs, Balazs
    Bell, Sean
    McAuley, Julian
    Bala, Kavita
    Belongie, Serge
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4642 - 4650
  • [18] Independency of Coding for Affective Similarities and for Word Co-occurrences in Temporal Perisylvian Neocortex
    Liuzzi, Antonietta Gabriella
    Meersmans, Karen
    Storms, Gerrit
    De Deyne, Simon
    Dupont, Patrick
    Vandenberghe, Rik
    NEUROBIOLOGY OF LANGUAGE, 2023, 4 (02): : 257 - 279
  • [19] Word co-occurrences on Webpages as a measure of the relatedness of organizations: A new Webometrics concept
    Vaughan, Liwen
    You, Justin
    JOURNAL OF INFORMETRICS, 2010, 4 (04) : 483 - 491
  • [20] Investigating Code Smell Co-occurrences using Association Rule Learning: A Replicated Study
    Palomba, Fabio
    Oliveto, Rocco
    De Lucia, Andrea
    2017 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING TECHNIQUES FOR SOFTWARE QUALITY EVALUATION (MALTESQUE), 2017, : 8 - 13