A method of extracting related words using standardized mutual information

被引:0
|
作者
Sugimachi, T [1 ]
Ishino, A [1 ]
Takeda, M [1 ]
Matsuo, F [1 ]
机构
[1] Kyushu Univ, Dept Informat, Higashi Ku, Fukuoka 8128581, Japan
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Techniques of automatic extraction of related words are of great importance in many applications such as query expansion and automatic thesaurus construction. In this paper, a method of extracting related words is proposed basing on the statistical information about the co-occurrences of words from huge corpora. The mutual information is one of such statistical measures and has been used for application mainly in natural language processing. A drawback is, however, the mutual information depends mainly on frequencies of words. To overcome this difficulty, we propose as a new measure a normalize deviation of mutual information. We also reveal a correspondence between word ambiguity and related words using word relation graphs constructed using this measure.
引用
收藏
页码:478 / 485
页数:8
相关论文
共 50 条
  • [21] An Extracting Method of Symmetry Plane from Head CT images for Surgery Based on OBB and Image Mutual Information
    Tan, Wenjun
    Kang, Ying
    Dong, Zhiwei
    Yang, Jinzhu
    Su, Ying
    Zhang, Li
    Xu, Lisheng
    Zhao, Dazhe
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 1556 - 1563
  • [22] Extracting characteristic words of text using neural networks
    Saito, K
    Nakano, R
    2004 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, PROCEEDINGS, 2004, : 1397 - 1402
  • [23] Quadrature-based image registration method using mutual information
    Fookes, C
    Maeder, A
    2004 2ND IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING: MACRO TO NANO, VOLS 1 AND 2, 2004, : 728 - 731
  • [24] Novel Feature Selection Method using Mutual Information and Fractal Dimension
    Pham, D. T.
    Packianather, M. S.
    Garcia, M. S.
    Castellani, M.
    IECON: 2009 35TH ANNUAL CONFERENCE OF IEEE INDUSTRIAL ELECTRONICS, VOLS 1-6, 2009, : 3217 - +
  • [25] MULTI-PHASE LIVER LESIONS CLASSIFICATION USING RELEVANT VISUAL WORDS BASED ON MUTUAL INFORMATION
    Diamant, Idit
    Goldberger, Jacob
    Klang, Eyal
    Amitai, Michal
    Greenspan, Hayit
    2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI), 2015, : 407 - 410
  • [26] Extracting city information in TM image using mixed decision tree method
    Wang, PJ
    Zhu, QJ
    Xie, DH
    Li, L
    IGARSS 2005: IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-8, PROCEEDINGS, 2005, : 3963 - 3966
  • [27] Coverless Text Information Hiding Method Using the Frequent Words Distance
    Zhang, Jianjun
    Xie, Yicheng
    Wang, Lucai
    Lin, Haijun
    CLOUD COMPUTING AND SECURITY, PT I, 2017, 10602
  • [28] Sentiment analysis of vegan related tweets using mutual information for feature selection
    Shamoi, Elvina
    Turdybay, Akniyet
    Shamoi, Pakizar
    Akhmetov, Iskander
    Jaxylykova, Assel
    Pak, Alexandr
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [29] Visual words assignment on a graph via minimal mutual information loss
    Deng, Yue
    Qian, Yanjun
    Li, Yipeng
    Dai, Qionghai
    Er, Guihua
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
  • [30] A Hybrid Method for Extracting Deep Web Information
    Zhang, Yuanpeng
    Wang, Li
    Jiang, Kui
    Qian, Danmin
    Dong, Jiancheng
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 777 - 782