A method of extracting related words using standardized mutual information

被引:0
|
作者
Sugimachi, T [1 ]
Ishino, A [1 ]
Takeda, M [1 ]
Matsuo, F [1 ]
机构
[1] Kyushu Univ, Dept Informat, Higashi Ku, Fukuoka 8128581, Japan
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Techniques of automatic extraction of related words are of great importance in many applications such as query expansion and automatic thesaurus construction. In this paper, a method of extracting related words is proposed basing on the statistical information about the co-occurrences of words from huge corpora. The mutual information is one of such statistical measures and has been used for application mainly in natural language processing. A drawback is, however, the mutual information depends mainly on frequencies of words. To overcome this difficulty, we propose as a new measure a normalize deviation of mutual information. We also reveal a correspondence between word ambiguity and related words using word relation graphs constructed using this measure.
引用
收藏
页码:478 / 485
页数:8
相关论文
共 50 条
  • [1] Extracting New Words with Mutual Information and Logistic Regression
    Chen X.
    Han C.
    An Y.
    Liu L.
    Li Z.
    Yang R.
    Data Analysis and Knowledge Discovery, 2019, 3 (08) : 105 - 113
  • [2] The Method for Extracting New Login Sentiment Words From Chinese Micro-Blog Based on Improved Mutual Information
    Zhu, Guangli
    Liu, Wenting
    Zhang, Shunxiang
    Chen, Xiang
    Yin, Chang
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2020, 35 (03): : 223 - 232
  • [3] Extracting Tweets related to Disaster Information by using Multiple Co-occurrence Relation of Words
    Yuzawa, Akio
    Ichikawa, Hiroyoshi
    Kobayashi, Aki
    2018 IEEE INTERNATIONAL CONFERENCE ON SMART COMPUTING (SMARTCOMP 2018), 2018, : 321 - 326
  • [4] Extracting Information from Negative Interactions in Multiplex Networks Using Mutual Information
    Hajibagheri, Alireza
    Sukthankar, Gita
    Lakkaraju, Kiran
    SOCIAL, CULTURAL, AND BEHAVIORAL MODELING, 2017, 10354 : 322 - 328
  • [5] The recognition method of unknown chinese words in fragments based on mutual information
    Zhu Q.
    Cheng X.-Y.
    Gao Z.-J.
    Journal of Convergence Information Technology, 2010, 5 (03) : 68 - 72
  • [6] Extracting the mutual information for a triple of binary strings
    Romashchenko, A
    18TH IEEE ANNUAL CONFERENCE ON COMPUTATIONAL COMPLEXITY, PROCEEDINGS, 2003, : 221 - 229
  • [7] A method based on the related information of neighborhood for extracting coastal line
    Zhang, YJ
    Yan, DM
    Chen, F
    IGARSS 2005: IEEE International Geoscience and Remote Sensing Symposium, Vols 1-8, Proceedings, 2005, : 1640 - 1643
  • [8] An iterative method for extracting Chinese unknown words
    He, S
    Zhu, J
    CHINESE JOURNAL OF ELECTRONICS, 2001, 10 (04): : 461 - 464
  • [9] An improved feature transformation method using mutual information
    Bassir, Seyed
    Akbari, Ahmad
    Nassersharif, Babak
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (02) : 107 - 115
  • [10] Collocation Extraction Method Using Mutual Information Contents
    Fukumura, Iori
    Shin, Sanggyu
    Proceedings - 2022 12th International Congress on Advanced Applied Informatics, IIAI-AAI 2022, 2022, : 663 - 664