Recurrent-Neural-Network-Based Boolean Factor Analysis and Its Application to Word Clustering

被引:21
|
作者
Frolov, Alexander A. [1 ]
Husek, Dusan [2 ]
Polyakov, Pavel Yu. [3 ]
机构
[1] Russian Acad Sci, Inst Higher Nervous Act & Neurophysiol, Moscow 119991, Russia
[2] Acad Sci Czech Republ, Inst Comp Sci, Prague 18207 8, Czech Republic
[3] Russian Acad Sci, Sci Res Inst Syst Studies, Moscow 117218, Russia
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2009年 / 20卷 / 07期
基金
俄罗斯基础研究基金会;
关键词
Associative memory; Boolean factor analysis; concepts search; Hopfield-like neural network; information retrieval; neural network application; neural network architecture; recurrent neural network; statistics; unsupervised learning;
D O I
10.1109/TNN.2009.2016090
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The objective of this paper is to introduce a neural-network-based algorithm for word clustering as an extension of the neural-network-based Boolean factor analysis algorithm (Frolov et al, 2007). It is shown that this extended algorithm supports even the more complex model of signals that are supposed to be related to textual documents. It is hypothesized that every topic in textual data is characterized by a set of words which coherently appear in documents dedicated to a given topic. The appearance of each word in a document is coded by the activity of a particular neuron. In accordance with the Hebbian learning rule implemented in the network, sets of coherently appearing words (treated as factors) create tightly connected groups of neurons, hence, revealing them as attractors of the network dynamics. The found factors are eliminated from the network memory by the Hebbian unlearning rule facilitating the search of other factors. Topics related to the found sets of words can be identified based on the words' semantics. To make the method complete, a special technique based on a Bayesian procedure has been developed for the following purposes: first, to provide a complete description of factors in terms of component probability, and second, to enhance the accuracy of classification of signals to determine whether it contains the factor. Since it is assumed that every word may possibly contribute to several topics, the proposed method might be related to the method of fuzzy clustering. In this paper, we show that the results of Boolean factor analysis and fuzzy clustering are not contradictory, but complementary. To demonstrate the capabilities of this attempt, the method is applied to two types of textual data on neural networks in two different languages. The obtained topics and corresponding words are at a good level of agreement despite the fact that identical topics in Russian and English conferences contain different sets of keywords.
引用
收藏
页码:1073 / 1086
页数:14
相关论文
共 50 条
  • [41] Recurrent-neural-network-based identification of a cascade hydraulic actuator for closed-loop automotive power transmission control
    You, Seung-Han
    Hahn, Jin-Oh
    JOURNAL OF MECHANICAL SCIENCE AND TECHNOLOGY, 2012, 26 (05) : 1599 - 1606
  • [42] Recurrent-Neural-Network-Based Multivariable Adaptive Control for a Class of Nonlinear Dynamic Systems With Time-Varying Delay
    Hwang, Chih-Lyang
    Jan, Chau
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (02) : 388 - 401
  • [43] Lattice neural network model and its application to handwritten word recognition
    Yamanaka, K
    Kuroyanagi, S
    Sakai, H
    Iwata, A
    PROGRESS IN CONNECTIONIST-BASED INFORMATION SYSTEMS, VOLS 1 AND 2, 1998, : 1219 - 1222
  • [44] A generalized regression neural network based on fuzzy means clustering and its application in system identification
    Zhao, Shi-jun
    Zhang, Jin-lei
    Li, Xun
    Song, Wei
    2007 INTERNATIONAL SYMPOSIUM ON INFORMATION TECHNOLOGY CONVERGENCE, PROCEEDINGS, 2007, : 13 - +
  • [45] Neural Network-Based Analysis and Its Application to Spectroscopy for Mango
    Zhang, Zicheng
    Wang, Tianshuo
    Fan, Hanhan
    APPLIED SCIENCES-BASEL, 2024, 14 (06):
  • [46] Recurrent-neural-network-based identification of a cascade hydraulic actuator for closed-loop automotive power transmission control
    Seung-Han You
    Jin-Oh Hahn
    Journal of Mechanical Science and Technology, 2012, 26 : 1599 - 1606
  • [47] The structure optimized fuzzy clustering neural network model and its application
    Zou, Kaiqi
    Hu, Juan
    Kong, Xiaoyan
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2008, 4 (07): : 1627 - 1634
  • [48] Enhancing recurrent neural network-based language models by word tokenization
    Noaman, Hatem M.
    Sarhan, Shahenda S.
    Rashwan, Mohsen. A. A.
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2018, 8
  • [49] Recurrent Neural Network based Text Summarization Techniques by Word Sequence Generation
    Shini, R. Subha
    Kumar, V. D. Ambeth
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2021), 2021, : 1224 - 1229
  • [50] DISCRIMINATIVE ACOUSTIC WORD EMBEDDINGS: RECURRENT NEURAL NETWORK-BASED APPROACHES
    Settle, Shane
    Livescu, Karen
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 503 - 510