A hybrid neural network for input that is both categorical and quantitative

被引:15
|
作者
Brouwer, RK [1 ]
机构
[1] Univ Coll Cariboo, Dept Comp Sci, Kamloops, BC V2C 5N3, Canada
关键词
D O I
10.1002/int.20032
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The data on which a MLP (multilayer perceptron) is normally trained to approximate a continuous function may include inputs that are categorical in addition to the numeric or quantitative inputs. Examples of categorical variables are gender, race, and so on. An approach examined in this article is to train a hybrid network consisting of a MLP and an encoder with multiple output units; that is, a separate output unit for each of the various combinations of values of the categorical variables. Input to the feed forward subnetwork of the hybrid network is then restricted to truly numerical quantities. A MLP with connection matrices that multiply input values and sigmoid functions that further transform values represents a continuous mapping in all input variables. A MLP therefore requires that all inputs correspond to numeric, continuously valued variables and represents a continuous function in all input variables. A categorical variable, on the other hand, produces a discontinuous relationship between an input variable and the output. The way that this problem is often dealt with is to replace the categorical values by numeric ones and treat them as if they were continuously valued. However there is no meaningful correspondence between the continuous quantities generated this way and the original categorical values. The basic difficulty with using these variables is that they define a metric for the categories that may not be reasonable. This suggests that the categorical inputs should be segregated from the continuous inputs as explained above. Results show that the method utilizing a hybrid network and separating numerical from quantitative input, as discussed here, is quite effective. (C) 2004 Wiley Periodicals, Inc.
引用
收藏
页码:979 / 1001
页数:23
相关论文
共 50 条
  • [1] A feed-forward network for input that is both categorical and quantitative
    Brouwer, RK
    NEURAL NETWORKS, 2002, 15 (07) : 881 - 890
  • [2] A hybrid neural network with fuzzy rules for categorical and numeric input
    Brouwer, RK
    NAFIPS 2004: ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY, VOLS 1AND 2: FUZZY SETS IN THE HEART OF THE CANADIAN ROCKIES, 2004, : 319 - 324
  • [3] Rule extraction from a neural network with segregated numeric and categorical input
    Roelof, RK
    PROCEEDINGS OF THE EIGHTH IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, 2004, : 386 - 391
  • [4] Neural network models of categorical perception
    R. I. Damper
    S. R. Harnad
    Perception & Psychophysics, 2000, 62 : 843 - 867
  • [5] Neural network models of categorical perception
    Damper, RI
    Harnad, SR
    PERCEPTION & PSYCHOPHYSICS, 2000, 62 (04): : 843 - 867
  • [6] From categorical semantics to neural network design
    Healy, MJ
    Caudell, TP
    Xiao, YH
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 1981 - 1986
  • [7] A hybrid input-type recurrent neural network for LVCSR language modeling
    Vataya Chunwijitra
    Ananlada Chotimongkol
    Chai Wutiwiwatchai
    EURASIP Journal on Audio, Speech, and Music Processing, 2016
  • [8] A Hybrid Genetic Algorithm for Climate Input Features and Neural Network Parameters Selection
    Haidar, Ali
    Verma, Brijesh
    PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCO'17 COMPANION), 2017, : 281 - 282
  • [9] A hybrid input-type recurrent neural network for LVCSR language modeling
    Chunwijitra, Vataya
    Chotimongkol, Ananlada
    Wutiwiwatchai, Chai
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2016,
  • [10] Recurrent neural network with both side input context dependence for text-to-phoneme mapping
    Bilcu, EB
    Astola, J
    Saarinen, J
    ISCCSP : 2004 FIRST INTERNATIONAL SYMPOSIUM ON CONTROL, COMMUNICATIONS AND SIGNAL PROCESSING, 2004, : 599 - 602