Ensemble modelling or selecting the best model: Many could be better than one

Cited by: 24
Authors
Barai, SV
Reich, Y
Affiliations
[1] Tel Aviv Univ, Fac Engn, Dept Solid Mech Mat & Struct, IL-69978 Tel Aviv, Israel
[2] Indian Inst Technol, Dept Civil Engn, Kharagpur 721302, W Bengal, India
Keywords
ensemble; machine learning; neural networks; data modelling; stacked generalization; model selection
DOI
10.1017/S0890060499135029
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In the course of data modelling, many models could be created. Much work has been done on formulating guidelines for model selection. However, by and large, these guidelines are conservative or too specific. Instead of using general guidelines, models could be selected for a particular task based on statistical tests. When selecting one model, others are discarded. Instead of losing potential sources of information, models could be combined to yield better performance. We review the basics of model selection and combination and discuss their differences. Two examples of opportunistic and principled combinations are presented. The first demonstrates that mediocre quality models could be combined to yield significantly better performance. The latter is the main contribution of the paper; it describes and illustrates a novel heuristic approach called the SG(k-NN) ensemble for the generation of good-quality and diverse models that can even improve excellent quality models.
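The abstract's claim that mediocre models can be combined into a better one can be illustrated with a minimal sketch. This is not the paper's SG(k-NN) method or its data: it assumes plain unweighted averaging, a hypothetical linear target function, and synthetic "models" whose errors are independent Gaussian noise. All names (`make_mediocre_model`, `run_demo`) are invented for illustration.

```python
import random


def make_mediocre_model(true_fn, noise_sd, rng):
    """Return a hypothetical 'mediocre' model: the true function plus independent noise."""
    def model(x):
        return true_fn(x) + rng.gauss(0.0, noise_sd)
    return model


def mean_squared_error(predictions, targets):
    """Average squared difference between predictions and targets."""
    return sum((p - t) ** 2 for p, t in zip(predictions, targets)) / len(targets)


def run_demo(n_models=5, n_points=500, noise_sd=1.0, seed=0):
    """Compare each model's error with the error of their unweighted average."""
    rng = random.Random(seed)

    # Assumed target function, chosen only for illustration.
    def true_fn(x):
        return 2.0 * x + 1.0

    xs = [rng.uniform(-1.0, 1.0) for _ in range(n_points)]
    targets = [true_fn(x) for x in xs]

    models = [make_mediocre_model(true_fn, noise_sd, rng) for _ in range(n_models)]
    per_model_preds = [[m(x) for x in xs] for m in models]
    individual_mse = [mean_squared_error(p, targets) for p in per_model_preds]

    # Simple averaging ensemble: the mean of all model outputs at each point.
    ensemble_preds = [sum(column) / n_models for column in zip(*per_model_preds)]
    ensemble_mse = mean_squared_error(ensemble_preds, targets)
    return individual_mse, ensemble_mse


if __name__ == "__main__":
    individual, ensemble = run_demo()
    print("individual MSEs:", [round(v, 3) for v in individual])
    print("ensemble MSE:   ", round(ensemble, 3))
```

Averaging helps here only because the simulated errors are independent; models that make the same mistakes gain little from being combined, which is why the abstract stresses generating models that are both good and diverse.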
Pages: 377-386
Number of pages: 10