TOP-10 DATA MINING CASE STUDIES

Cited by: 4
Authors
Melli, Gabor [1]
Wu, Xindong [2]
Beinat, Paul [3]
Bonchi, Francesco [4]
Cao, Longbing [5]
Duan, Rong [6]
Faloutsos, Christos [7]
Ghani, Rayid [8]
Kitts, Brendan [9]
Goethals, Bart [10]
McLachlan, Geoff [11]
Pei, Jian [12]
Srivastava, Ashok [13]
Zaiane, Osmar [14]
Affiliations
[1] PredictionWorks Inc, Seattle, WA 98126 USA
[2] Univ Vermont, Dept Comp Sci, Burlington, VT 05405 USA
[3] NeuronWorks Int, Hurstville, NSW 2220, Australia
[4] Yahoo Res, Barcelona, Spain
[5] Univ Technol Sydney, Sydney, NSW 2007, Australia
[6] AT&T Labs, Florham Pk, NJ USA
[7] Carnegie Mellon Univ, Dept Comp Sci, Pittsburgh, PA 15213 USA
[8] Accenture Technol Labs, Chicago, IL 60601 USA
[9] Lucid Commerce, Seattle, WA 98104 USA
[10] Univ Antwerp, Dept Math & Comp Sci, Antwerp, Belgium
[11] Univ Queensland, Dept Math, Brisbane, Qld 4072, Australia
[12] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada
[13] NASA, Washington, DC USA
[14] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada
Funding
National Science Foundation (USA)
Keywords
Data mining; cost-benefit analysis; case study
DOI
10.1142/S021962201240007X
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We report on the panel discussion held at the ICDM'10 conference on the top 10 data mining case studies in order to provide a snapshot of where and how data mining techniques have made significant real-world impact. The tasks covered by the 10 case studies range from the detection of anomalies such as cancer, fraud, and system failures to the optimization of organizational operations, and include the automated extraction of information from unstructured sources. From the 10 cases we find that supervised methods prevail while unsupervised techniques play a supporting role. Further, significant domain knowledge is generally required to achieve a completed solution. Finally, we find that successful applications are more commonly associated with continual improvement than with single "aha moments" of knowledge ("nugget") discovery.
Pages: 389-400
Page count: 12