Computational aspects of data mining

被引:0
|
作者
Marginean, FA [1 ]
机构
[1] Univ York, Dept Comp Sci, York YO10 5DD, N Yorkshire, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The last decade has witnessed an impressive growth of Data Mining through algorithms and applications. Despite the advances, a computational theory of Data, Mining is still largely outstanding. This paper discusses some aspects relevant to computation in Data Mining from the point of view of the Machine Learning theoretician. Computational techniques used in other fields that deal with learning from data, such as Statistics and Machine Learning, are potentially very relevant. However, the specifics of Data Mining are such that most often those techniques axe not directly applicable but require to be re-cast and re-analysed within Data Mining starting from first principles. We illustrate this with a PAC-learnability an alysis for a Data Mining-like task. We show that accounting for Data Mining specific requirements, such as inference of weak predictors and agnosticity assumptions, requires the generalisation. of the classical PAC framework in novel ways.
引用
收藏
页码:614 / 622
页数:9
相关论文
共 50 条
  • [21] Data mining and machine learning in computational creativity
    Toivonen, Hannu
    Gross, Oskar
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2015, 5 (06) : 265 - 275
  • [22] EpiMINE, a computational program for mining epigenomic data
    SriGanesh Jammula
    Diego Pasini
    Epigenetics & Chromatin, 9
  • [23] Aspects for implementation of data mining in gerontology and geriatrics
    Michalski A.I.
    Advances in Gerontology, 2014, 4 (4) : 299 - 304
  • [24] Data mining aspects of a dam monitoring project
    Lehner, K
    Mittrup, I
    Hartmann, D
    Soft Computing as Transdisciplinary Science and Technology, 2005, : 745 - 754
  • [25] Ethical aspects of web log data mining
    Olson, David L.
    International Journal of Information Technology and Management, 2008, 7 (02) : 190 - 200
  • [26] Uncertainty modelling and computational aspects of data association
    Houssineau, Jeremie
    Zeng, Jiajie
    Jasra, Ajay
    STATISTICS AND COMPUTING, 2021, 31 (05)
  • [27] Computational aspects of data assimilation for aerosol dynamics
    Sandu, A
    Liao, W
    Carmichael, GR
    Henze, D
    Seinfeld, JH
    Chai, T
    Daescu, D
    COMPUTATIONAL SCIENCE - ICCS 2004, PT 3, PROCEEDINGS, 2004, 3038 : 709 - 716
  • [28] Uncertainty modelling and computational aspects of data association
    Jeremie Houssineau
    Jiajie Zeng
    Ajay Jasra
    Statistics and Computing, 2021, 31
  • [29] Data mining for materials: Computational experiments with AB compounds
    Saad, Yousef
    Gao, Da
    Thanh Ngo
    Bobbitt, Scotty
    Chelikowsky, James R.
    Andreoni, Wanda
    PHYSICAL REVIEW B, 2012, 85 (10)
  • [30] Machine learning, data mining, and computational statistics applications
    Wegman, Edward J.
    Said, Yasmin H.
    Scott, David W.
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2011, 3 (03) : 187 - 187