Information Fusion (Ensemble System) In Data Warehousing & Data Mining

被引:0
|
作者
Ganatra, Amit P. [1 ,2 ]
机构
[1] Charotar Univ Sci & Technol Changa, Chandubhai S Patel Inst Technol, FTE, Changa, Gujarat, India
[2] Charotar Univ Sci & Technol Changa, Chandubhai S Patel Inst Technol, CE, Changa, Gujarat, India
关键词
Information Fusion; Ensemble System; AdaBoost; Multiple Classifiers; Data Mining;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Machine based systems can't keep up with the task of organizing the data in an up-to-date manner unless and until the data acquired is being planned or scheduled and managed in an appropriate manner. Today's datasets start as small chunk of information and grow exponentially over a period of time. Once the size is extremely large it becomes difficult to make decisions and to predict consistently and correctly from the datasets. Most of predictions do not hold true, if proper balancing and diversification in terms of certain conditions and parameters is not done. The present state has focused public attention in terms of making and combining the predictions from the available data i.e. analyzing the current (and past) data to make predictions with increasing predictive accuracy of the overall system. So, keeping these considerations in mind there is a need for the better concept (component) for Information Fusion to combine it with a solid theory in support and foundation. AdaBoost could be very useful with feature selection, especially when considering that it has solid theoretical foundation. Here, Genetic Algorithms are being used to select relevant features from large datasets along with Evaluation techniques. This can further be enhanced by using multiclassifier approach. The central objective is to develop the system that provides approximately 3-5% performance improvement at least over similar existing techniques.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Introduction to the minitrack: Databases, data warehousing, and data mining in health care
    Information Systems and Decision Sciences, College of Business Administration, University of South Florida, Tampa
    FL, United States
    不详
    Proceedings of the Annual Hawaii International Conference on System Sciences, 2000, 2000-January
  • [42] List representation applied to sparse datacubes for data warehousing and data mining
    Wang, F
    Marir, F
    Gordon, J
    Helian, N
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING, 2003, 2690 : 871 - 875
  • [43] Multiagent data warehousing and multiagent data mining for cerebrum/cerebellum modeling
    Zhang, WR
    DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS AND TECHNOLOGY IV, 2002, 4730 : 261 - 271
  • [44] I/O problems in preparing data for data warehousing and data mining, Part 1
    Kim, W
    JOURNAL OF OBJECT-ORIENTED PROGRAMMING, 1999, 11 (09): : 13 - +
  • [46] A data warehousing approach to multimedia information retrieval
    You, J
    Dillon, T
    Liu, J
    CISST'2000: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON IMAGING SCIENCE, SYSTEMS, AND TECHNOLOGY, VOLS I AND II, 2000, : 679 - 686
  • [47] A methodology for information quality assessment in data warehousing
    Su, Ying
    Jin, Zhanming
    2008 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, PROCEEDINGS, VOLS 1-13, 2008, : 5521 - +
  • [48] Warehousing and Analyzing Streaming Data Quality Information
    Olbrich, Sebastian
    Klein, Anja
    AMCIS 2010 PROCEEDINGS, 2010,
  • [49] Data model for warehousing historical Web information
    Cao, YY
    Lim, EP
    Ng, WK
    INFORMATION AND SOFTWARE TECHNOLOGY, 2003, 45 (06) : 315 - 334
  • [50] Multidimensional SME performance evaluation:: Upgrading to data warehousing & data mining techniques
    Delisle, S
    Dugré, M
    St-Pierre, J
    IKE '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE ENGNINEERING, 2004, : 371 - 377