Information Fusion (Ensemble System) In Data Warehousing & Data Mining

Cited by: 0

Authors:

Ganatra, Amit P. [1, 2]

Affiliations:

[1] Charotar Univ Sci & Technol Changa, Chandubhai S Patel Inst Technol, FTE, Changa, Gujarat, India

[2] Charotar Univ Sci & Technol Changa, Chandubhai S Patel Inst Technol, CE, Changa, Gujarat, India

Source:

2015 5TH INTERNATIONAL CONFERENCE ON IT CONVERGENCE AND SECURITY (ICITCS) | 2015

Keywords:

Information Fusion; Ensemble System; AdaBoost; Multiple Classifiers; Data Mining;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods];

Subject classification code:

081202;

Abstract:

Machine-based systems cannot keep data organized and up to date unless the acquired data is planned, scheduled, and managed appropriately. Today's datasets start as small chunks of information and grow exponentially over time. Once a dataset becomes extremely large, it is difficult to make decisions and to predict consistently and correctly from it. Most predictions do not hold true if proper balancing and diversification with respect to certain conditions and parameters is not done. This situation has focused public attention on making and combining predictions from the available data, i.e., analyzing current (and past) data to make predictions while increasing the predictive accuracy of the overall system. With these considerations in mind, there is a need for a better concept (component) for Information Fusion, backed by a solid theoretical foundation. AdaBoost could be very useful with feature selection, especially considering that it has a solid theoretical foundation. Here, Genetic Algorithms are used to select relevant features from large datasets, along with evaluation techniques. This can be further enhanced by using a multiclassifier approach. The central objective is to develop a system that provides a performance improvement of at least approximately 3-5% over similar existing techniques.
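For readers unfamiliar with AdaBoost, the ensemble idea named in the abstract can be illustrated with a minimal sketch. This is not the paper's system: the toy dataset, the decision-stump weak learner, and all function names (`train_stump`, `stump_predict`, `adaboost`, `predict`) are assumptions introduced here for illustration only. Each round picks the single feature and threshold with the lowest weighted error, which is one simple way boosting can act as a feature selector.

```python
import math

def stump_predict(row, j, thr, polarity):
    # Predict +1 when polarity * (x_j - thr) >= 0, otherwise -1.
    return 1 if polarity * (row[j] - thr) >= 0 else -1

def train_stump(X, y, w):
    # Exhaustively search features, thresholds, and polarities for the
    # stump with the lowest weighted error; returns (err, j, thr, polarity).
    best = None
    for j in range(len(X[0])):
        for thr in sorted({row[j] for row in X}):
            for polarity in (1, -1):
                err = sum(wi for row, yi, wi in zip(X, y, w)
                          if stump_predict(row, j, thr, polarity) != yi)
                if best is None or err < best[0]:
                    best = (err, j, thr, polarity)
    return best

def adaboost(X, y, rounds):
    n = len(X)
    w = [1.0 / n] * n                      # start with uniform sample weights
    ensemble = []
    for _ in range(rounds):
        err, j, thr, pol = train_stump(X, y, w)
        err = min(max(err, 1e-10), 1 - 1e-10)   # guard against log(0)
        alpha = 0.5 * math.log((1 - err) / err)  # vote of this weak learner
        ensemble.append((alpha, j, thr, pol))
        # Raise the weight of misclassified samples, lower the rest.
        w = [wi * math.exp(-alpha * yi * stump_predict(row, j, thr, pol))
             for row, yi, wi in zip(X, y, w)]
        total = sum(w)
        w = [wi / total for wi in w]
    return ensemble

def predict(ensemble, row):
    # Weighted majority vote of all stumps.
    score = sum(a * stump_predict(row, j, thr, pol) for a, j, thr, pol in ensemble)
    return 1 if score >= 0 else -1

# Hypothetical toy data: one feature, labels that no single stump can separate.
X = [[i] for i in range(10)]
y = [1, 1, 1, -1, -1, -1, 1, 1, 1, -1]

ensemble = adaboost(X, y, rounds=3)
predictions = [predict(ensemble, row) for row in X]
```

On this toy set no single stump classifies every point correctly, while the weighted vote of three stumps does, which is the "combining predictions" effect the abstract refers to. A real system would use many features and held-out data for evaluation.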

Pages: 6
