Modeling uncertain data using Monte Carlo integration method for clustering

被引:37
|
作者
Sharma, Krishna Kumar [1 ]
Seal, Ayan [1 ]
机构
[1] PDPM Indian Inst Informat Technol Design & Mfg, Jabalpur 482005, Madhya Pradesh, India
关键词
Uncertain data modeling; Monte Carlo integration; Kullback-Leibler divergence; Jeffreys divergence; Clustering analysis; DIVERGENCE ESTIMATION; SELECTION;
D O I
10.1016/j.eswa.2019.06.050
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, data clustering is an important task to the mining research community since the availability of uncertain data is increasing rapidly in many applications such as weather forecasting, business information management systems. In this work, proposed Monte Carlo integration based uncertain objects modeling technique is compared with three state-of-the-art methods namely, kernel density estimation, Dempster-Shafer, and Monte Carlo simulation. Then Kullback-Leibler and Jeffrey divergences are used to measure the similarity between uncertain objects and merge them with modified DBSCAN and k-medoids clustering algorithms. A heuristic algorithm is proposed to find the optimum radius, which is one of the inputs of DBSCAN. All the experiments are performed on one synthesized dataset and three real datasets namely, weather data, Japanese vowels and activity of daily living data. Five performance measures namely, accuracy, precision, recall, F-score, and Jaccard index are considered for comparing proposed method with state-of-the-art methods. Two non-parametric tests namely, Wilcoxon rank sum and sign test are also conducted. These results denote the effectiveness and efficiency of the proposed method over state-of-the-art methods. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:100 / 116
页数:17
相关论文
共 50 条
  • [31] The effectiveness of importance modeling in the Monte Carlo method
    Mikhailov, GA
    Medvedev, IN
    DOKLADY MATHEMATICS, 2005, 71 (02) : 172 - 176
  • [32] Modeling leaching of viruses by the Monte Carlo method
    Faulkner, BR
    Lyon, WG
    Khan, FA
    Chattopadhyay, S
    WATER RESEARCH, 2003, 37 (19) : 4719 - 4729
  • [33] A MONTE-CARLO METHOD FOR HIGH DIMENSIONAL INTEGRATION
    OGATA, Y
    NUMERISCHE MATHEMATIK, 1989, 55 (02) : 137 - 157
  • [34] A MONTE CARLO METHOD FOR INTEGRATION OF MULTIVARIATE SMOOTH FUNCTIONS
    Ullrich, Mario
    SIAM JOURNAL ON NUMERICAL ANALYSIS, 2017, 55 (03) : 1188 - 1200
  • [35] Pairwise clustering using a Monte Carlo Markov Chain
    Stosic, Borko D.
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2009, 388 (12) : 2373 - 2382
  • [36] Discrete range clustering using Monte Carlo methods
    Chatterji, GB
    Sridhar, B
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 1996, 26 (06): : 832 - 837
  • [37] Data Modeling for Renewable Resources and Smart Home using Monte Carlo Simulations
    Naghibi, Bahman
    2019 IEEE INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING AND 2019 IEEE INDUSTRIAL AND COMMERCIAL POWER SYSTEMS EUROPE (EEEIC / I&CPS EUROPE), 2019,
  • [38] Method of Monte Carlo grid for data analysis
    Filipowicz, M
    Bystritsky, VM
    Knowles, PE
    Mulhauser, F
    Wozniak, J
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2005, 547 (2-3): : 652 - 662
  • [39] A Monte Carlo Method to Data Stream Analysis
    Kerdprasop, Kittisak
    Kerdprasop, Nittaya
    Sattayatham, Pairote
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 14, 2006, 14 : 240 - +
  • [40] Modeling and sensitivity analysis of the building energy consumption using the Monte Carlo method
    Castillo, Jessica N.
    Resabala, Verny F.
    Freire, Luigi O.
    Corrales, Byron P.
    ENERGY REPORTS, 2022, 8 : 518 - 524