Evaluation of complex petroleum reservoirs based on data mining methods

被引:1
|
作者
Fengqi Tan
Gang Luo
Duojun Wang
Yangkang Chen
机构
[1] University of Chinese Academy of Sciences,College of Earth Science
[2] Chinese Academy of Sciences,Key Laboratory Computational Geodynamics
[3] University of Texas at Austin,Jackson School of Geosciences
来源
Computational Geosciences | 2017年 / 21卷
关键词
Data mining; Feature selection; Performance evaluation; Decision tree; Clustering analysis; Conglomerate reservoir;
D O I
暂无
中图分类号
学科分类号
摘要
In this study, we introduce the application of data mining to petroleum exploration and development to obtain high-performance predictive models and optimal classifications of geology, reservoirs, reservoir beds, and fluid properties. Data mining is a practical method for finding characteristics of, and inherent laws in massive multi-dimensional data. The data mining method is primarily composed of three loops, which are feature selection, model parameter optimization, and model performance evaluation. The method’s key techniques involve applying genetic algorithms to carry out feature selection and parameter optimization and using repeated cross-validation methods to obtain unbiased estimation of generalization accuracy. The optimal model is finally selected from the various algorithms tested. In this paper, the evaluation of water-flooded layers and the classification of conglomerate reservoirs in Karamay oil field are selected as case studies to analyze comprehensively two important functions in data mining, namely predictive modeling and cluster analysis. For the evaluation of water-flooded layers, six feature subset schemes and five distinct types of data mining methods (decision trees, artificial neural networks, support vector machines, Bayesian networks, and ensemble learning) are analyzed and compared. The results clearly demonstrate that decision trees are superior to the other methods in terms of predictive model accuracy and interpretability. Therefore, a decision tree-based model is selected as the final model for identifying water-flooded layers in the conglomerate reservoir. For the reservoir classification, the reservoir classification standards from four types of clustering algorithms, such as those based on division, level, model, and density, are comparatively analyzed. The results clearly indicate that the clustering derived from applying the standard K-means algorithm, which is based on division, provides the best fit to the geological characteristics of the actual reservoir and the greatest accuracy of reservoir classification. Moreover, the internal measurement parameters of this algorithm, such as compactness, efficiency, and resolution, are all better than those of the other three algorithms. Compared with traditional methods from exploration geophysics, the data mining method has obvious advantages in solving problems involving calculation of reservoir parameters and reservoir classification using different specialized field data. Hence, the effective application of data mining methods can provide better services for petroleum exploration and development.
引用
收藏
页码:151 / 165
页数:14
相关论文
共 50 条
  • [21] A Framework for Simulation Studies Based on Complex Healthcare Utilization Data for Methods Evaluation
    Myers, Jessica A.
    Schneeweiss, Sebastian
    Polinski, Jennifer
    Rassen, Jeremy A.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2012, 21 : 121 - 121
  • [22] Ensemble-based big data analytics of lithofacies for automatic development of petroleum reservoirs
    Tewari, Saurabh
    Dwivedi, U. D.
    COMPUTERS & INDUSTRIAL ENGINEERING, 2019, 128 : 937 - 947
  • [23] SEQUENTIAL NUMERICAL-METHODS FOR THE SIMULATION OF FLOW IN PETROLEUM RESERVOIRS
    MARCHESIN, D
    PAESLEME, PJ
    PALMER, JC
    MATEMATICA APLICADA E COMPUTACIONAL, 1983, 2 (01): : 47 - 74
  • [24] Data Mining and Analysis of NLP Methods in Students Evaluation of Teaching
    Acosta-Ugalde, Diego
    Conant-Pablos, Santiago Enrique
    Camacho-Zuniga, Claudia
    Gutierrez-Rodriguez, Andres Eduardo
    ADVANCES IN SOFT COMPUTING, MICAI 2023, PT II, 2024, 14392 : 28 - 38
  • [25] Performance Evaluation of Methods for Mining Frequent Itemsets on Temporal Data
    Tripathi, Tripti
    Yadav, Divakar
    SECOND INTERNATIONAL CONFERENCE ON COMPUTER NETWORKS AND COMMUNICATION TECHNOLOGIES, ICCNCT 2019, 2020, 44 : 910 - 917
  • [26] Effectiveness evaluation of data mining based IDS
    Orfila, Agustin
    Carbo, Javier
    Ribagorda, Arturo
    ADVANCES IN DATA MINING: APPLICATIONS IN MEDICINE, WEB MINING, MARKETING, IMAGE AND SIGNAL MINING, 2006, 4065 : 377 - 388
  • [27] Methods for data mining
    不详
    DATA MINING ON MULTIMEDIA DATA, 2002, 2558 : 23 - 89
  • [28] Optimizing metric access methods for querying and mining complex data types
    de Souza, Jessica Andressa
    Razente, Humberto Luiz
    N Barioni, Maria Camila
    Journal of the Brazilian Computer Society, 2014, 20 (01) : 1 - 14
  • [29] Petroleum Instrument Fault Analysis based on the Data Mining Fault Dictionary Method
    Jia-huiqin
    MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION IV, PTS 1 AND 2, 2012, 128-129 : 942 - 945
  • [30] A Parallel Data Mining Method based on Complex Network
    He Yan-li
    OPTICAL, ELECTRONIC MATERIALS AND APPLICATIONS, PTS 1-2, 2011, 216 : 752 - 756