Evaluation of complex petroleum reservoirs based on data mining methods

被引：1

作者：

Fengqi Tan

Gang Luo

Duojun Wang

Yangkang Chen

机构：

[1] University of Chinese Academy of Sciences,College of Earth Science

[2] Chinese Academy of Sciences,Key Laboratory Computational Geodynamics

[3] University of Texas at Austin,Jackson School of Geosciences

来源：

Computational Geosciences | 2017年 / 21卷

关键词：

Data mining; Feature selection; Performance evaluation; Decision tree; Clustering analysis; Conglomerate reservoir;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In this study, we introduce the application of data mining to petroleum exploration and development to obtain high-performance predictive models and optimal classifications of geology, reservoirs, reservoir beds, and fluid properties. Data mining is a practical method for finding characteristics of, and inherent laws in massive multi-dimensional data. The data mining method is primarily composed of three loops, which are feature selection, model parameter optimization, and model performance evaluation. The method’s key techniques involve applying genetic algorithms to carry out feature selection and parameter optimization and using repeated cross-validation methods to obtain unbiased estimation of generalization accuracy. The optimal model is finally selected from the various algorithms tested. In this paper, the evaluation of water-flooded layers and the classification of conglomerate reservoirs in Karamay oil field are selected as case studies to analyze comprehensively two important functions in data mining, namely predictive modeling and cluster analysis. For the evaluation of water-flooded layers, six feature subset schemes and five distinct types of data mining methods (decision trees, artificial neural networks, support vector machines, Bayesian networks, and ensemble learning) are analyzed and compared. The results clearly demonstrate that decision trees are superior to the other methods in terms of predictive model accuracy and interpretability. Therefore, a decision tree-based model is selected as the final model for identifying water-flooded layers in the conglomerate reservoir. For the reservoir classification, the reservoir classification standards from four types of clustering algorithms, such as those based on division, level, model, and density, are comparatively analyzed. The results clearly indicate that the clustering derived from applying the standard K-means algorithm, which is based on division, provides the best fit to the geological characteristics of the actual reservoir and the greatest accuracy of reservoir classification. Moreover, the internal measurement parameters of this algorithm, such as compactness, efficiency, and resolution, are all better than those of the other three algorithms. Compared with traditional methods from exploration geophysics, the data mining method has obvious advantages in solving problems involving calculation of reservoir parameters and reservoir classification using different specialized field data. Hence, the effective application of data mining methods can provide better services for petroleum exploration and development.

引用

页码：151 / 165

页数：14

共 50 条

[21] A Framework for Simulation Studies Based on Complex Healthcare Utilization Data for Methods Evaluation
Myers, Jessica A.
Schneeweiss, Sebastian
Polinski, Jennifer
Rassen, Jeremy A.
PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2012, 21 : 121 - 121
[22] Ensemble-based big data analytics of lithofacies for automatic development of petroleum reservoirs
Tewari, Saurabh
Dwivedi, U. D.
COMPUTERS & INDUSTRIAL ENGINEERING, 2019, 128 : 937 - 947
[23] SEQUENTIAL NUMERICAL-METHODS FOR THE SIMULATION OF FLOW IN PETROLEUM RESERVOIRS
MARCHESIN, D
PAESLEME, PJ
PALMER, JC
MATEMATICA APLICADA E COMPUTACIONAL, 1983, 2 (01): : 47 - 74
[24] Data Mining and Analysis of NLP Methods in Students Evaluation of Teaching
Acosta-Ugalde, Diego
Conant-Pablos, Santiago Enrique
Camacho-Zuniga, Claudia
Gutierrez-Rodriguez, Andres Eduardo
ADVANCES IN SOFT COMPUTING, MICAI 2023, PT II, 2024, 14392 : 28 - 38
[25] Performance Evaluation of Methods for Mining Frequent Itemsets on Temporal Data
Tripathi, Tripti
Yadav, Divakar
SECOND INTERNATIONAL CONFERENCE ON COMPUTER NETWORKS AND COMMUNICATION TECHNOLOGIES, ICCNCT 2019, 2020, 44 : 910 - 917
[26] Effectiveness evaluation of data mining based IDS
Orfila, Agustin
Carbo, Javier
Ribagorda, Arturo
ADVANCES IN DATA MINING: APPLICATIONS IN MEDICINE, WEB MINING, MARKETING, IMAGE AND SIGNAL MINING, 2006, 4065 : 377 - 388
[27] Methods for data mining
不详
DATA MINING ON MULTIMEDIA DATA, 2002, 2558 : 23 - 89
[28] Optimizing metric access methods for querying and mining complex data types
de Souza, Jessica Andressa
Razente, Humberto Luiz
N Barioni, Maria Camila
Journal of the Brazilian Computer Society, 2014, 20 (01) : 1 - 14
[29] Petroleum Instrument Fault Analysis based on the Data Mining Fault Dictionary Method
Jia-huiqin
MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION IV, PTS 1 AND 2, 2012, 128-129 : 942 - 945
[30] A Parallel Data Mining Method based on Complex Network
He Yan-li
OPTICAL, ELECTRONIC MATERIALS AND APPLICATIONS, PTS 1-2, 2011, 216 : 752 - 756

← 1 2 3 4 5 →