Data-driven sensitivity analysis of complex machine learning models: A case study of directional drilling

被引:31
|
作者
Tunkiel, Andrzej T. [1 ]
Sui, Dan [1 ]
Wiktorski, Tomasz [2 ]
机构
[1] Univ Stavanger, Fac Sci & Technol, Dept Energy & Petr Engn, 8600 Forus, N-4036 Stavanger, Norway
[2] Univ Stavanger, Fac Sci & Technol, Dept Elect Engn & Comp Sci, 8600 Forus, N-4036 Stavanger, Norway
关键词
Sensitivity analysis; Partial derivative; Directional drilling; Volve dataset; NEURAL-NETWORKS; MATHEMATICAL-MODELS; TRANSPORT; INDEXES;
D O I
10.1016/j.petrol.2020.107630
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Classical sensitivity analysis of machine learning regression models is a topic sparse in literature. Most of data-driven models are complex black boxes with limited potential of extracting mathematical understanding of underlying model self-arranged through the training algorithm. Sensitivity analysis can uncover erratic behavior stemming from overfitting or insufficient size of the training dataset. It can also guide model evaluation and application. In this paper, our work on data-driven sensitivity analysis of complex machine learning models is presented. Rooted in one-at-a-time method it utilizes training, validation and testing datasets to cover the hyperspace of potential inputs. The method is highly scalable, it allows for sensitivity analysis of individual as well as groups of inputs. The method is not computationally expensive, scaling linearly both with the available data samples, and in relation to the quantity of inputs and outputs. Coupled with the fact that calculations are considered embarrassingly parallel, it makes the method attractive for big models. In the case study, a regression model to predict inclinations using recurrent neural network was employed to illustrate our proposed sensitivity analysis method and results.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Sensitivity Analysis of the Composite Data-Driven Pipelines in the Automated Machine Learning
    Barabanova, Irina, V
    Vychuzhanin, Pavel
    Nikitin, Nikolay O.
    10TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE IN COMPUTATIONAL SCIENCE (YSC2021), 2021, 193 : 484 - 493
  • [2] Data-Driven Traffic Accident Analysis and Prediction Using Machine Learning Models: A Case Study of Philadelphia City
    Lyu, Chengxuan
    SEVENTH INTERNATIONAL CONFERENCE ON TRAFFIC ENGINEERING AND TRANSPORTATION SYSTEM, ICTETS 2023, 2024, 13064
  • [3] Data-driven models in machine learning for crime prediction
    Wawrzyniak, Zbigniew M.
    Jankowski, Stanislaw
    Szczechla, Eliza
    Szymanski, Zbigniew
    Pytlak, Radoslaw
    Michalak, Pawel
    Borowik, Grzegorz
    2018 26TH INTERNATIONAL CONFERENCE ON SYSTEMS ENGINEERING (ICSENG 2018), 2018,
  • [4] Machine learning in project analytics: a data-driven framework and case study
    Uddin, Shahadat
    Ong, Stephen
    Lu, Haohui
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [5] Pure Data-Driven Machine Learning Challenges for pFMEA: A Case Study
    Mokhtarzadeh, Mahdi
    Rodriguez-Echeverria, Jorge
    Zeren, Zafer
    Van Noten, Johan
    Gautama, Sidharta
    IFAC PAPERSONLINE, 2024, 58 (19): : 658 - 663
  • [6] Machine learning in project analytics: a data-driven framework and case study
    Shahadat Uddin
    Stephen Ong
    Haohui Lu
    Scientific Reports, 12
  • [7] Trend and dynamic analysis on temporal drilling data and their data-driven models
    Sui, Dan
    Sahebi, Hamed
    GEOENERGY SCIENCE AND ENGINEERING, 2023, 223
  • [8] Probability models for data-Driven global sensitivity analysis
    Hu, Zhen
    Mahadevan, Sankaran
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2019, 187 : 40 - 57
  • [9] Data-Driven Computational Neuroscience: Machine Learning and Statistical Models
    Kreinovich, Vladik
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (01) : 2513 - 2514
  • [10] A Novel Data-Driven Attack Method on Machine Learning Models
    Sadikoglu, Emre
    Kosesoy, Irfan
    Gok, Murat
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2024, 30 (03) : 402 - 417