Data-driven sensitivity analysis of complex machine learning models: A case study of directional drilling

被引:31
|
作者
Tunkiel, Andrzej T. [1 ]
Sui, Dan [1 ]
Wiktorski, Tomasz [2 ]
机构
[1] Univ Stavanger, Fac Sci & Technol, Dept Energy & Petr Engn, 8600 Forus, N-4036 Stavanger, Norway
[2] Univ Stavanger, Fac Sci & Technol, Dept Elect Engn & Comp Sci, 8600 Forus, N-4036 Stavanger, Norway
关键词
Sensitivity analysis; Partial derivative; Directional drilling; Volve dataset; NEURAL-NETWORKS; MATHEMATICAL-MODELS; TRANSPORT; INDEXES;
D O I
10.1016/j.petrol.2020.107630
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Classical sensitivity analysis of machine learning regression models is a topic sparse in literature. Most of data-driven models are complex black boxes with limited potential of extracting mathematical understanding of underlying model self-arranged through the training algorithm. Sensitivity analysis can uncover erratic behavior stemming from overfitting or insufficient size of the training dataset. It can also guide model evaluation and application. In this paper, our work on data-driven sensitivity analysis of complex machine learning models is presented. Rooted in one-at-a-time method it utilizes training, validation and testing datasets to cover the hyperspace of potential inputs. The method is highly scalable, it allows for sensitivity analysis of individual as well as groups of inputs. The method is not computationally expensive, scaling linearly both with the available data samples, and in relation to the quantity of inputs and outputs. Coupled with the fact that calculations are considered embarrassingly parallel, it makes the method attractive for big models. In the case study, a regression model to predict inclinations using recurrent neural network was employed to illustrate our proposed sensitivity analysis method and results.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Efficient Data-Driven Machine Learning Models for Cardiovascular Diseases Risk Prediction
    Dritsas, Elias
    Trigka, Maria
    SENSORS, 2023, 23 (03)
  • [32] Machine learning and system identification for the estimation of data-driven models: an experimental case study illustrated on a tire-suspension system
    Elkafafy, M.
    Csurcsia, P. Z.
    Cornelis, B.
    Risaliti, E.
    Janssens, K.
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NOISE AND VIBRATION ENGINEERING (ISMA2020) / INTERNATIONAL CONFERENCE ON UNCERTAINTY IN STRUCTURAL DYNAMICS (USD2020), 2020, : 3287 - 3301
  • [33] Machine Learning and Data-Driven Approaches in Spatial Statistics: A Case Study of Housing Price Estimation
    Soleiman, Sarah
    Randon-Furling, Julien
    Cottrell, Marie
    ADVANCES IN SELF-ORGANIZING MAPS, LEARNING VECTOR QUANTIZATION, CLUSTERING AND DATA VISUALIZATION: DEDICATED TO THE MEMORY OF TEUVO KOHONEN, WSOM+ 2022, 2022, 533 : 31 - 40
  • [34] Data-driven decarbonization framework with machine learning
    Jain, Ayush
    Padmanaban, Manikandan
    Hazra, Jagabondhu
    Guruprasad, Ranjini
    Godbole, Shantanu
    Syam, Heriansyah
    ENVIRONMENTAL DATA SCIENCE, 2024, 3
  • [35] Data-driven analysis and machine learning for energy prediction in distributed photovoltaic generation plants: A case study in Queensland, Australia
    Ramos, Lucas
    Colnago, Marilaine
    Casaca, Wallace
    ENERGY REPORTS, 2022, 8 : 745 - 751
  • [36] Data-Driven Suitability Analysis to Enable Machine Learning Explainability and Security
    Wolf, Shaya
    Foster, Rita
    Haile, Jed
    Borowczak, Mike
    2021 RESILIENCE WEEK (RWS), 2021,
  • [37] Introduction to Focus Issue: Data-driven models and analysis of complex systems
    Martinez, Johann H.
    Lehnertz, Klaus
    Rubido, Nicolas
    CHAOS, 2025, 35 (03)
  • [38] Reliability analysis for data-driven noisy models using active learning
    Pires, Anderson, V
    Moustapha, Maliki
    Marelli, Stefano
    Sudret, Bruno
    STRUCTURAL SAFETY, 2025, 112
  • [39] Leakage localization in water distribution using data-driven models and sensitivity analysis
    Jensen, Tom Norgaard
    Puig, Vicenc
    Romera, Juli
    Kallesoe, Carsten Skovmose
    Wisniewski, Rafal
    Bendtsen, Jan Dimon
    IFAC PAPERSONLINE, 2018, 51 (24): : 736 - 741
  • [40] Sensitivity Analysis of Empirical and Data-Driven Models on Longitudinal Dispersion Coefficient in Streams
    Nezaratian H.
    Zahiri J.
    Kashefipour S.M.
    Environmental Processes, 2018, 5 (4) : 833 - 858