Formation lithology classification using scalable gradient boosted decision trees

被引:146
|
作者
Dev, Vikrant A. [1 ]
Eden, Mario R. [1 ]
机构
[1] Auburn Univ, Dept Chem Engn, Auburn, AL 36849 USA
关键词
Lithology classification; Scalable gradient boosting; XGBoost; LightGBM; Cat Boost; MACHINE LEARNING-METHODS; PREDICTION;
D O I
10.1016/j.compchemeng.2019.06.001
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The classification of underground formation lithology is an important task in petroleum exploration and engineering since it forms the basis of geological research studies and reservoir parameter calculations. Hence, there have recently been increased efforts to automate lithology classification by incorporating various data science tools and principles. In this regard, efforts were made recently to evaluate machine learning methods to classify formation lithology by using data from the Daniudui gas field (DGF) and Hangjinqi gas field (HGF), both located in China. Although the boosted ensemble learners utilized in the studies performed well, there is still scope for improvement with respect to the prediction metrics. Additionally, the issue of scalability of some of these algorithms is also of concern. Hence, building upon the success of these algorithms in the previous studies, we tap into the state of the art of scalable ensemble decision tree algorithms, in our study. Specifically, we applied recently developed gradient boosted decision tree (GBDT) systems, namely, XGBoost, LightGBM and CatBoost, after combining well log data obtained from DGF and HGF. We compare their performance with random forests (RFs), AdaBoost and gradient boosting machines (GBMs) which serve as a baseline. We evaluated the algorithms using metrics such as the micro average, macro average and weighted average of precision (Pr), recall (Re) and F1-score (F1) on the test set after hyperparameter tuning. In our analysis, among the applied algorithms, we found that LightGBM possessed the highest metrics. Our work identifies LightGBM and CatBoost as good first-choice algorithms for the supervised classification of lithology when utilizing well log data. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:392 / 404
页数:13
相关论文
共 50 条
  • [41] Estimation of inorganic crystal densities using gradient boosted trees
    Zhao, Jesse
    FRONTIERS IN MATERIALS, 2022, 9
  • [42] Gradient Boosted Trees for Corrective Learning
    Oguz, Baris U.
    Shinohara, Russell T.
    Yushkevich, Paul A.
    Oguz, Ipek
    MACHINE LEARNING IN MEDICAL IMAGING (MLMI 2017), 2017, 10541 : 203 - 211
  • [43] A Data-Driven Wall-Shear Stress Model for LES Using Gradient Boosted Decision Trees
    Radhakrishnan, Sarath
    Adu Gyamfi, Lawrence
    Miro, Arnau
    Font, Bernat
    Calafell, Joan
    Lehmkuhl, Oriol
    HIGH PERFORMANCE COMPUTING - ISC HIGH PERFORMANCE DIGITAL 2021 INTERNATIONAL WORKSHOPS, 2021, 12761 : 105 - 121
  • [44] Prediction of the Probability and Risk Factors of Early Abdominal Aortic Aneurysm Using the Gradient Boosted Decision Trees Model
    Chen, Song
    Liao, Chuan-Jun
    APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [45] Classifying stars, galaxies, and AGNs in CLAUDS plus HSC-SSP using gradient boosted decision trees
    Golob, Anneya
    Sawicki, Marcin
    Goulding, Andy D.
    Coupon, Jean
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2021, 503 (03) : 4136 - 4146
  • [46] Improving Supreme Court Forecasting Using Boosted Decision Trees
    Kaufman, Aaron Russell
    Kraft, Peter
    Sen, Maya
    POLITICAL ANALYSIS, 2019, 27 (03) : 381 - 387
  • [47] Predicting the outcome of construction litigation using boosted decision trees
    Arditi, D
    Pulket, T
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2005, 19 (04) : 387 - 393
  • [48] Supervised Hashing Using Graph Cuts and Boosted Decision Trees
    Lin, Guosheng
    Shen, Chunhua
    van den Hengel, Anton
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (11) : 2317 - 2331
  • [49] Classification of mammograms using decision trees
    Vibha, L.
    Harshavardhan, G. M.
    Pranaw, K.
    Deepa, Shenoy P.
    Venugopal, K. R.
    Patnaik, L. M.
    10TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2006, : 263 - 266
  • [50] Adapting and Evaluating Influence-Estimation Methods for Gradient-Boosted Decision Trees
    Brophy, Jonathan
    Hammoudeh, Zayd
    Lowd, Daniel
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24