Real-time prediction of rock mass classification based on TBM operation big data and stacking technique of ensemble learning

被引:126
|
作者
Hou, Shaokang [1 ]
Liu, Yaoru [1 ]
Yang, Qiang [1 ]
机构
[1] Tsinghua Univ, State Key Lab Hydrosci & Engn, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Tunnel boring machine (TBM) operation data; Rock mass classification; Stacking ensemble learning; Sample imbalance; Synthetic minority oversampling technique (SMOTE); PERFORMANCE PREDICTION; GEOLOGICAL CONDITIONS; TUNNEL; FRAMEWORK; SELECTION; AGREEMENT; MODEL;
D O I
10.1016/j.jrmge.2021.05.004
中图分类号
P5 [地质学];
学科分类号
0709 ; 081803 ;
摘要
Real-time prediction of the rock mass class in front of the tunnel face is essential for the adaptive adjustment of tunnel boring machines (TBMs). During the TBM tunnelling process, a large number of operation data are generated, reflecting the interaction between the TBM system and surrounding rock, and these data can be used to evaluate the rock mass quality. This study proposed a stacking ensemble classifier for the real-time prediction of the rock mass classification using TBM operation data. Based on the Songhua River water conveyance project, a total of 7538 TB M tunnelling cycles and the corresponding rock mass classes are obtained after data preprocessing. Then, through the tree-based feature selection method, 10 key TBM operation parameters are selected, and the mean values of the 10 selected features in the stable phase after removing outliers are calculated as the inputs of classifiers. The preprocessed data are randomly divided into the training set (90%) and test set (10%) using simple random sampling. Besides stacking ensemble classifier, seven individual classifiers are established as the comparison. These classifiers include support vector machine (SVM), k-nearest neighbors (KNN), random forest (RF), gradient boosting decision tree (GBDT), decision tree (DT), logistic regression (LR) and multilayer perceptron (MLP), where the hyper-parameters of each classifier are optimised using the grid search method. The prediction results show that the stacking ensemble classifier has a better performance than individual classifiers, and it shows a more powerful learning and generalisation ability for small and imbalanced samples. Additionally, a relative balance training set is obtained by the synthetic minority oversampling technique (SMOTE), and the influence of sample imbalance on the prediction performance is discussed. (C) 2022 Institute of Rock and Soil Mechanics, Chinese Academy of Sciences. Production and hosting by Elsevier B.V.
引用
收藏
页码:123 / 143
页数:21
相关论文
共 50 条
  • [41] A Study of Deterioration in Classification Models in Real-Time Big Data Environment
    Uddin, Vali
    Rizvi, Syed Sajjad Hussain
    Hashmani, Manzoor Ahmed
    Jameel, Syed Muslim
    Ansari, Tayyab
    EMERGING TRENDS IN INTELLIGENT COMPUTING AND INFORMATICS: DATA SCIENCE, INTELLIGENT INFORMATION SYSTEMS AND SMART COMPUTING, 2020, 1073 : 79 - 87
  • [42] Real-time Classification of Fetal Status Based on Deep Learning and Cardiotocography Data
    Kwang-Sig Lee
    Eun Saem Choi
    Young Jin Nam
    Nae Won Liu
    Yong Seok Yang
    Ho Yeon Kim
    Ki Hoon Ahn
    Soon Cheol Hong
    Journal of Medical Systems, 47
  • [43] Real-time big data image classification under MapReduce framework
    Feng, Lin, 1600, Institute of Computing Technology (26):
  • [44] Prediction model for the compressive strength of rock based on stacking ensemble learning and shapley additive explanations
    Wu, Luyuan
    Li, Jianhui
    Zhang, Jianwei
    Wang, Zifa
    Tong, Jingbo
    Ding, Fei
    Li, Meng
    Feng, Yi
    Li, Hui
    BULLETIN OF ENGINEERING GEOLOGY AND THE ENVIRONMENT, 2024, 83 (11)
  • [45] Real-time hard-rock tunnel prediction model for rock mass classification using CatBoost integrated with Sequential Model-Based Optimization
    Bo, Yin
    Liu, Quansheng
    Huang, Xing
    Pan, Yucong
    TUNNELLING AND UNDERGROUND SPACE TECHNOLOGY, 2022, 124
  • [46] Fuzzy clustering theory based rock mass cuttability classification prediction model for TBM tunnelling
    Wang, Pan
    Guo, Wei
    Zhu, Dianhua
    Modern Tunnelling Technology, 2014, 51 (06) : 58 - 65
  • [47] A spark-based big data analysis framework for real-time sentiment prediction on streaming data
    Kilinc, Deniz
    SOFTWARE-PRACTICE & EXPERIENCE, 2019, 49 (09): : 1352 - 1364
  • [48] VisMillion: A novel interactive visualization technique for real-time big data
    Pires, Goncalo
    Mendes, Daniel
    Goncalves, Daniel
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON GRAPHICS AND INTERACTION (ICGI 2019), 2019, : 86 - 93
  • [49] Productivity estimation of cutter suction dredger operation through data mining and learning from real-time big data
    Fu, Jiake
    Tian, Huijing
    Song, Lingguang
    Li, Mingchao
    Bai, Shuo
    Ren, Qiubing
    ENGINEERING CONSTRUCTION AND ARCHITECTURAL MANAGEMENT, 2021, 28 (07) : 2023 - 2041
  • [50] Congestion Prediction With Big Data for Real-Time way Highway Traffic
    Tseng, Fan-Hsun
    Hsueh, Jen-Hao
    Tseng, Chia-Wei
    Yang, Yao-Tsung
    Chao, Han-Chieh
    Chou, Li-Der
    IEEE ACCESS, 2018, 6 : 57311 - 57323