Chemometric Classification of Crude Oils in Complex Petroleum Systems Using t-Distributed Stochastic Neighbor Embedding Machine Learning Algorithm

被引:9
|
作者
Tao, Keyu [1 ]
Cao, Jian [1 ]
Wang, Yuce [1 ]
Mi, Julei [2 ]
Ma, Wanyun [2 ]
Shi, Chunhua [1 ]
机构
[1] Nanjing Univ, Sch Earth Sci & Engn, State Key Lab Mineral Deposits Res, Nanjing 210023, Jiangsu, Peoples R China
[2] PetroChina Xinjiang Oilfield Co, Res Inst Petr Explorat & Dev, Karamay 843000, Xinjiang, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
PERMIAN FENGCHENG FORMATION; NORTHWESTERN JUNGGAR BASIN; NW CHINA; TRIASSIC RESERVOIRS; ORGANIC-MATTER; SOURCE ROCKS; ORIGIN; GEOCHEMISTRY; PYROLYSIS; MIGRATION;
D O I
10.1021/acs.energyfuels.0c01333
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
The origin of crude oils is fundamental in the study of petroleum systems, but it encounters difficulties in complex systems because traditional geochemistry proxies are influenced by multiple factors (e.g., oil mixing, secondary alteration) and the interpretation of the data is challenging. To develop new potential approaches, a pilot study using the t-distributed stochastic neighbor embedding (t-SNE) machine learning algorithm was performed, based on a case study of the saline and alkaline lake petroleum systems in the lower Permian Mahu Sag, northwestern Junggar Basin, China. The algorithm revealed three main types of alkaline lacustrine related source rocks in the studied Fengcheng Formation: (i) argillaceous rocks deposited in brackish water and a weakly reducing environment; (ii) dolomitic mudstones deposited in saline water and a reducing environment; (iii) argillaceous dolomites deposited in hypersaline water and a strongly reducing environment. These organic facies are not time equivalent and vary temporally and spatially in the context of the alkaline lake evolution. Analysis of 43 crude oil samples showed that 5, 48, and 42% of the total number of samples were derived from argillaceous, dolomitic mudstone, and argillaceous dolomite source rocks, respectively, while the remaining 5% oil samples had a mixed origin from the former two end members. This suggests that hydrocarbon generation in the Fengcheng petroleum systems results mainly in large-scale oil generation from dolomitic source rocks. The biological precursors in the dolomitic rocks are dominated by haloduric algae, and the oil generation window is prolonged through organic-inorganic interactions during the hydrocarbon generation. This might be favorable for the preservation of an oil phase during deep burial and at high maturity. This represents a shale oil accumulation system in general as the source rocks and oils are the within the Fengcheng sequence. Our data suggest that the machine learning algorithm can find further application in this field with promising prospects.
引用
收藏
页码:5884 / 5899
页数:16
相关论文
共 50 条
  • [21] Monitoring of papermaking wastewater treatment processes using t-distributed stochastic neighbor embedding
    Ma, Xiaobo
    Zhang, Yuchen
    Zhang, Fengshan
    Liu, Hongbin
    JOURNAL OF ENVIRONMENTAL CHEMICAL ENGINEERING, 2021, 9 (06):
  • [22] On the solidification of the manifold of the t-distributed stochastic neighbour embedding for condition classification of machine tools
    Wang, Jing
    Cheng, Xiaobin
    Wang, Xun
    Gao, Yan
    Liu, Bin
    Han, Mingmei
    Yang, Jun
    ENGINEERING RESEARCH EXPRESS, 2021, 3 (04):
  • [23] Classification of Weld Seam Width Based on Detrended Fluctuation Analysis, t-Distributed Stochastic Neighbor Embedding, and Support Vector Machine
    Yong Huang
    Yang, Dongqing
    Lei Wang
    Gu Jieren
    Zhang Xiaoyong
    Wang, Kehong
    JOURNAL OF MATERIALS ENGINEERING AND PERFORMANCE, 2022, 31 (05) : 3975 - 3984
  • [24] Classification of Weld Seam Width Based on Detrended Fluctuation Analysis, t-Distributed Stochastic Neighbor Embedding, and Support Vector Machine
    Yong Huang
    Dongqing Yang
    Lei Wang
    Gu Jieren
    Zhang Xiaoyong
    Kehong Wang
    Journal of Materials Engineering and Performance, 2022, 31 : 3975 - 3984
  • [25] On the Use of t-Distributed Stochastic Neighbor Embedding for Data Visualization and Classification of Individuals with Parkinson's Disease
    Oliveira, Fabio Henrique M.
    Machado, Alessandro R. P.
    Andrade, Adriano O.
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2018, 2018
  • [26] T-distributed Stochastic Neighbor Network for unsupervised representation learning
    Wang, Zheng
    Xie, Jiaxi
    Nie, Feiping
    Wang, Rong
    Jia, Yanyan
    Liu, Shichang
    NEURAL NETWORKS, 2024, 179
  • [27] Dimensionality Reduction of Diabetes Mellitus Patient Data Using the T-Distributed Stochastic Neighbor Embedding
    Meniailov, Ievgen
    Krivtsov, Serhii
    Chumachenko, Tetyana
    SMART TECHNOLOGIES IN URBAN ENGINEERING, STUE-2022, 2023, 536 : 86 - 95
  • [28] Photovoltaic Array Fault Detection and Classification based on T-Distributed Stochastic Neighbor Embedding and Robust Soft Learning Vector Quantization
    Afrasiabi, Shahabodin
    Afrasiabi, Mousa
    Behdani, Behzad
    Mohammadi, Mohammad
    Javadi, Mohammad S.
    Osorio, Gerardo J.
    Catalao, Joao P. S.
    2021 21ST IEEE INTERNATIONAL CONFERENCE ON ENVIRONMENT AND ELECTRICAL ENGINEERING AND 2021 5TH IEEE INDUSTRIAL AND COMMERCIAL POWER SYSTEMS EUROPE (EEEIC/I&CPS EUROPE), 2021,
  • [29] Vibration-based detection and classification of structural changes using principal component analysis and t-distributed stochastic neighbor embedding
    Agis, David
    Tibaduiza, Diego A.
    Pozo, Francesc
    STRUCTURAL CONTROL & HEALTH MONITORING, 2020, 27 (06):
  • [30] Revealing Geochemical Patterns Associated with Mineralization Using t-Distributed Stochastic Neighbor Embedding and Random Forest
    Zixian Shi
    Renguang Zuo
    Yihui Xiong
    Siquan Sun
    Bao Zhou
    Mathematical Geosciences, 2023, 55 : 321 - 344