Chemometric Classification of Crude Oils in Complex Petroleum Systems Using t-Distributed Stochastic Neighbor Embedding Machine Learning Algorithm

被引:9
|
作者
Tao, Keyu [1 ]
Cao, Jian [1 ]
Wang, Yuce [1 ]
Mi, Julei [2 ]
Ma, Wanyun [2 ]
Shi, Chunhua [1 ]
机构
[1] Nanjing Univ, Sch Earth Sci & Engn, State Key Lab Mineral Deposits Res, Nanjing 210023, Jiangsu, Peoples R China
[2] PetroChina Xinjiang Oilfield Co, Res Inst Petr Explorat & Dev, Karamay 843000, Xinjiang, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
PERMIAN FENGCHENG FORMATION; NORTHWESTERN JUNGGAR BASIN; NW CHINA; TRIASSIC RESERVOIRS; ORGANIC-MATTER; SOURCE ROCKS; ORIGIN; GEOCHEMISTRY; PYROLYSIS; MIGRATION;
D O I
10.1021/acs.energyfuels.0c01333
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
The origin of crude oils is fundamental in the study of petroleum systems, but it encounters difficulties in complex systems because traditional geochemistry proxies are influenced by multiple factors (e.g., oil mixing, secondary alteration) and the interpretation of the data is challenging. To develop new potential approaches, a pilot study using the t-distributed stochastic neighbor embedding (t-SNE) machine learning algorithm was performed, based on a case study of the saline and alkaline lake petroleum systems in the lower Permian Mahu Sag, northwestern Junggar Basin, China. The algorithm revealed three main types of alkaline lacustrine related source rocks in the studied Fengcheng Formation: (i) argillaceous rocks deposited in brackish water and a weakly reducing environment; (ii) dolomitic mudstones deposited in saline water and a reducing environment; (iii) argillaceous dolomites deposited in hypersaline water and a strongly reducing environment. These organic facies are not time equivalent and vary temporally and spatially in the context of the alkaline lake evolution. Analysis of 43 crude oil samples showed that 5, 48, and 42% of the total number of samples were derived from argillaceous, dolomitic mudstone, and argillaceous dolomite source rocks, respectively, while the remaining 5% oil samples had a mixed origin from the former two end members. This suggests that hydrocarbon generation in the Fengcheng petroleum systems results mainly in large-scale oil generation from dolomitic source rocks. The biological precursors in the dolomitic rocks are dominated by haloduric algae, and the oil generation window is prolonged through organic-inorganic interactions during the hydrocarbon generation. This might be favorable for the preservation of an oil phase during deep burial and at high maturity. This represents a shale oil accumulation system in general as the source rocks and oils are the within the Fengcheng sequence. Our data suggest that the machine learning algorithm can find further application in this field with promising prospects.
引用
收藏
页码:5884 / 5899
页数:16
相关论文
共 50 条
  • [41] Applying t-Distributed Stochastic Neighbor Embedding for Improving Fingerprinting-Based Localization System
    Tarekegn, Getaneh Berie
    Tai, Li-Chia
    Lin, Hsin-Piao
    Tesfaw, Belayneh Abebe
    Juang, Rong-Terng
    Hsu, Huan-Chia
    Huang, Kai-Lun
    Singh, Kanishk
    IEEE SENSORS LETTERS, 2023, 7 (09)
  • [42] T-Distributed Stochastic Neighbor Embedding Based on Cockroach Swarm Optimization with Student Distribution Parameters
    Qiu, Mengdie
    Yang, Zan
    Nai, Wei
    Li, Dan
    Xing, Yidan
    Li, Kai
    PROCEEDINGS OF 2021 IEEE 12TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2021, : 291 - 294
  • [43] Fault diagnosis of industrial process based on the optimal parametric t-distributed stochastic neighbor embedding
    Ruixue Jia
    Jing Wang
    Jinglin Zhou
    Science China Information Sciences, 2021, 64
  • [44] Fault diagnosis of industrial process based on the optimal parametric t-distributed stochastic neighbor embedding
    Jia, Ruixue
    Wang, Jing
    Zhou, Jinglin
    SCIENCE CHINA-INFORMATION SCIENCES, 2021, 64 (05)
  • [45] Clustering Heterogeneous Conformational Ensembles of Intrinsically Disordered Proteins with t-Distributed Stochastic Neighbor Embedding
    Appadurai, Rajeswari
    Koneru, Jaya Krishna
    Bonomi, Massimiliano
    Robustelli, Paul
    Srivastava, Anand
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2023, 19 (14) : 4711 - 4727
  • [46] Fault diagnosis of industrial process based on the optimal parametric t-distributed stochastic neighbor embedding
    Ruixue JIA
    Jing WANG
    Jinglin ZHOU
    ScienceChina(InformationSciences), 2021, 64 (05) : 233 - 235
  • [47] Time-Lagged t-Distributed Stochastic Neighbor Embedding (t-SNE) of Molecular Simulation Trajectories
    Spiwok, Vojtech
    Kriz, Pavel
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2020, 7
  • [48] Top corner gas concentration prediction using t-distributed Stochastic Neighbor Embedding and Support Vector Regression algorithms
    Wu, Haibo
    Shi, Shiliang
    Lu, Yi
    Liu, Yong
    Huang, Weihong
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (14):
  • [49] t-Distributed Stochastic Neighbor Embedding (t-SNE): A tool for eco-physiological transcriptomic analysis
    Cieslak, Matthew C.
    Castelfranco, Ann M.
    Roncalli, Vittoria
    Lenz, Petra H.
    Hartline, Daniel K.
    MARINE GENOMICS, 2020, 51
  • [50] Chatter Detection Approach Based on Wavelet Synchrosqueezing and t-Distributed Stochastic Neighbor Embedding for a Turning Process
    Kuo, Ping-Huan
    Lin, Po-Lun
    Yau, Her-Terng
    IEEE SENSORS JOURNAL, 2024, 24 (07) : 9660 - 9670