Chemometric Classification of Crude Oils in Complex Petroleum Systems Using t-Distributed Stochastic Neighbor Embedding Machine Learning Algorithm

被引:9
|
作者
Tao, Keyu [1 ]
Cao, Jian [1 ]
Wang, Yuce [1 ]
Mi, Julei [2 ]
Ma, Wanyun [2 ]
Shi, Chunhua [1 ]
机构
[1] Nanjing Univ, Sch Earth Sci & Engn, State Key Lab Mineral Deposits Res, Nanjing 210023, Jiangsu, Peoples R China
[2] PetroChina Xinjiang Oilfield Co, Res Inst Petr Explorat & Dev, Karamay 843000, Xinjiang, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
PERMIAN FENGCHENG FORMATION; NORTHWESTERN JUNGGAR BASIN; NW CHINA; TRIASSIC RESERVOIRS; ORGANIC-MATTER; SOURCE ROCKS; ORIGIN; GEOCHEMISTRY; PYROLYSIS; MIGRATION;
D O I
10.1021/acs.energyfuels.0c01333
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
The origin of crude oils is fundamental in the study of petroleum systems, but it encounters difficulties in complex systems because traditional geochemistry proxies are influenced by multiple factors (e.g., oil mixing, secondary alteration) and the interpretation of the data is challenging. To develop new potential approaches, a pilot study using the t-distributed stochastic neighbor embedding (t-SNE) machine learning algorithm was performed, based on a case study of the saline and alkaline lake petroleum systems in the lower Permian Mahu Sag, northwestern Junggar Basin, China. The algorithm revealed three main types of alkaline lacustrine related source rocks in the studied Fengcheng Formation: (i) argillaceous rocks deposited in brackish water and a weakly reducing environment; (ii) dolomitic mudstones deposited in saline water and a reducing environment; (iii) argillaceous dolomites deposited in hypersaline water and a strongly reducing environment. These organic facies are not time equivalent and vary temporally and spatially in the context of the alkaline lake evolution. Analysis of 43 crude oil samples showed that 5, 48, and 42% of the total number of samples were derived from argillaceous, dolomitic mudstone, and argillaceous dolomite source rocks, respectively, while the remaining 5% oil samples had a mixed origin from the former two end members. This suggests that hydrocarbon generation in the Fengcheng petroleum systems results mainly in large-scale oil generation from dolomitic source rocks. The biological precursors in the dolomitic rocks are dominated by haloduric algae, and the oil generation window is prolonged through organic-inorganic interactions during the hydrocarbon generation. This might be favorable for the preservation of an oil phase during deep burial and at high maturity. This represents a shale oil accumulation system in general as the source rocks and oils are the within the Fengcheng sequence. Our data suggest that the machine learning algorithm can find further application in this field with promising prospects.
引用
收藏
页码:5884 / 5899
页数:16
相关论文
共 50 条
  • [31] Quasi-cluster centers clustering algorithm based on potential entropy and t-distributed stochastic neighbor embedding
    Xian Fang
    Zhixin Tie
    Yinan Guan
    Shanshan Rao
    Soft Computing, 2019, 23 : 5645 - 5657
  • [32] Visualization of vibrational spectroscopy for agro-food samples using t-Distributed Stochastic Neighbor Embedding
    Luo, Na
    Yang, Xinting
    Sun, Chuanheng
    Xing, Bin
    Han, Jiawei
    Zhao, Chunjiang
    FOOD CONTROL, 2021, 126
  • [33] Using Visualization of t-Distributed Stochastic Neighbor Embedding To Identify Immune Cell Subsets in Mouse Tumors
    Acuff, Nicole V.
    Linden, Joel
    JOURNAL OF IMMUNOLOGY, 2017, 198 (11): : 4539 - 4546
  • [34] Revealing Geochemical Patterns Associated with Mineralization Using t-Distributed Stochastic Neighbor Embedding and Random Forest
    Shi, Zixian
    Zuo, Renguang
    Xiong, Yihui
    Sun, Siquan
    Zhou, Bao
    MATHEMATICAL GEOSCIENCES, 2023, 55 (03) : 321 - 344
  • [35] Visualizing temporal brain-state changes for fMRI using t-distributed stochastic neighbor embedding
    Parmar, Harshit
    Nutter, Brian
    Long, Rodney
    Antani, Sameer
    Mitra, Sunanda
    JOURNAL OF MEDICAL IMAGING, 2021, 8 (04)
  • [36] t-Distributed Stochastic Neighbor Embedding Method with the Least Information Loss for Macromolecular Simulations
    Zhou, Hongyu
    Wang, Feng
    Tao, Peng
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2018, 14 (11) : 5499 - 5510
  • [37] Quasi-cluster centers clustering algorithm based on potential entropy and t-distributed stochastic neighbor embedding
    Fang, Xian
    Tie, Zhixin
    Guan, Yinan
    Rao, Shanshan
    SOFT COMPUTING, 2019, 23 (14) : 5645 - 5657
  • [38] Persistent-Homology-Based Microstructural Optimization of Materials Using t-Distributed Stochastic Neighbor Embedding
    Wang, Zhi-Lei
    Ogawa, Toshio
    Adachi, Yoshitaka
    ADVANCED THEORY AND SIMULATIONS, 2020, 3 (07)
  • [39] Using t-distributed stochastic neighbor embedding for visualization and segmentation of 3D point clouds of plants
    Dutagaci, Helin
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2023, 31 (05) : 792 - 813
  • [40] Using parametric t-distributed stochastic neighbor embedding combined with hierarchical neural network for network intrusion detectione
    Yao, Huijun
    Li, Chaopeng
    Sun, Peng
    International Journal of Network Security, 2020, 22 (02) : 265 - 274