Sediment grain segmentation in thin-section images using dual-modal Vision Transformer

被引:3
|
作者
Zheng, Dongyu [1 ,2 ,3 ]
Hou, Li [4 ]
Hu, Xiumian [5 ]
Hou, Mingcai [1 ,2 ,3 ]
Dong, Kai [1 ]
Hu, Sihai [1 ]
Teng, Runlin [1 ]
Ma, Chao [1 ,2 ,3 ]
机构
[1] Chengdu Univ Technol, State Key Lab Oil & Gas Reservoir Geol & Exploitat, Chengdu 610059, Peoples R China
[2] Chengdu Univ Technol, MNR, Key Lab Deep time Geog & Environm Reconstruct & Ap, Chengdu, Peoples R China
[3] Chengdu Univ Technol, Inst Sedimentary Geol, Chengdu, Peoples R China
[4] Chengdu Univ Technol, Coll Comp Sci & Cyber Secur, Chengdu 610059, Peoples R China
[5] Nanjing Univ, Sch Earth Sci & Engn, Nanjing 210023, Peoples R China
关键词
Thin-section images; Deep learning; Vision Transformer; Dual; -modal; Semantic segmentation; Petrography; RECOGNITION; ROCKS;
D O I
10.1016/j.cageo.2024.105664
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Accurately identifying grain types in thin sections of sandy sediments or sandstones is crucial for understanding their provenance, depositional environments, and potential as natural resources. Although traditional computer vision methods and machine learning algorithms have been used for automatic grain identification, recent advancements in deep learning techniques have opened up new possibilities for achieving more reliable results with less manual labor. In this study, we present Trans-SedNet, a state-of-the-art dual-modal Vision-Transformer (ViT) model that uses both cross- (XPL) and plane-polarized light (PPL) images to achieve semantic segmentation of thin-section images. Our model classifies a total of ten grain types, including subtypes of quartz, feldspar, and lithic fragments, to emulate the manual identification process in sedimentary petrology. To optimize performance, we use SegFormer as the model backbone and add window- and mix-attention to the encoder to identify local information in the images and to best use XPL and PPL images. We also use a combination of focal and dice loss and a smoothing procedure to address imbalances and reduce over-segmentation. Our comparative analysis of several deep convolution neural networks and ViT models, including FCN, U-Net, DeepLabV3Plus, SegNeXT, and CMX, shows that Trans-SedNet outperforms the other models with a significant increase in evaluation metrics of mIoU and mPA. We also conduct an experiment to test the models' ability to handle dual-modal information, which reveals that the dual-modal models, including Trans-SedNet, achieve better results than single-modal models with the extra input of PPL images. Our study demonstrates the potential of ViT models in semantic segmentation of thin-section images and highlights the importance of dual-modal models for handling complex input in various geoscience disciplines. By improving data quality and quantity, our model has the potential to enhance the efficiency and reliability of grain identification in sedimentary petrology and relevant subjects.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Computer-assisted diagnosis for axillary lymph node metastasis of early breast cancer based on transformer with dual-modal adaptive mid-term fusion using ultrasound elastography
    Gong, Chihao
    Wu, Yinglan
    Zhang, Guangyuan
    Liu, Xuan
    Zhu, Xiaoyao
    Cai, Nian
    Li, Jian
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2025, 119
  • [42] One-dimensional quantitative evaluation of peripheral lung adenocarcinoma with or without ground-glass opacity on thin-section CT images using profile curves
    Yanagawa, M.
    Kuriyama, K.
    Kunitomi, Y.
    Tomiyama, N.
    Honda, O.
    Sumikawa, H.
    Inone, A.
    Mihara, N.
    Yoshida, S.
    Johkoh, T.
    Nakamura, H.
    BRITISH JOURNAL OF RADIOLOGY, 2009, 82 (979): : 532 - 540
  • [43] The correlation of thin-section CT findings with clinicopathological features and survival in small-sized lung adenocarcinoma: novel two methods using lung window level setting images
    Ishikawa, Yoshihiro
    Kondo, Tetsuro
    Saito, Haruhiro
    Oshita, Fumihiro
    Ito, Hiroyuki
    Tsuboi, Masahiro
    Yokose, Tomoyuki
    Kameda, Youichi
    Noda, Masakazu
    Yamada, Kouzo
    Nakayama, Haruhiko
    JOURNAL OF THORACIC ONCOLOGY, 2009, 4 (09) : S736 - S736
  • [44] Commercially Available Computer-Aided Detection System for Pulmonary Nodules on Thin-Section Images Using 64 Detectors-Row CT: Preliminary Study of 48 Cases
    Yanagawa, Masahiro
    Honda, Osamu
    Yoshida, Shigeyuki
    Ono, Yusuke
    Inoue, Atsuo
    Daimon, Tadahisa
    Sumikawa, Hiromitsu
    Mihara, Naoki
    Johkoh, Takeshi
    Tomiyama, Noriyuki
    Nakamura, Hironobu
    ACADEMIC RADIOLOGY, 2009, 16 (08) : 924 - 933
  • [45] Segmentation of sandstone thin section images with separation of touching grains using optimum path forest operators (vol 57, pg 146, 2013)
    Mingireanov Filho, Ivan
    Spina, Thiago Vallin
    Falcao, Alexandre Xavier
    Vidal, Alexandre Campane
    COMPUTERS & GEOSCIENCES, 2014, 62 : 241 - 242
  • [46] AN INTERPRETABLE GLAUCOMA DETECTION USING DUAL SCALE CROSS-ATTENTION VISION TRANSFORMER-BASED LONG SHORT TERM MEMORY WITH OPTICAL CUP AND DISK SEGMENTATION
    Krishnamoorthy, V.
    Logeswari, S.
    JOURNAL OF MECHANICS IN MEDICINE AND BIOLOGY, 2025, 25 (01)
  • [47] Semi-automatic segmentation of petrographic thin section images using a "seeded-region growing algorithm" with an application to characterize wheathered subarkose sandstone
    Asmussen, Pascal
    Conrad, Olaf
    Guenther, Andreas
    Kirsch, Moritz
    Riller, Ulrich
    COMPUTERS & GEOSCIENCES, 2015, 83 : 89 - 99
  • [48] Semi-automatic segmentation of petrographic thin section images using a seeded-region growing algorithm with an application to characterize wheathered subarkose sandstone
    Institut für Geologie, Universität Hamburg, Bundesstrasse 55, Hamburg
    20146, Germany
    不详
    20146, Germany
    不详
    Comput. Geosci., (89-99):
  • [49] Pulmonary MRI with ultra-short TE using single- and dual-echo methods: comparison of capability for quantitative differentiation of non- or minimally invasive adenocarcinomas from other lung cancers with that of standard-dose thin-section CT
    Ohno, Yoshiharu
    Yui, Masao
    Yamamoto, Kaori
    Ikedo, Masato
    Oshima, Yuka
    Hamabuchi, Nayu
    Hanamatsu, Satomu
    Nagata, Hiroyuki
    Ueda, Takahiro
    Ikeda, Hirotaka
    Takenaka, Daisuke
    Yoshikawa, Takeshi
    Ozawa, Yoshiyuki
    Toyama, Hiroshi
    EUROPEAN RADIOLOGY, 2024, 34 (02) : 1065 - 1076
  • [50] Pulmonary MRI with ultra-short TE using single- and dual-echo methods: comparison of capability for quantitative differentiation of non- or minimally invasive adenocarcinomas from other lung cancers with that of standard-dose thin-section CT
    Yoshiharu Ohno
    Masao Yui
    Kaori Yamamoto
    Masato Ikedo
    Yuka Oshima
    Nayu Hamabuchi
    Satomu Hanamatsu
    Hiroyuki Nagata
    Takahiro Ueda
    Hirotaka Ikeda
    Daisuke Takenaka
    Takeshi Yoshikawa
    Yoshiyuki Ozawa
    Hiroshi Toyama
    European Radiology, 2024, 34 : 1065 - 1076