Research on Effectiveness of the Pre-Training Model in Improving the Performance of Spectral Feature Extraction

被引:0
|
作者
Ren, Ju-xiang [1 ]
Liu, Zhong-bao [2 ]
机构
[1] Shanxi Vocat Univ Engn Sci & Technol, Coll Informat Engn, Jinzhong 030619, Peoples R China
[2] Beijing Language & Culture Univ, Sch Informat Sci, Beijing 100083, Peoples R China
关键词
Massive spectral data; Spectral feature extraction; Pre-training model; Validation of effectiveness; CLASSIFICATION;
D O I
10.3964/j.issn.1000-0593(2024)12-3480-05
中图分类号
O433 [光谱学];
学科分类号
0703 ; 070302 ;
摘要
The development of observation technology has led to massive spectral data, How to automatically classify these data has received attention from researchers, the most important of which is feature extraction, Given the limitations of manual processing, most of the research uses machine learning algorithms to extract feature-based spectral data, However, these machine learning algorithms cannot handle massive spectral data due to the high spatial and temporal complexities. The pretrained models emerging in recent years have excellent feature extraction capabilities, Still, there is little research on the effectiveness of such a model in the feature extraction of spectral data, Therefore, this paper takes the stellar spectral data as the research object separately introduces the pre-training models such as BERT, ALBERT, GTP, and Convolutional Neural Networks (CNN) for feature extraction and classification of the stellar spectral data, and tries to verify the effectiveness of these pre-training models for feature extraction of stellar spectral data by comparing the experimental results. Python programming language is used to write the spectral classification program, Based on the feature extraction of the pre-trained models, the CNN model in TensorFlow 1. 14 is utilized for spectral data classification, The dataset used for the experiment is the SDSS DR10 stellar spectral dataset, including K-type, F-type, and G-type. The grid search and 5-fold cross-validation are utilized to obtain the experimental optimal parameters. The BERT model has the highest classification accuracies compared to ALBERT and GPT with the same experimental conditions. In terms of the average classification accuracies, the average classification accuracies of the BERT model are 0. 025 1, 0. 021 5, and 0. 022 5 higher than that of ALBERT, and 0. 049 7, 0. 042 4. and 0. 043 2 higher than that of GPT, on the K-type, F-type, and G-type stellar datasets. It is casy to draw the following conclusions by analyzing the experimental results, Firstly, the classification accuracies improve with the scale increase of training data; Secondly, the same model has the highest classification accuracies on the same training dataset of K-type stellar, followed by the F-type and the G-type: Thirdly, the BERT model has the best ability of feature extraction compared with ALBERT and GPT.
引用
收藏
页码:3480 / 3484
页数:5
相关论文
共 11 条
  • [1] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [2] [何东远 He Dongyuan], 2020, [北京师范大学学报. 自然科学版, Journal of Beijing Normal University. Natural Science], V56, P37
  • [3] Decomposition and Classification of Stellar Spectra Based on t-SNE
    Jiang Bin
    Zhao Zi-liang
    Wang Shu-ting
    Wei Ji-yu
    Qu Mei-xia
    [J]. SPECTROSCOPY AND SPECTRAL ANALYSIS, 2020, 40 (09) : 2913 - 2917
  • [4] Automated Classification of Massive Spectra Based on Enhanced Multi-Scale Coded Convolutional Neural Network
    Jiang, Bin
    Wei, Donglai
    Liu, Jiazhen
    Wang, Shuting
    Cheng, Liyun
    Wang, Zihao
    Qu, Meixia
    [J]. UNIVERSE, 2020, 6 (04)
  • [5] Lan Z., 2019, ARXIV
  • [6] Classification of large-scale stellar spectra based on deep convolutional neural network
    Liu, W.
    Zhu, M.
    Dai, C.
    He, D. Y.
    Yao, Jiawen
    Tian, H. F.
    Wang, B. Y.
    Wu, K.
    Zhan, Y.
    Chen, B. -Q.
    Luo, A-Li
    Wang, R.
    Cao, Y.
    Yu, X. C.
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2019, 483 (04) : 4774 - 4783
  • [7] Stellar Spectral Classification with Locality Preserving Projections and Support Vector Machine
    Liu Zhong-bao
    [J]. JOURNAL OF ASTROPHYSICS AND ASTRONOMY, 2016, 37 (02)
  • [8] Stellar Spectral Subclasses Classification Based on Fisher Criterion and Manifold Learning
    Liu Zhong-Bao
    Song Li-Peng
    [J]. PUBLICATIONS OF THE ASTRONOMICAL SOCIETY OF THE PACIFIC, 2015, 127 (954) : 789 - 794
  • [9] Stellar classification with convolutional neural networks and photometric images: a new catalogue of 50 million SDSS stars without spectra
    Shi, Jing-Hang
    Qiu, Bo
    Luo, A-Li
    He, Zhen-Dong
    Kong, Xiao
    Jiang, Xia
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2023, 520 (02) : 2269 - 2280
  • [10] Stellar spectral classification using principal component analysis and artificial neural networks
    Singh, HP
    Gulati, RK
    Gupta, R
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 1998, 295 (02) : 312 - 318