Development of a Large-scale Korean Language Model in the Field of Geosciences

被引:0
|
作者
Lee, Sang-ho [1 ]
机构
[1] Korea Inst Geosci & Mineral Resources, Mineral Resources Div, Daejeon 34132, South Korea
来源
ECONOMIC AND ENVIRONMENTAL GEOLOGY | 2024年 / 57卷 / 05期
关键词
large language model; generative model; natural language processing; artificial intelligence; geoscience;
D O I
10.9719/EEG.2024.57.5.539
中图分类号
P5 [地质学];
学科分类号
0709 ; 081803 ;
摘要
With the rapid development and commercialization of large-scale generative language models, concerns regarding the appropriateness of model outputs, expertise, and data security have been emerged. In particular, Korean generative language models specialized in the field of geoscience have not yet been studied due to difficulties in data processing, preprocessing and a lack of development cases. This study conducted the entire process for developing a Korean language model specialized in the field of geoscience and evaluated its applicability in related fields. To achieve this, academic data related to geoscience were collected and preprocessed to create a dataset suitable for the training of the language model. The dataset was applied to the Llama2 model for the training. The trained model was quantitatively evaluated using 19 different evaluation datasets from various fields. The results demonstrated improved functionalities related to scientific question-answering and Korean text interpretation compared to the original model. The language model developed through this study can potentially enhance research productivity in the field of geoscience, offering benefits such as idea generation. The outcomes of this study are expected to stimulate further research and the utilization of generative language models in geoscience in the future.
引用
收藏
页码:539 / 550
页数:12
相关论文
共 50 条
  • [1] TC-BERT: large-scale language model for Korean technology commercialization documents
    Kim, Taero
    Oh, Changdae
    Hwang, Hyeji
    Lee, Eunkyeong
    Kim, Yewon
    Choi, Yunjeong
    Kim, Sungjin
    Choi, Hosik
    Song, Kyungwoo
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
  • [2] The Waterfall Model in Large-Scale Development
    Petersen, Kai
    Wohlin, Claes
    Baca, Dejan
    PRODUCT-FOCUSED SOFTWARE PROCESS IMPROVEMENT, PROCEEDINGS, 2009, 32 : 386 - 400
  • [3] A MODEL OF THE LARGE-SCALE GRAVITATIONAL-FIELD OF GALAXIES
    KUTUZOV, SA
    OSIPKOV, LP
    VESTNIK LENINGRADSKOGO UNIVERSITETA SERIYA MATEMATIKA MEKHANIKA ASTRONOMIYA, 1981, (01): : 99 - 105
  • [4] Implementation of a large-scale language model adaptation in a cloud environment
    Kwang-Ho Kim
    Dae-Young Jung
    Donghyun Lee
    Hyuk-Jun Lee
    Sung-Yong Park
    Myoung-Wan Koo
    Ji-Hwan Kim
    Jeong-sik Park
    Hyung-Bae Jeon
    Yun-Keun Lee
    Multimedia Tools and Applications, 2016, 75 : 5029 - 5045
  • [5] Implementation of a large-scale language model adaptation in a cloud environment
    Kim, Kwang-Ho
    Jung, Dae-Young
    Lee, Donghyun
    Lee, Hyuk-Jun
    Park, Sung-Yong
    Koo, Myoung-Wan
    Kim, Ji-Hwan
    Park, Jeong-sik
    Jeon, Hyung-Bae
    Lee, Yun-Keun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (09) : 5029 - 5045
  • [6] MODEL DESCRIPTION OF THE LARGE-SCALE STRUCTURE OF THE UNIVERSE DEVELOPMENT
    GURBATOV, SN
    SAICHEV, AI
    SHANDARIN, SF
    DOKLADY AKADEMII NAUK SSSR, 1985, 285 (02): : 323 - 326
  • [7] Development of a large-scale transport model with focus on cycling
    Liu, Chengxi
    Tapani, Andreas
    Kristoffersson, Ida
    Rydergren, Clas
    Jonsson, Daniel
    TRANSPORTATION RESEARCH PART A-POLICY AND PRACTICE, 2020, 134 (134) : 164 - 183
  • [8] Sociocracy - An Organization Model for Large-Scale Agile Development
    Eckstein, Jutta
    PROCEEDINGS OF THE XP2016 SCIENTIFIC WORKSHOPS, 2016,
  • [9] Development of a Generic Model for Large-Scale Healthcare Organizations
    Alkhaldi, Faisal A.
    Alouani, Ali T.
    201919TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH ASSURANCE SYSTEMS ENGINEERING (HASE 2019), 2019, : 200 - 207
  • [10] STRATEGIC MANAGEMENT OF A LARGE-SCALE TECHNOLOGY DEVELOPMENT - THE CASE OF THE KOREAN TELECOMMUNICATIONS INDUSTRY
    LEE, J
    BAE, ZT
    LEE, J
    JOURNAL OF ENGINEERING AND TECHNOLOGY MANAGEMENT, 1994, 11 (02) : 149 - 170