Learning How to Translate North Korean through South Korean

Cited by: 0

Authors:

Kim, Hwichan [1]

Moon, Sangwhan [2,3]

Okazaki, Naoaki [2]

Komachi, Mamoru [1]

Affiliations:

[1] Tokyo Metropolitan Univ, 6-6 Asahigaoka, Hino, Tokyo 1910065, Japan

[2] Tokyo Inst Technol, 2-12-1 Ookayama, Tokyo 1528550, Japan

[3] Google LLC, 1600 Amphitheatre Pkwy, Mountain View, CA 94043 USA

Source:

LREC 2022: THIRTEENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION | 2022

Keywords:

Parallel corpus construction; Machine translation; Korean

DOI:

Not available

Chinese Library Classification (CLC) number:

TP39 [Computer applications]

Discipline classification code:

081203; 0835

Abstract:

South and North Korea both use the Korean language, but there are some differences in their linguistic aspects, such as vocabulary and spelling rules. Korean NLP research has focused on South Korean only, and existing NLP systems for the Korean language, such as neural machine translation (NMT) models, cannot properly handle North Korean input. Training a model using North Korean data is the most straightforward approach to solving this problem, but there is insufficient data to train NMT models. In this study, we create data for North Korean NMT models using a comparable corpus. First, we manually create evaluation data for automatic alignment and machine translation. Then, we investigate automatic alignment methods suitable for North Korean data. Finally, we verify that a model trained using North Korean bilingual data without human annotation can significantly increase North Korean translation accuracy compared to existing South Korean models in zero-shot settings.


Pages: 6711-6718

Page count: 8
