Orthographic representation and variation within the Japanese writing system: Some corpus-based observations

Cited by: 8
Authors
Joyce, Terry [1]
Hodoscek, Bor [2]
Nishina, Kikuko [3]
Affiliations
[1] Tama Univ, Sch Global Studies, 802 Engyo, Fujisawa, Kanagawa 2520805, Japan
[2] Tokyo Inst Technol, Dept Human Syst Sci, Meguro Ku, Tokyo 1528552, Japan
[3] Tokyo Inst Technol, Setagaya Ku, Tokyo 1540014, Japan
Source
WRITTEN LANGUAGE AND LITERACY | 2012 / Vol. 15 / Issue 02
Keywords
Japanese; Balanced Corpus of Contemporary Written Japanese (BCCWJ); kanji; hiragana; katakana; orthographic variation; UniDic;
DOI
10.1075/wll.15.2.07joy
Chinese Library Classification (CLC) number
H [Language, Writing];
Discipline classification code
05;
Abstract
Given its multi-scriptal nature, the Japanese writing system can potentially yield some important insights into the complex relationships that can exist between units of language and units of writing. This paper discusses some of the difficult issues surrounding the notions of orthographic representation and variation within the Japanese writing system, as seen from the perspective of creating word lists based on the Kokuritsu Kokugo Kenkyujo's 'Balanced Corpus of Contemporary Written Japanese' (BCCWJ) Project. More specifically, the paper (i) reflects on the treatment of lemmas within UniDic, the morphological analyzer dictionary developed for the project, (ii) notes some concerns for extracting word lists that stem from the project's approach towards defining orthographic words which draws on its conceptualization of short and long unit words, and (iii) attempts to quantify the extent of orthographic variation within the Japanese writing system as represented by the BCCWJ.
Pages: 254-278
Number of pages: 25