Orthographic representation and variation within the Japanese writing system Some corpus-based observations

被引:8
|
作者
Joyce, Terry [1 ]
Hodoscek, Bor [2 ]
Nishina, Kikuko [3 ]
机构
[1] Tama Univ, Sch Global Studies, 802 Engyo, Fujisawa, Kanagawa 2520805, Japan
[2] Tokyo Inst Technol, Dept Human Syst Sci, Meguro Ku, Tokyo 1528552, Japan
[3] Tokyo Inst Technol, Setagaya Ku, Tokyo 1540014, Japan
来源
WRITTEN LANGUAGE AND LITERACY | 2012年 / 15卷 / 02期
关键词
Japanese; Balanced Corpus of Contemporary Written Japanese (BCCWJ); kanji; hiragana; katakana; orthographic variation; UniDic;
D O I
10.1075/wll.15.2.07joy
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
Given its multi-scriptal nature, the Japanese writing system can potentially yield some important insights into the complex relationships that can exist between units of language and units of writing. This paper discusses some of the difficult issues surrounding the notions of orthographic representation and variation within the Japanese writing system, as seen from the perspective of creating word lists based on the Kokuritsu Kokugo Kenkyujo's 'Balanced Corpus of Contemporary Written Japanese' (BCCWJ) Project. More specifically, the paper (i) reflects on the treatment of lemmas within UniDic, the morphological analyzer dictionary developed for the project, (ii) notes some concerns for extracting word lists that stem from the project's approach towards defining orthographic words which draws on its conceptualization of short and long unit words, and (iii) attempts to quantify the extent of orthographic variation within the Japanese writing system as represented by the BCCWJ.
引用
收藏
页码:254 / 278
页数:25
相关论文
共 50 条
  • [31] Productive Vocabulary Development in EFL Writing: A Corpus-based Study
    Shao, Changzhong
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ELECTRONICS, MECHANICS, CULTURE AND MEDICINE, 2016, 45 : 123 - 129
  • [32] Orthographic variation as evidence for the development of the Linear B writing system
    Judson, Anna P.
    WRITTEN LANGUAGE AND LITERACY, 2019, 22 (02): : 179 - 197
  • [33] ATTITUDE and Identity Categorizations: A Corpus-based Study of Gender Representation
    Bakar, Kesumawati A.
    INTERNATIONAL CONFERENCE ON EDUCATION & EDUCATIONAL PSYCHOLOGY 2013 (ICEEPSY 2013), 2014, 112 : 747 - 756
  • [34] Learn to blend in!: A corpus-based analysis of the representation of women in mining
    Norberg, Cathrine
    Faltholm, Ylva
    EQUALITY DIVERSITY AND INCLUSION, 2018, 37 (07): : 698 - 712
  • [35] Corpus-based analysis of Japanese-English emotional expressions
    Minato, Junko
    Matsumoto, Kazuyuki
    Ren, Fuji
    Kuroiwa, Shingo
    PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (NLP-KE'07), 2007, : 413 - +
  • [36] A corpus-based study of geminate devoicing in Japanese: linguistic factors
    Kawahara, Shigeto
    Sano, Shin-ichiro
    LANGUAGE SCIENCES, 2013, 40 : 300 - 307
  • [37] Advances in Corpus-Based Research on Academic Writing: Effects of Discipline, Register, and Writing Expertise
    Goulart, Larissa
    CORPORA, 2021, 16 (01) : 157 - 159
  • [38] A corpus-based speech synthesis system with emotion
    Iida, A
    Campbell, N
    Higuchi, F
    Yasumura, M
    SPEECH COMMUNICATION, 2003, 40 (1-2) : 161 - 187
  • [39] A corpus-based speech synthesis system for Uyghur
    Silamu, Wushour
    Tursun, Nasirjan
    Tursun, Mamateli
    RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 373 - 376
  • [40] Corpus-based typology: applications, challenges and some solutions
    Levshina, Natalia
    LINGUISTIC TYPOLOGY, 2022, 26 (01) : 129 - 160