Construction of a Probabilistic hierarchical structure based on a Japanese corpus and a Japanese thesaurus

Cited by: 0
Authors
Terai, Asuka [1 ]
Liu, Bin [2 ]
Nakagawa, Masanori [1 ]
Affiliations
[1] Tokyo Inst Technol, Meguro Ku, 2-12-1 Ookayama, Tokyo 152, Japan
[2] Nissay Informat Technol Co Ltd, Tokyo, Japan
Funding
Japan Society for the Promotion of Science;
Keywords
DOI
10.1007/978-3-540-78159-2_13
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The purpose of this study is to construct a probabilistic hierarchical structure of categories based on a statistical analysis of Japanese corpus data, and to verify the validity of that structure through a psychological experiment. First, the co-occurrence frequencies of adjectives and nouns within modification relations were extracted from a Japanese corpus. Second, a probabilistic hierarchical structure was constructed based on the probability P(category | noun), which represents the category membership of the nouns. The construction used categorization information from a thesaurus together with a soft clustering method (Rose's method [1]), with co-occurrence frequencies as initial values. This method makes it possible to identify the constructed hierarchical structure. To examine the validity of the constructed hierarchy, a psychological experiment was conducted. The results of the experiment verified the psychological validity of the hierarchical structure.
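The abstract does not give the details of the procedure, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes that each noun is represented by its normalized adjective co-occurrence counts, that thesaurus categories supply the initial cluster centres, and that P(category | noun) is a Gibbs-style soft assignment at a single fixed temperature (the annealing schedule that would produce successive levels of a hierarchy is omitted). All names, the toy counts, and the temperature value are hypothetical.

```python
import math

def normalize_rows(counts):
    """Turn each noun's adjective counts into relative frequencies."""
    return {noun: [c / sum(row) for c in row] for noun, row in counts.items()}

def soft_memberships(vectors, centroids, temperature):
    """P(category | noun) proportional to exp(-squared distance / temperature)."""
    memberships = {}
    for noun, vec in vectors.items():
        weights = {}
        for cat, centre in centroids.items():
            dist = sum((v - m) ** 2 for v, m in zip(vec, centre))
            weights[cat] = math.exp(-dist / temperature)
        z = sum(weights.values())
        memberships[noun] = {cat: w / z for cat, w in weights.items()}
    return memberships

def update_centroids(vectors, memberships, categories, dim):
    """Each centroid becomes the membership-weighted mean of all noun vectors."""
    centroids = {}
    for cat in categories:
        total = sum(memberships[n][cat] for n in vectors)
        centroids[cat] = [
            sum(memberships[n][cat] * vectors[n][j] for n in vectors) / total
            for j in range(dim)
        ]
    return centroids

def soft_cluster(counts, thesaurus, temperature=0.05, iterations=20):
    """Alternate soft assignment and centroid updates at a fixed temperature."""
    vectors = normalize_rows(counts)
    dim = len(next(iter(vectors.values())))
    # Assumption: initial centroids are the means of the nouns that the
    # thesaurus places in each category.
    centroids = {
        cat: [sum(vectors[n][j] for n in nouns) / len(nouns) for j in range(dim)]
        for cat, nouns in thesaurus.items()
    }
    for _ in range(iterations):
        memberships = soft_memberships(vectors, centroids, temperature)
        centroids = update_centroids(vectors, memberships, list(thesaurus), dim)
    return soft_memberships(vectors, centroids, temperature)

# Hypothetical toy data: counts of (sweet, fast, red) modifying each noun.
counts = {
    "apple": [8, 0, 6],
    "cake": [9, 0, 1],
    "car": [0, 9, 4],
    "train": [0, 8, 1],
}
thesaurus = {"food": ["apple", "cake"], "vehicle": ["car", "train"]}

memberships = soft_cluster(counts, thesaurus)
for noun, probs in memberships.items():
    print(noun, {cat: round(p, 3) for cat, p in probs.items()})
```

In this sketch, lowering the temperature makes the memberships sharper, while raising it blurs the categories together; a deterministic-annealing procedure would sweep the temperature rather than fix it.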
Pages: 132 / +
Number of pages: 3