A method for the construction of a probabilistic hierarchical structure based on a statistical analysis of a large-scale corpus

被引:1
|
作者
Terai, Asuka [1 ]
Bin Liu [2 ]
Nakagawa, Masanori [1 ]
机构
[1] Tokyo Inst Technol, Meguro Ku, Ookayama 2-12-1, Tokyo 152, Japan
[2] Nissay Informat Technol Co Ltd, Tokyo, Japan
关键词
D O I
10.1109/ICSC.2007.60
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The purpose of this study is to develop a method of constructing a probabilistic hierarchical structure based on a statistical analysis of a Japanese corpus using a combination of Kameya and Sato's statistical language analysis(7) and Rose's model(10). First, the co-occurrence frequencies of adjectives and nouns are calculated from a Japanese corpus based on modification relations. Second, latent classes are extracted from a statistical language analysis of the co-occurrence data. Third, the centroid vectors of the latent classes are calculated from the analysis results and a probabilistic hierarchical structure of the latent classes is constructed by utilizing Rose's model. Finally, the conditional probabilities of the categories given the latent classes are computed as the association probabilities of the concepts to the categories and the conditional probabilities of the categories given the concepts are computed as the association probabilities of the concepts to the categories.
引用
收藏
页码:129 / +
页数:2
相关论文
共 50 条
  • [1] Construction of a Probabilistic hierarchical structure based on a Japanese corpus and a Japanese thesaurus
    Terai, Asuka
    Liu, Bin
    Nakagawa, Masanori
    LARGE-SCALE KNOWLEDGE RESOURCES: CONSTRUCTION AND APPLICATION, 2008, 4938 : 132 - +
  • [2] Developing a Hierarchical Road Layout Method for Large-Scale Construction Site
    Chen, Tian
    Hu, Hao
    Zhu, Fengfeng
    JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT, 2020, 146 (09)
  • [3] Statistical Analysis for Large-Scale Hierarchical Networks Using Network Coding
    Chang, Shih Yu
    Wu, Hsiao-Chun
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2011, 60 (05) : 2152 - 2163
  • [4] Construction of Adverbial-Verb Collocation Database Based on Large-Scale Corpus
    Xing, Dan
    Xun, Endong
    Wang, Chengwen
    Rao, Gaoqi
    Ma, Luyao
    CHINESE LEXICAL SEMANTICS (CLSW 2019), 2020, 11831 : 585 - 595
  • [5] A regularity-based hierarchical symbolic analysis method for large-scale analog networks
    Doboli, A
    Vemuri, R
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2001, 48 (11) : 1054 - 1068
  • [6] A regularity-based hierarchical symbolic analysis method for large-scale analog networks
    Doboli, A
    Vemuri, R
    DESIGN, AUTOMATION AND TEST IN EUROPE, CONFERENCE AND EXHIBITION 2001, PROCEEDINGS, 2001, : 806 - 806
  • [7] Distributed monitoring for large-scale processes based on multivariate statistical analysis and Bayesian method
    Jiang, Qingchao
    Huang, Biao
    JOURNAL OF PROCESS CONTROL, 2016, 46 : 75 - 83
  • [8] BIASING AND HIERARCHICAL STATISTICS IN LARGE-SCALE STRUCTURE
    FRY, JN
    GAZTANAGA, E
    ASTROPHYSICAL JOURNAL, 1993, 413 (02): : 447 - 452
  • [9] A Solution to the Problems in Large-Scale Corpus Construction for Police Translation
    Hao, Ding
    PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL CONFERENCE ON LAW AND LANGUAGE OF THE INTERNATIONAL ACADEMY OF LINGUISTIC LAW (IALL2017): LAW, LANGUAGE AND JUSTICE, 2017, : 232 - 239
  • [10] Construction of a large-scale Japanese ASR corpus on TV recordings
    Ando, Shintaro
    Fujihara, Hiromasa
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2021, 2021-June : 6948 - 6952