A method for the construction of a probabilistic hierarchical structure based on a statistical analysis of a large-scale corpus

被引:1
|
作者
Terai, Asuka [1 ]
Bin Liu [2 ]
Nakagawa, Masanori [1 ]
机构
[1] Tokyo Inst Technol, Meguro Ku, Ookayama 2-12-1, Tokyo 152, Japan
[2] Nissay Informat Technol Co Ltd, Tokyo, Japan
关键词
D O I
10.1109/ICSC.2007.60
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The purpose of this study is to develop a method of constructing a probabilistic hierarchical structure based on a statistical analysis of a Japanese corpus using a combination of Kameya and Sato's statistical language analysis(7) and Rose's model(10). First, the co-occurrence frequencies of adjectives and nouns are calculated from a Japanese corpus based on modification relations. Second, latent classes are extracted from a statistical language analysis of the co-occurrence data. Third, the centroid vectors of the latent classes are calculated from the analysis results and a probabilistic hierarchical structure of the latent classes is constructed by utilizing Rose's model. Finally, the conditional probabilities of the categories given the latent classes are computed as the association probabilities of the concepts to the categories and the conditional probabilities of the categories given the concepts are computed as the association probabilities of the concepts to the categories.
引用
收藏
页码:129 / +
页数:2
相关论文
共 50 条
  • [31] SSL: A Surrogate-Based Method for Large-Scale Statistical Latency Measurement
    Zhang, Xu
    Yin, Hao
    Wu, Dapeng Oliver
    Huang, Haojun
    Min, Geyong
    Zhang, Ying
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2020, 13 (05) : 958 - 968
  • [32] Empirical analysis of a large-scale hierarchical storage system
    Yu, Weikuan
    Oral, H. Sarp
    Canon, R. Shane
    Vetter, Jeffrey S.
    Sankaran, Ramanan
    EURO-PAR 2008 PARALLEL PROCESSING, PROCEEDINGS, 2008, 5168 : 130 - 140
  • [33] ANALYSIS OF THE LARGE-SCALE STRUCTURE OF THE UNIVERSE
    DOROSHKEVICH, AG
    KOTOK, EV
    SHANDARIN, SF
    SIGOV, YS
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 1983, 202 (02) : 537 - 552
  • [34] The Research on Automatic Construction Techniques of Large-scale Corpus for Chinese Text Categorization
    Hu, Yan
    Wu, Wei
    Miao, Miao
    IEEC 2009: FIRST INTERNATIONAL SYMPOSIUM ON INFORMATION ENGINEERING AND ELECTRONIC COMMERCE, PROCEEDINGS, 2009, : 640 - 645
  • [35] A Large-Scale Corpus for Conversation Disentanglement
    Kummerfeld, Jonathan K.
    Athreya, Vignesh
    Patel, Siva Sankalp
    Gouravajhala, Sai R.
    Gunasekara, Chulaka
    Polymenakos, Lazaros
    Peper, Joseph J.
    Ganhotra, Jatin
    Lasecki, Walter S.
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3846 - 3856
  • [36] A Corpus for Large-Scale Phonetic Typology
    Salesky, Elizabeth
    Chodroff, Eleanor
    Pimentel, Tiago
    Wiesner, Matthew
    Cotterell, Ryan
    Black, Alan W.
    Eisner, Jason
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 4526 - 4546
  • [37] ON A NEW HIERARCHICAL STRUCTURE FOR LARGE-SCALE SYSTEMS DESIGN.
    Bakule, L.
    Problems of control and information theory, 1982, 11 (05): : 379 - 387
  • [38] A HIERARCHICAL METHOD FOR LARGE-SCALE TWO-DIMENSIONAL LAYOUT
    LAM, KP
    JOURNAL OF MECHANISMS TRANSMISSIONS AND AUTOMATION IN DESIGN-TRANSACTIONS OF THE ASME, 1983, 105 (02): : 242 - 248
  • [39] Large-Scale Full Wave Analysis of Electromagnetic Field by Hierarchical Domain Decomposition Method
    Takei, A.
    Yoshimura, S.
    Kanayama, H.
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2009, 40 (01): : 63 - 81
  • [40] Large-scale analysis of high frequency electromagnetic field by hierarchical domain decomposition method
    Department of Quantum Engineering and Systems Science, University of Tokyo, 7-3-1, Bunkyo-ku, Tokyo 113-8656, Japan
    不详
    IEEJ Trans. Fundam. Mater., 2008, 9 (591-597+3):