Random clusterings for language modeling

被引:0
|
作者
Emami, A [1 ]
Jelinek, F [1 ]
机构
[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present an application of randomization techniques to class-based n-gram language models. The idea is to derive a language model from the combination of a set of random class-based models. Each of the constituent random class-based models is built using a separate clustering obtained via a different run of a randomized clustering algorithm. The random class-based model can compensate for some of the shortcomings of conventional class-based models by combining the different solutions obtained through random clusterings. Experimental results show that the combined random class-based model improves considerably in perplexity (PPL) and word error rate (WER) over both the n-gram and baseline class-based models.
引用
收藏
页码:581 / 584
页数:4
相关论文
共 50 条
  • [21] Explaining clusterings of process instances
    Pieter De Koninck
    Jochen De Weerdt
    Seppe K. L. M. vanden Broucke
    Data Mining and Knowledge Discovery, 2017, 31 : 774 - 808
  • [22] Weighted mean of a pair of clusterings
    Franek, Lucas
    Jiang, Xiaoyi
    He, Changzheng
    PATTERN ANALYSIS AND APPLICATIONS, 2014, 17 (01) : 153 - 166
  • [23] RAPIDITY CLUSTERINGS IN HADRONIC PRODUCTION
    TAN, CI
    PHYSICAL REVIEW D, 1973, 8 (03): : 935 - 945
  • [24] Learning Multiple Nonredundant Clusterings
    Cui, Ying
    Fern, Xiaoli Z.
    Dy, Jennifer G.
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2010, 4 (03)
  • [25] Detecting alternative graph clusterings
    Mandala, Supreet
    Kumara, Soundar
    Yao, Tao
    PHYSICAL REVIEW E, 2012, 86 (01):
  • [26] Finding Multiple Stable Clusterings
    Hu, Juhua
    Qian, Qi
    Pei, Jian
    Jin, Rong
    Zhu, Shenghuo
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 171 - 180
  • [27] Hierarchy cost of hierarchical clusterings
    Bock, Felix
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2022, 44 (01) : 617 - 634
  • [28] Multiple Independent Subspace Clusterings
    Wang, Xing
    Wang, Jun
    Domeniconi, Carlotta
    Yu, Guoxian
    Xiao, Guoqiang
    Guo, Maozu
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 5353 - 5360
  • [29] Exploiting syntactic, semantic and lexical regularities in language modeling via directed Markov random fields
    Wang, SJ
    Wang, SM
    Greiner, R
    Schuurmans, D
    Cheng, L
    2004 INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2004, : 305 - 308
  • [30] Large-Scale Language Modeling with Random Forests for Mandarin Chinese Speech-to-Text
    Oparin, Ilya
    Lamel, Lori
    Gauvain, Jean-Luc
    ADVANCES IN NATURAL LANGUAGE PROCESSING, 2010, 6233 : 269 - 280