Random clusterings for language modeling

被引:0
|
作者
Emami, A [1 ]
Jelinek, F [1 ]
机构
[1] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present an application of randomization techniques to class-based n-gram language models. The idea is to derive a language model from the combination of a set of random class-based models. Each of the constituent random class-based models is built using a separate clustering obtained via a different run of a randomized clustering algorithm. The random class-based model can compensate for some of the shortcomings of conventional class-based models by combining the different solutions obtained through random clusterings. Experimental results show that the combined random class-based model improves considerably in perplexity (PPL) and word error rate (WER) over both the n-gram and baseline class-based models.
引用
收藏
页码:581 / 584
页数:4
相关论文
共 50 条
  • [41] Multiple Co-Clusterings
    Wang, Xing
    Yu, Guoxian
    Domeniconi, Carlotta
    Wang, Jun
    Yu, Zhiwen
    Zhang, Zili
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 1308 - 1313
  • [42] On clusterings: Good, bad and spectral
    Kannan, R
    Vempala, S
    Vetta, A
    JOURNAL OF THE ACM, 2004, 51 (03) : 497 - 515
  • [43] Combining multiple weak clusterings
    Topchy, A
    Jain, AK
    Punch, W
    THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, : 331 - 338
  • [44] Engineering comparators for graph clusterings
    Delling, Daniel
    Gaertler, Marco
    Goerke, Robert
    Wagner, Dorothea
    ALGORITHMIC ASPECTS IN INFORMATION AND MANAGEMENT, PROCEEDINGS, 2008, 5034 : 131 - 142
  • [45] Comparing Hard and Overlapping Clusterings
    Horta, Danilo
    Campello, Ricardo J. G. B.
    JOURNAL OF MACHINE LEARNING RESEARCH, 2015, 16 : 2949 - 2997
  • [46] Comparing clusterings by the variation of information
    Meila, M
    LEARNING THEORY AND KERNEL MACHINES, 2003, 2777 : 173 - 187
  • [47] Modeling random walkers on growing random networks
    Ross, Robert J. H.
    Fontana, Walter
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2019, 526
  • [48] CLICOM: Cliques for combining multiple clusterings
    Mimaroglu, Selim
    Yagci, Murat
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (02) : 1889 - 1901
  • [49] Inertial Entropy and External Validation of Clusterings
    Dan Simovici
    Joshua Yee
    Journal of Harbin Institute of Technology(New Series), 2024, 31 (05) : 41 - 54
  • [50] Modeling and pricing with a random walk in random environment
    Castro, Isabel
    Pacheco, Carlos G.
    INTERNATIONAL JOURNAL OF FINANCIAL ENGINEERING, 2020, 7 (04)