A method to build a super small but practically accurate language model for handheld devices

被引:9
|
作者
Wu, GQ [1 ]
Zheng, F [1 ]
机构
[1] Tsinghua Univ, Ctr Speech Technol, State Key Lab Intelligent Technol & Syst, Dept Comp Sci & Technol, Beijing 100084, Peoples R China
关键词
language model; language model compression; piecewise linear warping; rank-based quantization;
D O I
10.1007/BF02945463
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, an important question, whether a small language model can be practically accurate enough, is raised. Afterwards, the purpose of a language model, the problems that a language model faces, and the factors that affect the performance of a language model, are analyzed. Finally, a novel method for language model compression is proposed, which makes the large language model usable for applications in handheld devices, such as mobiles, smart phones, personal digital assistants (PDAs), and handheld personal computers (HPCs). In the proposed language model compression method, three aspects are included. First, the language model parameters are analyzed and a criterion based on the importance measure of n-grams is used to determine which n-grams should be kept and which removed. Second, a piecewise linear warping method is proposed to be used to compress the uni-gram count values in the full language model. And third, a rank-based quantization method is adopted to quantize the bi-gram probability values. Experiments show that by using this compression method the language model can be reduced dramatically to only about 1M bytes while the performance almost does not decrease. This provides good evidence that a language model compressed by means of a well-designed compression technique is practically accurate enough, and it makes the language model usable in handheld devices.
引用
收藏
页码:747 / 755
页数:9
相关论文
共 15 条
  • [1] A method to build a super small but practically accurate language model for handheld devices
    GenQing Wu
    Fang Zheng
    Journal of Computer Science and Technology, 2003, 18 : 747 - 755
  • [2] Simple, accurate method for characterising two-port devices with small reflections
    Wan, CH
    ELECTRONICS LETTERS, 1998, 34 (18) : 1761 - 1763
  • [3] An accurate parameter extraction method for small signal model of CNFET
    Wang, Jinye
    Liu, Jun
    Chen, Zhanfei
    Zhang, Tingting
    INTERNATIONAL JOURNAL OF NUMERICAL MODELLING-ELECTRONIC NETWORKS DEVICES AND FIELDS, 2021, 34 (05)
  • [4] An Accurate Parameter Extraction Method for RF LDMOSFET Small-Signal Model
    Song, Wenna
    Fu, Jun
    Wang, Yudong
    Zhou, Wei
    Zhang, Wei
    Cui, Jie
    Zhao, Yue
    Li, Gaoqing
    Liu, Zhihong
    2015 IEEE INTERNATIONAL WIRELESS SYMPOSIUM (IWS 2015), 2015,
  • [5] A new small signal model parameter extraction method applied to GaN devices
    Jarndal, A
    Kompa, G
    2005 IEEE MTT-S INTERNATIONAL MICROWAVE SYMPOSIUM, VOLS 1-4, 2005, : 1423 - 1426
  • [6] SC-Phi2: A Fine-Tuned Small Language Model for StarCraft II Build Order Prediction
    Khan, Muhammad Junaid
    Sukthankar, Gita
    AI, 2024, 5 (04) : 2338 - 2352
  • [7] Accurate language achievement prediction method based on multi-model ensemble using personality factors
    Lin, Yuping
    Song, Panpan
    Long, Hong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (11) : 17415 - 17428
  • [8] Accurate language achievement prediction method based on multi-model ensemble using personality factors
    Yuping Lin
    Panpan Song
    Hong Long
    Multimedia Tools and Applications, 2021, 80 : 17415 - 17428
  • [9] Noisy practical facial super-resolution method via deformable constrained model with small dataset
    Liang Chen
    Qing Li
    Junjun Jiang
    Multimedia Tools and Applications, 2020, 79 : 2577 - 2600
  • [10] Noisy practical facial super-resolution method via deformable constrained model with small dataset
    Chen, Liang
    Li, Qing
    Jiang, Junjun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (3-4) : 2577 - 2600