A method to build a super small but practically accurate language model for handheld devices

被引：9

作者：

Wu, GQ ^{[1
]}

Zheng, F ^{[1
]}

机构：

[1] Tsinghua Univ, Ctr Speech Technol, State Key Lab Intelligent Technol & Syst, Dept Comp Sci & Technol, Beijing 100084, Peoples R China

来源：

JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY | 2003年 / 18卷 / 06期

关键词：

language model; language model compression; piecewise linear warping; rank-based quantization;

D O I：

10.1007/BF02945463

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, an important question, whether a small language model can be practically accurate enough, is raised. Afterwards, the purpose of a language model, the problems that a language model faces, and the factors that affect the performance of a language model, are analyzed. Finally, a novel method for language model compression is proposed, which makes the large language model usable for applications in handheld devices, such as mobiles, smart phones, personal digital assistants (PDAs), and handheld personal computers (HPCs). In the proposed language model compression method, three aspects are included. First, the language model parameters are analyzed and a criterion based on the importance measure of n-grams is used to determine which n-grams should be kept and which removed. Second, a piecewise linear warping method is proposed to be used to compress the uni-gram count values in the full language model. And third, a rank-based quantization method is adopted to quantize the bi-gram probability values. Experiments show that by using this compression method the language model can be reduced dramatically to only about 1M bytes while the performance almost does not decrease. This provides good evidence that a language model compressed by means of a well-designed compression technique is practically accurate enough, and it makes the language model usable in handheld devices.

引用

页码：747 / 755

页数：9

共 15 条

[1] A method to build a super small but practically accurate language model for handheld devices
GenQing Wu
Fang Zheng
Journal of Computer Science and Technology, 2003, 18 : 747 - 755
[2] Simple, accurate method for characterising two-port devices with small reflections
Wan, CH
ELECTRONICS LETTERS, 1998, 34 (18) : 1761 - 1763
[3] An accurate parameter extraction method for small signal model of CNFET
Wang, Jinye
Liu, Jun
Chen, Zhanfei
Zhang, Tingting
INTERNATIONAL JOURNAL OF NUMERICAL MODELLING-ELECTRONIC NETWORKS DEVICES AND FIELDS, 2021, 34 (05)
[4] An Accurate Parameter Extraction Method for RF LDMOSFET Small-Signal Model
Song, Wenna
Fu, Jun
Wang, Yudong
Zhou, Wei
Zhang, Wei
Cui, Jie
Zhao, Yue
Li, Gaoqing
Liu, Zhihong
2015 IEEE INTERNATIONAL WIRELESS SYMPOSIUM (IWS 2015), 2015,
[5] A new small signal model parameter extraction method applied to GaN devices
Jarndal, A
Kompa, G
2005 IEEE MTT-S INTERNATIONAL MICROWAVE SYMPOSIUM, VOLS 1-4, 2005, : 1423 - 1426
[6] SC-Phi2: A Fine-Tuned Small Language Model for StarCraft II Build Order Prediction
Khan, Muhammad Junaid
Sukthankar, Gita
AI, 2024, 5 (04) : 2338 - 2352
[7] Accurate language achievement prediction method based on multi-model ensemble using personality factors
Lin, Yuping
Song, Panpan
Long, Hong
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (11) : 17415 - 17428
[8] Accurate language achievement prediction method based on multi-model ensemble using personality factors
Yuping Lin
Panpan Song
Hong Long
Multimedia Tools and Applications, 2021, 80 : 17415 - 17428
[9] Noisy practical facial super-resolution method via deformable constrained model with small dataset
Liang Chen
Qing Li
Junjun Jiang
Multimedia Tools and Applications, 2020, 79 : 2577 - 2600
[10] Noisy practical facial super-resolution method via deformable constrained model with small dataset
Chen, Liang
Li, Qing
Jiang, Junjun
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (3-4) : 2577 - 2600

← 1 2 →