Evaluation of Persian Text Based on Huffman Data Compression

被引:0
|
作者
Jalilian, Omid [1 ]
Haghighat, Abolfazl Toroghi [2 ]
Rezvanian, Alireza [1 ]
机构
[1] Islamic Azad Univ, Hamedan Branch, Tehran, Iran
[2] Islamic Azad Univ, Qazvin Branch, Tehran, Iran
关键词
component; Data mining; Persian language; Persian web; Text compression; Huffman data compression; Persian texts compression;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
According to the growth of information sources in recent years along the web, many of web servers have been dedicated to the information sources storage. Until yet many methods are presented for storing and transforming information on the web in the case of paralleling or processing. But one of the researcher's challenges in derivation and restoring data in data mining and information retrievals are to face to this huge amount of information for storing. One of the solutions of this problem is compression of information resources. Notice that the published statistics, Persian language is one of the oldest and the most diffused languages all around the world and web and also according to its kind of alphabets and variety along the Persian texts, an evaluation on compression for Persian texts will be useful. First of all in this paper variety difficulties and huge amount of information on the web, general aspects of Huffman compression methods are introduced, and also some features of Persian language. The state of choosing Persian texts collections has been investigated and the result of tests in compare with some experimental datasets form Persian, English and Arabic were shown. The experimental results are given at the end of paper.
引用
收藏
页码:180 / +
页数:2
相关论文
共 50 条
  • [41] Optimal selective Huffman coding for test-data compression
    Kavousianos, Xrysovalantis
    Kalligeros, Emmanouil
    Nikolos, Dimitris
    IEEE TRANSACTIONS ON COMPUTERS, 2007, 56 (08) : 1146 - 1152
  • [42] Data Compression Scheme of Dynamic Huffman Code for Different Languages
    Pathak, Shivani
    Singh, Shradha
    Singh, Smita
    Jain, Mamta
    Sharma, Anand
    INFORMATION AND NETWORK TECHNOLOGY, 2011, 4 : 201 - 206
  • [43] Sampled-data audio signal compression with Huffman coding
    Ashida, S
    Kakemizu, H
    Nagahara, M
    Yamamoto, Y
    SICE 2004 ANNUAL CONFERENCE, VOLS 1-3, 2004, : 972 - 976
  • [44] Multimedia data compression storage of sensor network based on improved Huffman coding algorithm in cloud
    Wang, Shuxia
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (47-48) : 35369 - 35382
  • [45] A novel compression scheme based on SMVQ and Huffman coding
    Lin, C.-C. (mhlin3@pu.edu.tw), 1600, ICIC International (10):
  • [46] A NOVEL COMPRESSION SCHEME BASED ON SMVQ AND HUFFMAN CODING
    Chang, Chin-Chen
    Thai-Son Nguyen
    Lin, Chia-Chen
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2014, 10 (03): : 1041 - 1050
  • [47] Micro Distortion Image Compression Based on Huffman Encoding
    Bao, Kai-xuan
    Qian, Qi-pei
    Yang, Ya-xuan
    2020 5TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS (ICCCS 2020), 2020, : 319 - 323
  • [48] Multimedia data compression storage of sensor network based on improved Huffman coding algorithm in cloud
    Shuxia Wang
    Multimedia Tools and Applications, 2020, 79 : 35369 - 35382
  • [49] Improved Huffman coding-based data transmission and compression method for agricultural machinery operation
    Yang, Jingfeng
    Zhang, Nanfeng
    Li, Yong
    Xue, Yueju
    Lü, Wei
    He, Kun
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2014, 30 (13): : 153 - 159
  • [50] FPGA-Based lossless data compression using Huffman and LZ77 algorithms
    Rigler, Suzanne
    Bishop, William
    Kennings, Andrew
    2007 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, 2007, : 1235 - 1238