Evaluation of Persian Text Based on Huffman Data Compression

被引:0
|
作者
Jalilian, Omid [1 ]
Haghighat, Abolfazl Toroghi [2 ]
Rezvanian, Alireza [1 ]
机构
[1] Islamic Azad Univ, Hamedan Branch, Tehran, Iran
[2] Islamic Azad Univ, Qazvin Branch, Tehran, Iran
关键词
component; Data mining; Persian language; Persian web; Text compression; Huffman data compression; Persian texts compression;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
According to the growth of information sources in recent years along the web, many of web servers have been dedicated to the information sources storage. Until yet many methods are presented for storing and transforming information on the web in the case of paralleling or processing. But one of the researcher's challenges in derivation and restoring data in data mining and information retrievals are to face to this huge amount of information for storing. One of the solutions of this problem is compression of information resources. Notice that the published statistics, Persian language is one of the oldest and the most diffused languages all around the world and web and also according to its kind of alphabets and variety along the Persian texts, an evaluation on compression for Persian texts will be useful. First of all in this paper variety difficulties and huge amount of information on the web, general aspects of Huffman compression methods are introduced, and also some features of Persian language. The state of choosing Persian texts collections has been investigated and the result of tests in compare with some experimental datasets form Persian, English and Arabic were shown. The experimental results are given at the end of paper.
引用
收藏
页码:180 / +
页数:2
相关论文
共 50 条
  • [31] Test response compression based on Huffman coding
    Ichihara, H
    Shintani, M
    Ohara, T
    Inoue, T
    ATS 2003: 12TH ASIAN TEST SYMPOSIUM, PROCEEDINGS, 2003, : 446 - 449
  • [32] A Combination of Encryption and Compression Algorithm Based on Huffman
    Zhao, Naidong
    Zhang, Runtong
    Han, Ling
    EIGHTH WUHAN INTERNATIONAL CONFERENCE ON E-BUSINESS, VOLS I-III, 2009, : 411 - 415
  • [33] Test data compression based on variable-to-variable Huffman encoding with codeword reusability
    Kavousianos, Xrysovalantis
    Kalligeros, Emmanouil
    Nikolos, Dimitris
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2008, 27 (07) : 1333 - 1338
  • [34] Huffman and Lempel-Ziv based Data Compression Algorithms for Wireless Sensor Networks
    Renugadevi, S.
    Darisini, P. S. Nithya
    2013 INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, INFORMATICS AND MEDICAL ENGINEERING (PRIME), 2013,
  • [35] Text and image compression based on data mining perspective
    Oswald C.
    Sivaselvan B.
    Data Science Journal, 2018, 17
  • [36] Text encryption based on huffman coding and elgamal cryptosystem
    Singh, Khoirom Motilal
    Singh, Laiphrakpam Dolendro
    Tuithung, Themrichon
    Recent Patents on Engineering, 2021, 15 (04)
  • [37] Advanced Encryption Standard (AES)-Based Text Encryption for Near Field Communication (NFC) Using Huffman Compression
    Ajagbe S.A.
    Adeniji O.D.
    Olayiwola A.A.
    Abiona S.F.
    SN Computer Science, 5 (1)
  • [38] An Adaptive Huffman Algorithm for Data Compression in Wireless Sensor Networks
    Sacaleanu, Dragos Ioan
    Stoian, Rodica
    Ofrim, Dragos Mihai
    2011 10TH INTERNATIONAL SYMPOSIUM ON SIGNALS, CIRCUITS AND SYSTEMS (ISSCS), 2011,
  • [39] Development of an Application for Data Compression by Using the Huffman Algorithm.
    Alvarado, Ivan
    Zavala, Alvaro
    Linares, Marvin
    2017 IEEE 37TH CENTRAL AMERICA AND PANAMA CONVENTION (CONCAPAN XXXVII), 2017,
  • [40] DATA-COMPRESSION USING WORD ENCODING WITH HUFFMAN CODE
    LIU, CW
    YU, C
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1991, 42 (09): : 685 - 698