Context-aware text classification system to improve the quality of text: A detailed investigation and techniques

被引:4
|
作者
Saleem, Zeeshan [1 ]
Alhudhaif, Adi [2 ]
Qureshi, Kashif Naseer [1 ]
Jeon, Gwanggil [3 ]
机构
[1] Bahria Univ, Dept Comp Sci, Islamabad, Pakistan
[2] Prince Sattam bin Abdulaziz Univ, Coll Comp Engn & Sci Al Kharj, Dept Comp Sci, Al Kharj, Saudi Arabia
[3] Incheon Natl Univ, Dept Embedded Syst Engn, Incheon, South Korea
来源
关键词
accuracy; algorithm; classification; context-aware; data mining; dataset; methods; computer;
D O I
10.1002/cpe.6489
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Text classification is one of the most important tasks to extract information from the Internet and identifying the best text representation settings. With the increase of data volume on the world wide web, the significance of text classification increases. This situation requires huge human efforts to understand and classify the digital data available on the Internet. Text classification is classifying the number of text files into different classes. The data or text available on the Internet is in an unstructured form which increases the difficulty to understand and classify it for useful purposes. This paper proposes a context-aware text classification system to improve text quality. We use a content-aware recommendation system to extract the data from well-known news databases. Text preprocessing techniques like tokenization, stemming, and stop words removal are studied in detail. Furthermore, unigram, bigram, and trigram attributes are also being tested. Attribute selection methods are also examined and their impact on the text classification results. To carry out a detailed investigation, 11 versions are created of each dataset to save the time in experimentation process and applied the different preprocessing techniques to understand the impact of each technique on classification results. The proposed system is compared with the existing approach to check the accuracy where the proposed system achieved better performance.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Scene text recognition with context-aware autonomous bidirectional iterative models
    Zhao X.
    Xu M.
    Li Y.
    Huang H.
    Silamu W.
    Journal of Intelligent and Fuzzy Systems, 2024, 46 (04): : 8605 - 8616
  • [32] Bringing Context-Aware Completion Suggestions to Arbitrary Text Entry Interfaces
    Aveni, Timothy J.
    Fox, Armando
    Hartmann, Bjorn
    ADJUNCT PROCEEDINGS OF THE 36TH ANNUAL ACM SYMPOSIUM ON USER INTERFACE SOFTWARE & TECHNOLOGY, UIST 2023 ADJUNCT, 2023,
  • [33] Stacking VAE and GAN for Context-aware Text-to-Image Generation
    Zhang, Chenrui
    Peng, Yuxin
    2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,
  • [34] GCNs-Based Context-Aware Short Text Similarity Model
    Sun, Xiaoqi
    Wu, Shaochun
    Liu, Yue
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 1329 - 1335
  • [35] Context-Aware Confidence Estimation for Rejection in Handwritten Chinese Text Recognition
    Liu, Yangyang
    Chen, Yi
    Yin, Fei
    Liu, Cheng-Lin
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT I, 2024, 14804 : 134 - 151
  • [36] Context-Aware Edge-Cloud Collaborative Scene Text Recognition
    Zhang, Puning
    Liu, Changfeng
    Wang, Honggang
    Wu, Dapeng
    Wang, Ruyan
    Zou, Hong
    2024 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS, ICNC, 2024, : 611 - 617
  • [37] CONTEXT-AWARE PROSODY CORRECTION FOR TEXT-BASED SPEECH EDITING
    Morrison, Max
    Rencker, Lucas
    Jin, Zeyu
    Bryan, Nicholas J.
    Caceres, Juan-Pablo
    Pardo, Bryan
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7038 - 7042
  • [38] Hierarchical Context-Aware Transformers for Non-Autoregressive Text to Speech
    Bae, Jae-Sung
    Bak, Tae-Jun
    Joo, Young-Sun
    Cho, Hoon-Young
    INTERSPEECH 2021, 2021, : 3610 - 3614
  • [39] Text Assisted Insight Ranking Using Context-Aware Memory Network
    Zeng, Qi
    Luo, Liangchen
    Huang, Wenhao
    Tang, Yang
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 427 - 434
  • [40] Context-Aware Text Representation for Social Relation Aided Sentiment Analysis
    Nguyen, Minh Luan
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'16 COMPANION), 2016, : 85 - 86