Context-aware text classification system to improve the quality of text: A detailed investigation and techniques

被引:4
|
作者
Saleem, Zeeshan [1 ]
Alhudhaif, Adi [2 ]
Qureshi, Kashif Naseer [1 ]
Jeon, Gwanggil [3 ]
机构
[1] Bahria Univ, Dept Comp Sci, Islamabad, Pakistan
[2] Prince Sattam bin Abdulaziz Univ, Coll Comp Engn & Sci Al Kharj, Dept Comp Sci, Al Kharj, Saudi Arabia
[3] Incheon Natl Univ, Dept Embedded Syst Engn, Incheon, South Korea
来源
关键词
accuracy; algorithm; classification; context-aware; data mining; dataset; methods; computer;
D O I
10.1002/cpe.6489
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Text classification is one of the most important tasks to extract information from the Internet and identifying the best text representation settings. With the increase of data volume on the world wide web, the significance of text classification increases. This situation requires huge human efforts to understand and classify the digital data available on the Internet. Text classification is classifying the number of text files into different classes. The data or text available on the Internet is in an unstructured form which increases the difficulty to understand and classify it for useful purposes. This paper proposes a context-aware text classification system to improve text quality. We use a content-aware recommendation system to extract the data from well-known news databases. Text preprocessing techniques like tokenization, stemming, and stop words removal are studied in detail. Furthermore, unigram, bigram, and trigram attributes are also being tested. Attribute selection methods are also examined and their impact on the text classification results. To carry out a detailed investigation, 11 versions are created of each dataset to save the time in experimentation process and applied the different preprocessing techniques to understand the impact of each technique on classification results. The proposed system is compared with the existing approach to check the accuracy where the proposed system achieved better performance.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] A Conceptual Data Modelling Framework for Context-Aware Text Classification
    Tazeen, Nazia
    Rani, K. Sandhya
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (11) : 124 - 131
  • [2] A Context-Aware Fuzzy Classification Technique for OLAP Text Analysis
    Chakrabarty, Anirban
    Roy, Santanu
    Roy, Sudipta
    RECENT FINDINGS IN INTELLIGENT COMPUTING TECHNIQUES, VOL 3, 2018, 709 : 73 - 85
  • [3] Context-Aware Unsupervised Text Stylization
    Yang, Shuai
    Liu, Jiaying
    Yang, Wenhan
    Guo, Zongming
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1688 - 1696
  • [4] Sentiment and Context-Aware Hybrid DNN With Attention for Text Sentiment Classification
    Khan, Jawad
    Ahmad, Niaz
    Khalid, Shah
    Ali, Farman
    Lee, Youngmoon
    IEEE ACCESS, 2023, 11 : 28162 - 28179
  • [5] A gating context-aware text classification model with BERT and graph convolutional networks
    Gao, Weiqi
    Huang, Hao
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (03) : 4331 - 4343
  • [6] Context Extraction from Reviews for Context Aware Recommendation Using Text Classification Techniques
    Lahlou, Fatima Zahra
    Benbrahimand, Houda
    Mountassir, Asmaa
    Kassou, Ismail
    2013 ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2013,
  • [7] Semantic & Syntactic Context-Aware Text Entry Methods
    Gong, Jun
    ASSETS'07: PROCEEDINGS OF THE NINTH INTERNATIONAL ACM SIGACCESS CONFERENCE ON COMPUTERS AND ACCESSIBILITY, 2007, : 261 - 262
  • [8] A Context-Aware Recommender Method Based on Text Mining
    Sundermann, Camila Vaccari
    de Padua, Renan
    Tonon, Vitor Rodrigues
    Domingues, Marcos Aurelio
    Rezende, Solange Oliveira
    PROGRESS IN ARTIFICIAL INTELLIGENCE, PT II, 2019, 11805 : 385 - 396
  • [9] Learning Context-Aware Convolutional Filters for Text Processing
    Shen, Dinghan
    Min, Martin Renqiang
    Li, Yitong
    Carin, Lawrence
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 1839 - 1848
  • [10] A context-aware index based text extraction framework
    Wu, M.-H. (mhwu@zucc.edu.cn), 1600, Zhejiang University (47):