Hybrid unstructured text features for meta-heuristic assisted deep CNN-based hierarchical clustering

被引:0
|
作者
Jyothi, Bankapalli [1 ]
Sumalatha, L. [2 ]
Eluri, Suneetha [1 ]
机构
[1] JNTUK Kakinada, Comp Sci & Engn, Kakinada, Andhra Pradesh, India
[2] Jawaharlal Nehru Technol Univ, Comp Sci & Engn, Hyderabad, Telangana, India
来源
关键词
Unstructured data; text clustering; feature extraction; optimal feature selection; deep CNN-based hierarchical clustering; hybrid sea lion grasshopper optimization; ALGORITHM; MODEL;
D O I
10.3233/IDT-220201
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The text clustering model becomes an essential process to sort the unstructured text data in an appropriate format. But, it does not give the pave for extracting the information to facilitate the document representation. In today's date, it becomes crucial to retrieve the relevant text data. Mostly, the data comprises an unstructured text format that it is difficult to categorize the data. The major intention of this work is to implement a new text clustering model of unstructured data using classifier approaches. At first, the unstructured data is taken from standard benchmark datasets focusing on both English and Telugu languages. The collected text data is then given to the pre-processing stage. The pre-processed data is fed into the model of the feature extraction stage 1, in which the GloVe embedding technique is used for extracting text features. Similarly, in the feature extraction stage 2, the pre-processed data is used to extract the deep text features using Text Convolutional Neural Network (Text CNN). Then, the text features from Stage 1 and deep features from Stage 2 are all together and employed for optimal feature selection using the Hybrid Sea Lion Grasshopper Optimization (HSLnGO), where the traditional SLnO is superimposed with GOA. Finally, the text clustering is processed with the help of Deep CNN-assisted hierarchical clustering, where the parameter optimization is done to improve the clustering performance using HSLnGO. Thus, the simulation findings illustrate that the framework yields impressive performance of text classification in contrast with other techniques while implementing the unstructured text data using different quantitative measures.
引用
收藏
页码:1323 / 1350
页数:28
相关论文
共 50 条
  • [1] An adaptive meta-heuristic for music plagiarism detection based on text similarity and clustering
    Malandrino, Delfina
    De Prisco, Roberto
    Ianulardo, Mario
    Zaccagnino, Rocco
    DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 36 (04) : 1301 - 1334
  • [2] An adaptive meta-heuristic for music plagiarism detection based on text similarity and clustering
    Delfina Malandrino
    Roberto De Prisco
    Mario Ianulardo
    Rocco Zaccagnino
    Data Mining and Knowledge Discovery, 2022, 36 : 1301 - 1334
  • [3] Intelligent deep learning-based hierarchical clustering for unstructured text data
    Jyothi, Bankapalli
    Lingamgunta, Sumalatha
    Eluri, Suneetha
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (28):
  • [4] Advances in Meta-Heuristic Optimization Algorithms in Big Data Text Clustering
    Abualigah, Laith
    Gandomi, Amir H.
    Elaziz, Mohamed Abd
    Hamad, Husam Al
    Omari, Mahmoud
    Alshinwan, Mohammad
    Khasawneh, Ahmad M.
    ELECTRONICS, 2021, 10 (02) : 1 - 29
  • [5] Hybrid meta-heuristic algorithm based deep neural network for face recognition
    Soni, Neha
    Sharma, Enakshi Khular
    Kapoor, Amita
    JOURNAL OF COMPUTATIONAL SCIENCE, 2021, 51 (51)
  • [6] Meta-Heuristic Optimized Hybrid Wavelet Features for Arrhythmia Classification
    Deepa, S. R.
    Subramoniam, M.
    Swarnalatha, R.
    Poornapushpakala, S.
    Barani, S.
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 37 (01): : 745 - 761
  • [7] Improved Meta-Heuristic Model for Text Document Clustering by Adaptive Weighted Similarity
    Venkanna, Gugulothu
    Bharati, K. F.
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2023, 31 (05) : 749 - 771
  • [8] Content-based medical image retrieval using deep learning-based features and hybrid meta-heuristic optimization
    Shetty, Rani
    Bhat, Vandana S.
    Pujari, Jagadeesh
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 92
  • [9] A Meta-heuristic Based Clustering Mechanism for Wireless Sensor Networks
    Krishna, M. P. Nidhish
    Abirami, K.
    ADVANCES IN COMPUTING AND DATA SCIENCES (ICACDS 2022), PT II, 2022, 1614 : 332 - 345
  • [10] Intelligent accounting optimization method based on meta-heuristic algorithm and CNN
    Dong, Yanrui
    PEERJ COMPUTER SCIENCE, 2024, 10