Effective Pattern Discovery and Dimensionality Reduction for Text Under Text Mining

被引:1
|
作者
Vijayakumar, T. [1 ]
Priya, R. [1 ]
Palanisamy, C. [1 ]
机构
[1] Bannari Amman Inst Technol, Dept Informat Technol, Erode, Tamil Nadu, India
关键词
Text mining; Polysemy; RefixSpan; FP-tree; SPADE; SLPmine; GST;
D O I
10.1007/978-81-322-2135-7_65
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Huge data mining techniques have been used for mining useful pattern in text document. Text mining can be used to extract the data in document. It is effectively use and update the discovered pattern; still the research is not yet completed. The existing approach is term-based approach; they suffer the problem of polysemy and synonymy. In the past years, people have used pattern-based approaches for hypothesis, which perform better than the term-based ones, but many of the experiments do not support this hypothesis. This paper presents a new idea about the effective pattern discovery technique which involved the processes of pattern deploying and pattern evolving, to improve the effectiveness of using and updating discovered patterns for finding relevant and useful information.
引用
收藏
页码:615 / 623
页数:9
相关论文
共 50 条
  • [21] Pattern and Cluster Mining on Text Data
    Agnihotri, Deepak
    Verma, Kesari
    Tripathi, Priyanka
    2014 FOURTH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT), 2014, : 428 - 432
  • [22] Topic discovery based on text mining techniques
    Pons-Porrata, Aurora
    Berlanga-Llavori, Rafael
    Ruiz-Shulcloper, Jose
    INFORMATION PROCESSING & MANAGEMENT, 2007, 43 (03) : 752 - 768
  • [23] Knowledge discovery out of text data: a systematic review via text mining
    Usai, Antonio
    Pironti, Marco
    Mital, Monika
    Mejri, Chiraz Aouina
    JOURNAL OF KNOWLEDGE MANAGEMENT, 2018, 22 (07) : 1471 - 1488
  • [24] Text Dimensionality Reduction with Mutual Information Preserving Mapping
    YANG Zhen
    YAO Fei
    FAN Kefeng
    HUANG Jian
    Chinese Journal of Electronics, 2017, 26 (05) : 919 - 925
  • [25] SDRS: A new lossless dimensionality reduction for text corpora
    Velez de Mendizabal, Inaki
    Basto-Fernandes, Vitor
    Ezpeleta, Enaitz
    Mendez, Jose R.
    Zurutuza, Urko
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (04)
  • [26] Dimensionality Reduction Approach for High Dimensional Text Documents
    Reddy, G. Suresh
    2016 INTERNATIONAL CONFERENCE ON ENGINEERING & MIS (ICEMIS), 2016,
  • [27] A Comparative Approach of Dimensionality Reduction Techniques in Text Classification
    Basha, Shaik Rahamat
    Rani, J. Keziya
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2019, 9 (06) : 4974 - 4979
  • [28] Text Dimensionality Reduction with Mutual Information Preserving Mapping
    Yang Zhen
    Yao Fei
    Fan Kefeng
    Huang Jian
    CHINESE JOURNAL OF ELECTRONICS, 2017, 26 (05) : 919 - 925
  • [29] Dimensionality reduction in text classification using scatter method
    Saarikoski, Jyri
    Laurikkala, Jorma
    Jarvelin, Kalervo
    Siermala, Markku
    Juhola, Martti
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2014, 6 (01) : 1 - 21
  • [30] Text mining for plagiarism detection: Multivariate pattern detection for recognition of text similarities
    Xylogiannopoulos, Konstantinos
    Karampelas, Panagiotis
    Alhajj, Reda
    2018 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2018, : 938 - 945