Text feature extraction based on deep learning: a review

被引:163
|
作者
Liang, Hong [1 ]
Sun, Xiao [1 ]
Sun, Yunlei [1 ]
Gao, Yuan [1 ]
机构
[1] China Univ Petr East China, Coll Comp & Commun Engn, 66 Changjiang West Rd, Qingdao 266580, Peoples R China
关键词
Deep learning; Feature extraction; Text characteristic; Natural language processing; Text mining; FEATURE-SELECTION; DIMENSION REDUCTION; NEURAL-NETWORK; CLASSIFICATION; RECOGNITION;
D O I
10.1186/s13638-017-0993-1
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Selection of text feature item is a basic and important matter for text mining and information retrieval. Traditional methods of feature extraction require handcrafted features. To hand-design, an effective feature is a lengthy process, but aiming at new applications, deep learning enables to acquire new effective feature representation from training data. As a new feature extraction method, deep learning has made achievements in text mining. The major difference between deep learning and conventional methods is that deep learning automatically learns features from big data, instead of adopting handcrafted features, which mainly depends on priori knowledge of designers and is highly impossible to take the advantage of big data. Deep learning can automatically learn feature representation from big data, including millions of parameters. This thesis outlines the common methods used in text feature extraction first, and then expands frequently used deep learning methods in text feature extraction and its applications, and forecasts the application of deep learning in feature extraction.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Palmprint Phenotype Feature Extraction and Classification Based on Deep Learning
    Yu Fan
    Jinxi Li
    Shaoying Song
    Haiguo Zhang
    Sijia Wang
    Guangtao Zhai
    Phenomics, 2022, 2 : 219 - 229
  • [22] Aircarft Signal Feature Extraction and Recognition Based on Deep Learning
    Wang, Guanhua
    Zou, Cong
    Zhang, Chao
    Pan, Changyong
    Song, Jian
    Yang, Fang
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (09) : 9625 - 9634
  • [23] Association Rules Based Feature Extraction for Deep Learning Classification
    Kharsa, Ruba
    Al Aghbari, Zaher
    SOFT COMPUTING AND ITS ENGINEERING APPLICATIONS, ICSOFTCOMP 2022, 2023, 1788 : 72 - 83
  • [24] Signal Feature Extraction of Music Melody Based on Deep Learning
    Jiang, Jinwen
    TRAITEMENT DU SIGNAL, 2022, 39 (06) : 2203 - 2209
  • [25] Deep Web Data Source Classification Based on Text Feature Extension and Extraction
    Li, Yuancheng
    Wu, Guixian
    Wang, Xiaohan
    INFOCOMMUNICATIONS JOURNAL, 2019, 11 (03): : 42 - 49
  • [26] An Entity Recognition Model Based on Deep Learning Fusion of Text Feature
    Shang, Fengjun
    Ran, Chunfu
    INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (02)
  • [27] Deep successor feature learning for text generation
    Xu, Cong
    Li, Qing
    Zhang, Dezheng
    Xie, Yonghong
    Li, Xisheng
    NEUROCOMPUTING, 2020, 396 : 495 - 500
  • [28] Feature Extraction Based on Deep Learning for Some Traditional Machine Learning Methods
    Cayir, Aykut
    Yenidogan, Isil
    Dag, Hasan
    2018 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2018, : 494 - 497
  • [29] Accurate Context Extraction from Unstructured Text Based on Deep Learning
    Mack, Maha
    Guetari, Ramazi
    Fournier, Sebastian
    Chaari, Wided Lejouad
    Espinasse, Bernard
    2022 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WI-IAT, 2022, : 309 - 314
  • [30] Petroleum Exploration and Development Text Triplet Extraction Based on Deep Learning
    Li, Jianli
    Yilahun, Hankiz
    Hamdulla, Askar
    2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 225 - 230