Automatic transfer learning for short text mining

被引:16
|
作者
Yang, Lei [1 ]
Zhang, Jianpei [1 ]
机构
[1] Harbin Engn Univ, Coll Comp Sci & Technol, Nantong St 145, Harbin 150001, Peoples R China
关键词
Transfer learning; Short text mining; Latent semantic analysis;
D O I
10.1186/s13638-017-0815-5
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As a new emerging technique, transfer learning enjoys the advantage of integrating the well-learnt knowledge from another related work to facilitate an improved learning result of one task. Most of the existing transfer learning methods are designed for long texts and short texts. However, the latter one distinguishes from the former one in terms of its sparse nature, noise words, syntactical structure, and colloquial terminologies used. A transfer learning algorithm called automatic transfer learning (AutoTL) is proposed for short text mining. By transferring knowledge automatically learnt from theonline information, the proposed method enables training data to be selected automatically. Furthermore, it does not make any a priori assumption about probability distribution. Our experimental results on 20Newsgroups, Simulated Real Auto Aviation, and Reuter-21578 validate the higher performance of the proposed AutoTL over several state-of-of-the-art methods.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Automatic extraction of microorganisms and their habitats from free text using text mining workflows
    Kolluru, BalaKrishna
    Nakjang, Sirintra
    Hirt, Robert P.
    Wipat, Anil
    Ananiadou, Sophia
    JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2011, 8 (02):
  • [32] The case study approach to learning Text Mining
    Baiburin, Yerzhan
    Zhantassova, Zheniskul
    Nugumanova, Aliya
    Syzdykpayeva, Aigul
    Bessmertny, Igor
    2016 IEEE 10TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT), 2016, : 3 - 5
  • [33] Machine Learning Techniques Used for Text Mining
    Godoy Viera, Angel Freddy
    INVESTIGACION BIBLIOTECOLOGICA, 2017, 31 (71): : 103 - 126
  • [34] Learning Tone and Attribution for Financial Text Mining
    El-Haj, Mahmoud
    Rayson, Paul
    Young, Steven
    Moore, Andrew
    Walker, Martin
    Schleicher, Thomas
    Athanasakou, Vasiliki
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 1820 - 1825
  • [35] Machine learning and text mining of trophic links
    Milani, Ghazal Afroozi
    Bohan, David
    Dunbar, Stuart
    Muggleton, Stephen
    Raybould, Alan
    Tamaddoni-Nezhad, Alireza
    2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 2, 2012, : 410 - 415
  • [36] Learning Scientific Concepts with Text Mining Support
    Reategui, Eliseo
    Costa, Ana Paula M.
    Epstein, Daniel
    Carniato, Michel
    METHODOLOGIES AND INTELLIGENT SYSTEMS FOR TECHNOLOGY ENHANCED LEARNING, 2019, 804 : 97 - 105
  • [37] Active Learning for Text Mining from Crowds
    Shao, Hao
    ADVANCES IN ARTIFICIAL INTELLIGENCE: FROM THEORY TO PRACTICE (IEA/AIE 2017), PT II, 2017, 10351 : 409 - 418
  • [38] Automatic Surveillance of Pandemics Using Big Data and Text Mining
    Alharbi, Abdullah
    Alosaimi, Wael
    Uddin, M. Irfan
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (01): : 303 - 317
  • [39] Automatic Rule Definition for Pattern-Based Text Mining
    Kuriu, Minoki
    Mendonca, Israel
    Aritsugi, Masayoshi
    2023 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING, BIGCOMP, 2023, : 187 - 194
  • [40] Automatic classification of academic documents using text mining techniques
    Nunez, Haydemar
    Ramos, Esmeralda
    2012 XXXVIII CONFERENCIA LATINOAMERICANA EN INFORMATICA (CLEI), 2012,