Transductive transfer learning based Genetic Programming for balanced and unbalanced document classification using different types of features

Cited by: 4
Authors
Fu, Wenlong [1 ]
Xue, Bing [1 ]
Gao, Xiaoying [1 ]
Zhang, Mengjie [1 ]
Affiliations
[1] Sch Engn & Comp Sci, POB 600, Wellington 6140, New Zealand
Keywords
Genetic Programming; Document classification; Transfer learning; TEXT CLASSIFICATION; REPRESENTATIONS; WORDS; IDF;
DOI
10.1016/j.asoc.2021.107172
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Document classification is one of the predominant tasks in Natural Language Processing. However, some document classification datasets have no ground truth, while other, similar datasets do. Transfer learning can use similar datasets with ground truth to train effective classifiers for a dataset without ground truth. This paper introduces a transductive transfer learning method for document classification using two different text feature representations: the term frequency (TF) and the semantic feature doc2vec. It makes three main contributions. First, it enables knowledge sharing between a dataset using TF and a dataset using doc2vec in transductive transfer learning to improve performance. Second, it demonstrates that the partially learned programs from TF and from doc2vec can be used alternately to "label then learn", and that they improve each other. Lastly, it addresses the unbalanced dataset problem by taking the unbalanced distribution of categories into account when evolving Genetic Programming (GP) programs on the target domains. Experimental results on two popular document datasets show that the proposed technique effectively transfers knowledge from the GP programs evolved on the source domains to the new GP programs on the target domains using TF or doc2vec. The GP programs evolved by the proposed method achieve an improvement of more than 10 percent over the GP programs evolved directly from the source domains. The proposed technique also effectively uses GP programs evolved from unbalanced datasets (on the source and target domains) to evolve new GP programs on the target domains that balance predictions across categories. © 2021 Elsevier B.V. All rights reserved.
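The alternating "label then learn" idea in the abstract can be illustrated with a minimal sketch. This is not the authors' method: a nearest-centroid classifier is swapped in for the evolved GP programs, the handling of unbalanced categories is omitted, and every name here (`centroid_fit`, `centroid_predict`, `label_then_learn`, `tgt_tf`, `tgt_d2v`) and all of the toy data are hypothetical. It only shows how pseudo-labels produced from one feature view of the unlabelled target documents might be used to train a model on the other view, and back again.

```python
def centroid_fit(X, y):
    """Fit a nearest-centroid classifier (assumed stand-in for a GP program)."""
    sums, counts = {}, {}
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(row))
        for i, v in enumerate(row):
            acc[i] += v
        counts[label] = counts.get(label, 0) + 1
    return {c: [v / counts[c] for v in acc] for c, acc in sums.items()}


def centroid_predict(model, X):
    """Assign each row the label of its closest class centroid."""
    def dist(a, b):
        return sum((p - q) ** 2 for p, q in zip(a, b))
    return [min(model, key=lambda c: dist(model[c], row)) for row in X]


def label_then_learn(src_x, src_y, tgt_view_a, tgt_view_b, rounds=3):
    """Alternate between two target views: label with one, learn on the other.

    src_x / src_y  : labelled source documents in view A (e.g. TF features)
    tgt_view_a / b : the same unlabelled target documents in two feature views
    """
    # Step 1: learn on the labelled source and pseudo-label the target (view A).
    model = centroid_fit(src_x, src_y)
    views = [tgt_view_a, tgt_view_b]
    current = 0
    pseudo = centroid_predict(model, views[current])
    # Step 2: switch views each round, retrain on the pseudo-labels, relabel.
    for _ in range(rounds):
        current = 1 - current
        model = centroid_fit(views[current], pseudo)
        pseudo = centroid_predict(model, views[current])
    return pseudo


# Toy data (hypothetical): two classes, a TF-like 2-D view and a doc2vec-like 1-D view.
offsets = [-0.4, -0.2, 0.0, 0.2, 0.4]
src_tf = [[0 + o, 0 - o] for o in offsets] + [[5 + o, 5 - o] for o in offsets]
src_y = [0] * 5 + [1] * 5
tgt_tf = [[1 + o, 1 + o] for o in offsets] + [[6 + o, 6 + o] for o in offsets]
tgt_d2v = [[-3 + o] for o in offsets] + [[3 + o] for o in offsets]

labels = label_then_learn(src_tf, src_y, tgt_tf, tgt_d2v)
print(labels)
```

In this toy setting the target is only slightly shifted from the source, so the initial pseudo-labels are already clean; on real data the quality of each round depends on how well the two views agree.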
Pages: 11