Substructural Regularization With Data-Sensitive Granularity for Sequence Transfer Learning

被引:10
|
作者
Sun, Shichang [1 ,2 ,3 ]
Liu, Hongbo [4 ]
Meng, Jiana [1 ,2 ,3 ]
Chen, C. L. Philip [5 ,6 ]
Yang, Yu [4 ]
机构
[1] Dalian Univ Technol, Sch Comp, Dalian 116024, Peoples R China
[2] Dalian Minzu Univ, Sch Comp, Dalian 116600, Peoples R China
[3] Dalian Maritime Univ, Inst Cognit Intelligence Technol, Dalian 116026, Peoples R China
[4] Dalian Maritime Univ, Inst Cognit Intelligence Technol, Sch Informat Sci & Technol, Dalian 116026, Peoples R China
[5] Univ Macau, UMacau Res Inst, Fac Sci & Technol, Zhuhai 99999, Peoples R China
[6] Dalian Maritime Univ, Sch Informat Sci & Technol, Dalian 116026, Peoples R China
基金
中国国家自然科学基金;
关键词
Data-sensitive granularity; hidden Markov model (HMM); relative entropy (RE); sequence transfer learning; substructural regularization; A-POSTERIORI ADAPTATION; HMM PARAMETERS; MARKOV; CATEGORIZATION; DISTANCE; MODEL;
D O I
10.1109/TNNLS.2016.2638321
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sequence transfer learning is of interest in both academia and industry with the emergence of numerous new text domains from Twitter and other social media tools. In this paper, we put forward the data-sensitive granularity for transfer learning, and then, a novel substructural regularization transfer learning model (STLM) is proposed to preserve target domain features at substructural granularity in the light of the condition of labeled data set size. Our model is underpinned by hidden Markov model and regularization theory, where the substructural representation can be integrated as a penalty after measuring the dissimilarity of substructures between target domain and STLM with relative entropy. STLM can achieve the competing goals of preserving the target domain substructure and utilizing the observations from both the target and source domains simultaneously. The estimation of STLM is very efficient since an analytical solution can be derived as a necessary and sufficient condition. The relative usability of substructures to act as regularization parameters and the time complexity of STLM are also analyzed and discussed. Comprehensive experiments of part-of-speech tagging with both Brown and Twitter corpora fully justify that our model can make improvements on all the combinations of source and target domains.
引用
收藏
页码:2545 / 2557
页数:13
相关论文
共 50 条
  • [1] Data-sensitive visual navigation
    Mindek, Peter
    Mistelbauer, Gabriel
    Groeller, Eduard
    Bruckner, Stefan
    COMPUTERS & GRAPHICS-UK, 2017, 67 : 77 - 85
  • [2] Regularization of Sequence Data for Machine Learning
    Bai, Bryan
    Kremer, Stefan C.
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS, 2011, : 19 - 25
  • [3] Optimal Nonlinear Pricing with Data-Sensitive Consumers
    Kraehmer, Daniel
    Strausz, Roland
    AMERICAN ECONOMIC JOURNAL-MICROECONOMICS, 2023, 15 (02) : 80 - 108
  • [4] A comprehensive review on Federated Learning for Data-Sensitive Application: Open issues & challenges
    Narula, Manu
    Meena, Jasraj
    Vishwakarma, Dinesh Kumar
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [5] Federated workload-aware quantized framework for secure learning in data-sensitive applications
    Narula, Manu
    Meena, Jasraj
    Vishwakarma, Dinesh Kumar
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 168
  • [6] Hierarchical Granularity Transfer Learning
    Min, Shaobo
    Xie, Hongtao
    Yao, Hantao
    Deng, Xuran
    Zha, Zheng-Jun
    Zhang, Yongdong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [7] Characterization of data-sensitive wireless distributed networked-control-systems
    Gupta, Rachana Ashok
    Agarwal, Avesh Kumar
    Chow, Mo-Yuen
    Wang, Wenye
    2007 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS, VOLS 1-3, 2007, : 983 - 988
  • [8] Online fraud detection: 'In the moment' digital accountability in a data-sensitive setting
    Yekta, Semire
    Neyland, Daniel
    BIG DATA & SOCIETY, 2024, 11 (03):
  • [9] Edge AI and Blockchain for Privacy-Critical and Data-Sensitive Applications
    Nawaz, A.
    Gia, T. N.
    Queralta, J. Pena
    Westerlund, T.
    2019 TWELFTH INTERNATIONAL CONFERENCE ON MOBILE COMPUTING AND UBIQUITOUS NETWORK (ICMU), 2019,
  • [10] High-dimensional similarity search using data-sensitive space partitioning
    Kulkarni, Sachin
    Orlandic, Ratko
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2006, 4080 : 738 - 750