Knowledge Transfer with Low-Quality Data: A Feature Extraction Issue

被引:39
|
作者
Quanz, Brian [1 ]
Huan, Jun [1 ]
Mishra, Meenakshi [1 ]
机构
[1] Univ Kansas, Informat & Telecommun Technol Ctr, Dept Elect Engn & Comp Sci, Lawrence, KS 66045 USA
基金
美国国家科学基金会;
关键词
Knowledge transfer; transfer learning; feature extraction; sparse coding; low-quality data; ADAPTATION;
D O I
10.1109/TKDE.2012.75
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Effectively utilizing readily available auxiliary data to improve predictive performance on new modeling tasks is a key problem in data mining. In this research, the goal is to transfer knowledge between sources of data, particularly when ground-truth information for the new modeling task is scarce or is expensive to collect where leveraging any auxiliary sources of data becomes a necessity. Toward seamless knowledge transfer among tasks, effective representation of the data is a critical but yet not fully explored research area for the data engineer and data miner. Here, we present a technique based on the idea of sparse coding, which essentially attempts to find an embedding for the data by assigning feature values based on subspace cluster membership. We modify the idea of sparse coding by focusing the identification of shared clusters between data when source and target data may have different distributions. In our paper, we point out cases where a direct application of sparse coding will lead to a failure of knowledge transfer. We then present the details of our extension to sparse coding, by incorporating distribution distance estimates for the embedded data, and show that the proposed algorithm can overcome the shortcomings of the sparse coding algorithm on synthetic data and achieve improved predictive performance on a real world chemical toxicity transfer learning task.
引用
收藏
页码:1789 / 1802
页数:14
相关论文
共 50 条
  • [32] Texture-Guided Transfer Learning for Low-Quality Face Recognition
    Zhang, Meng
    Liu, Rujie
    Deguchi, Daisuke
    Murase, Hiroshi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 95 - 107
  • [33] Enhancing action recognition from low-quality skeleton data via part-level knowledge distillation
    Liu, Cuiwei
    Jiang, Youzhi
    Du, Chong
    Li, Zhaokui
    SIGNAL PROCESSING, 2024, 221
  • [34] A federated transfer learning method with low-quality knowledge filtering and dynamic model aggregation for rolling bearing fault diagnosis
    Wang, Ran
    Yan, Fucheng
    Yu, Liang
    Shen, Changqing
    Hu, Xiong
    Chen, Jin
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2023, 198
  • [35] Performance in low-quality jobs
    Bayona, Jaime Andres
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2008, 43 (3-4) : 406 - 406
  • [36] UTILIZATION OF LOW-QUALITY TIMBER
    KING, KFS
    COMMONWEALTH FORESTRY REVIEW, 1977, 56 (03): : 223 - 234
  • [37] Towards Automatically Refining Low-Quality Domain Knowledge: A Case Study in Healthcare
    Bielski, Pawel
    Jendral, Soenke
    Witterauf, Lena
    Bach, Jakob
    MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2023, PT III, 2025, 2135 : 361 - 367
  • [38] BEWARE OF LOW-QUALITY EARNINGS
    HERSHMAN, A
    DUNS BUSINESS MONTH, 1982, 120 (01): : 87 - 88
  • [39] Query by low-quality image
    Fauzi, Mohammad Faizal Ahmad
    Lewis, Paul H.
    IMAGE AND VISION COMPUTING, 2009, 27 (06) : 713 - 724
  • [40] Low-quality multivariate spatio-temporal serial data preprocessing
    Yu, Tao
    Li, Le
    Chen, Lajiao
    Song, Weijing
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 1): : 2357 - 2370