Knowledge Transfer with Low-Quality Data: A Feature Extraction Issue

被引:39
|
作者
Quanz, Brian [1 ]
Huan, Jun [1 ]
Mishra, Meenakshi [1 ]
机构
[1] Univ Kansas, Informat & Telecommun Technol Ctr, Dept Elect Engn & Comp Sci, Lawrence, KS 66045 USA
基金
美国国家科学基金会;
关键词
Knowledge transfer; transfer learning; feature extraction; sparse coding; low-quality data; ADAPTATION;
D O I
10.1109/TKDE.2012.75
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Effectively utilizing readily available auxiliary data to improve predictive performance on new modeling tasks is a key problem in data mining. In this research, the goal is to transfer knowledge between sources of data, particularly when ground-truth information for the new modeling task is scarce or is expensive to collect where leveraging any auxiliary sources of data becomes a necessity. Toward seamless knowledge transfer among tasks, effective representation of the data is a critical but yet not fully explored research area for the data engineer and data miner. Here, we present a technique based on the idea of sparse coding, which essentially attempts to find an embedding for the data by assigning feature values based on subspace cluster membership. We modify the idea of sparse coding by focusing the identification of shared clusters between data when source and target data may have different distributions. In our paper, we point out cases where a direct application of sparse coding will lead to a failure of knowledge transfer. We then present the details of our extension to sparse coding, by incorporating distribution distance estimates for the embedded data, and show that the proposed algorithm can overcome the shortcomings of the sparse coding algorithm on synthetic data and achieve improved predictive performance on a real world chemical toxicity transfer learning task.
引用
收藏
页码:1789 / 1802
页数:14
相关论文
共 50 条
  • [41] Online Detection of Low-Quality Synchrophasor Data Considering Frequency Similarity
    Ju, Wenyun
    Silva-Saravia, Horacio
    Nayak, Neeraj
    Yao, Wenxuan
    Zhang, Yichen
    Shi, Qingxin
    Ye, Fan
    IEEE TRANSACTIONS ON POWER DELIVERY, 2021, 36 (06) : 3988 - 3991
  • [42] An aerodynamic model identification method suitable for low-quality flight data
    Li, Jinsheng
    Zhuang, Ling
    Song, Jiahong
    Dong, Chao
    Guo, Ke
    PROCEEDINGS OF 2020 3RD INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS), 2020, : 381 - 388
  • [43] Genetic Programming-Based Discriminative Feature Learning for Low-Quality Image Classification
    Bi, Ying
    Xue, Bing
    Zhang, Mengjie
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (08) : 8272 - 8285
  • [44] Identifying low-quality patterns in accident reports from textual data
    Macedo, July B.
    Ramos, Plinio M. S.
    Maior, Caio B. S.
    Moura, Marcio J. C.
    Lins, Isis D.
    Vilela, Romulo F. T.
    INTERNATIONAL JOURNAL OF OCCUPATIONAL SAFETY AND ERGONOMICS, 2023, 29 (03) : 1088 - 1100
  • [45] Special Issue on Recent Approaches in Data Mining and Knowledge Extraction
    Solanki, Vijender Kumar
    Lloret Mauri, Jaime
    Semwal, Vijay Bhaskar
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2018, 8 (01) : V - VI
  • [46] Low-quality multivariate spatio-temporal serial data preprocessing
    Tao Yu
    Le Li
    Lajiao Chen
    Weijing Song
    Cluster Computing, 2019, 22 : 2357 - 2370
  • [47] Knowledge Transfer in Commercial Feature Extraction for the Retail Store Location Problem
    Ahedo, Virginia
    Santos, Jose Ignacio
    Galan, Jose Manuel
    IEEE ACCESS, 2021, 9 : 132967 - 132979
  • [48] Taming the Low-Quality Path for Reinforcement Learning-Based Knowledge Graph Reasoning
    Wang, Ruisheng
    Zhang, Weiwen
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 2197 - 2202
  • [49] Low-quality carbs: a mainstay for Americans
    Malochleb, Margaret
    FOOD TECHNOLOGY, 2019, 73 (11) : 10 - 12
  • [50] FINGERPRINT RECOGNITION IN LOW-QUALITY IMAGES
    COETZEE, L
    BOTHA, EC
    PATTERN RECOGNITION, 1993, 26 (10) : 1441 - 1460