Denoising in Representation Space via Data-Dependent Regularization for Better Representation

Cited by: 0
Authors
Chen, Muyi [1, 2]
Wang, Daling [1 ]
Feng, Shi [1 ]
Zhang, Yifei [1 ]
Affiliations
[1] Northeastern Univ, Sch Comp Sci & Engn, Shenyang 110169, Peoples R China
[2] Shenyang Ligong Univ, Sch Automat & Elect Engn, Shenyang 110159, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
deep neural network; representation space; fully connected layer; feature extractor
DOI
10.3390/math11102327
Chinese Library Classification number
O1 [Mathematics]
Discipline classification codes
0701; 070101
Abstract
Despite the success of deep learning models, it remains challenging for over-parameterized models to learn good representations in small-sample-size settings. In this paper, motivated by previous work on out-of-distribution (OoD) generalization, we study the representation learning problem from an OoD perspective to identify the fundamental factors affecting representation quality. We formulate a notion of "out-of-feature subspace (OoFS) noise" for the first time, and we link the OoFS noise in the feature extractor to the OoD performance of the model by proving two theorems showing that reducing OoFS noise in the feature extractor helps achieve better representations. Moreover, we identify two causes of OoFS noise and prove that the OoFS noise induced by random initialization can be filtered out via L2 regularization. Finally, we propose a novel data-dependent regularizer that acts on the weights of the fully connected layer to reduce noise in the representations, thus implicitly forcing the feature extractor, via back-propagation, to focus on informative features and rely less on noise. Experiments on synthetic datasets show that our method can learn hard-to-learn features, filters out noise effectively, and outperforms GD, AdaGrad, and KFAC. Furthermore, experiments on benchmark datasets show that our method achieves the best performance on three of four tasks.
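The abstract's claim about L2 regularization can be illustrated with a much simpler setting than the paper's. The sketch below is not the authors' method or experiment: it assumes a plain linear model trained by gradient descent, with inputs confined to a low-dimensional subspace, and all names (`train_linear`, `weight_decay`, the dimensions) are hypothetical. Weight components outside the input subspace receive no gradient from the data loss, so without regularization they keep their random initial values, while an L2 penalty shrinks them geometrically.

```python
import numpy as np

def train_linear(X, y, w0, lr=0.1, weight_decay=0.0, steps=500):
    """Gradient descent on mean squared error plus an optional L2 penalty."""
    w = w0.copy()
    n = X.shape[0]
    for _ in range(steps):
        # Data-loss gradient plus the L2 (weight decay) term.
        grad = X.T @ (X @ w - y) / n + weight_decay * w
        w -= lr * grad
    return w

rng = np.random.default_rng(0)
n, d, k = 20, 10, 3  # samples, input dimension, feature-subspace dimension

# Assumption for illustration: inputs occupy only the first k coordinates,
# so the remaining d - k weight directions never see a data gradient.
X = np.zeros((n, d))
X[:, :k] = rng.normal(size=(n, k))

w_true = np.zeros(d)
w_true[:k] = [1.0, -2.0, 0.5]
y = X @ w_true

w0 = rng.normal(size=d)  # random initialization, including off-subspace parts

w_plain = train_linear(X, y, w0)
w_l2 = train_linear(X, y, w0, weight_decay=0.05)

# Norm of the weight components lying outside the input subspace.
off_init = np.linalg.norm(w0[k:])
off_plain = np.linalg.norm(w_plain[k:])
off_l2 = np.linalg.norm(w_l2[k:])

print(f"off-subspace norm at init:       {off_init:.4f}")
print(f"off-subspace norm, no L2:        {off_plain:.4f}")
print(f"off-subspace norm, with L2:      {off_l2:.4f}")
```

In this toy setting the unregularized run leaves the off-subspace weights untouched, whereas the regularized run multiplies them by `1 - lr * weight_decay` at every step. The paper's data-dependent regularizer targets noise that L2 regularization alone does not address; that part is not reproduced here.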
Pages: 33
Related papers
50 in total
  • [31] Speech Reconstruction via Sparse Representation using Harmonic Regularization
    Tang, Yibin
    Chen, Ying
    Xu, Ning
    Zhu, Changping
    Zhou, Lin
    2015 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2015,
  • [32] Speech reconstruction via sparse representation using harmonic regularization
    College of IOT Engineerings, Hohai University, Changzhou, China
    Unknown
    Int. Conf. Wirel. Commun. Signal Process., WCSP, 2015,
  • [33] Interpretable Representation Learning of Cardiac MRI via Attribute Regularization
    Di Folco, Maxime
    Bercea, Cosmin I.
    Chan, Emily
    Schnabel, Julia A.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT X, 2024, 15010 : 492 - 501
  • [34] Data-Dependent Hashing via Nonlinear Spectral Gaps
    Andoni, Alexandr
    Naor, Assaf
    Nikolov, Aleksandar
    Razenshteyn, Ilya
    Waingarten, Erik
    STOC'18: PROCEEDINGS OF THE 50TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING, 2018, : 787 - 800
  • [35] Similarity Learning via Optimizing the Data-Dependent Kernel
    Xiong, Huilin
    Shi, Panfei
    2009 INTERNATIONAL JOINT CONFERENCE ON BIOINFORMATICS, SYSTEMS BIOLOGY AND INTELLIGENT COMPUTING, PROCEEDINGS, 2009, : 512 - 516
  • [36] Data representation learning via dictionary learning and self-representation
    Deyu Zeng
    Jing Sun
    Zongze Wu
    Chris Ding
    Zhigang Ren
    Applied Intelligence, 2023, 53 : 26988 - 27000
  • [37] Data representation learning via dictionary learning and self-representation
    Zeng, Deyu
    Su, Jing
    Wu, Zongze
    Ding, Chris
    Ren, Zhigang
    APPLIED INTELLIGENCE, 2023, 53 (22) : 26988 - 27000
  • [38] Damped Dreamlet Representation for Exploration Seismic Data Interpolation and Denoising
    Huang, Weilin
    Wu, Ru-Shan
    Wang, Runqiu
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (06) : 3159 - 3172
  • [39] β-divergence NMF with biorthogonal regularization for data representation
    Yuan, Ruixue
    Leng, Chengcai
    Li, Bing
    Basu, Anup
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 121
  • [40] Learning via compact data representation
    Davis, MW
    Foltz, PW
    PROCEEDINGS OF THE TWENTIETH ANNUAL CONFERENCE OF THE COGNITIVE SCIENCE SOCIETY, 1998, : 285 - 290