Efficient Low-rank Backpropagation for Vision Transformer Adaptation

被引:0
|
作者
Yang, Yuedong [1 ]
Chiang, Hung-Yueh [1 ]
Li, Guihong [1 ]
Marculescu, Diana [1 ]
Marculescu, Radu [1 ]
机构
[1] Univ Texas Austin, Chandra Family Dept Elect & Comp Engn, Austin, TX 78712 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The increasing scale of vision transformers (ViT) has made the efficient fine-tuning of these large models for specific needs a significant challenge in various applications. This issue originates from the computationally demanding matrix multiplications required during the backpropagation process through linear layers in ViT. In this paper, we tackle this problem by proposing a new Low-rank Back-Propagation viaWalsh-Hadamard Transformation (LBP-WHT) method. Intuitively, LBP-WHT projects the gradient into a low-rank space and carries out backpropagation. This approach substantially reduces the computation needed for adapting ViT, as matrix multiplication in the low-rank space is far less resource-intensive. We conduct extensive experiments with different models (ViT, hybrid convolution-ViT model) on multiple datasets to demonstrate the effectiveness of our method. For instance, when adapting an EfficientFormer-L1 model on CIFAR100, our LBP-WHT achieves 10.4% higher accuracy than the state-of-the-art baseline, while requiring 9 MFLOPs less computation. As the first work to accelerate ViT adaptation with low-rank backpropagation, our LBP-WHT method is complementary to many prior efforts and can be combined with them for better performance. Code: https://github.com/SLDGroup/LBP- WHT
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Lightweight Low-Rank Adaptation Vision Transformer Framework for Cervical Cancer Detection and Cervix Type Classification
    Hong, Zhenchen
    Xiong, Jingwei
    Yang, Han
    Mo, Yu K.
    BIOENGINEERING-BASEL, 2024, 11 (05):
  • [2] DBA: Efficient Transformer With Dynamic Bilinear Low-Rank Attention
    Qin, Bosheng
    Li, Juncheng
    Tang, Siliang
    Zhuang, Yueting
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025,
  • [3] TriAxial Low-Rank Transformer for Efficient Medical Image Segmentation
    Shang, Jiang
    Fang, Xi
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT II, 2024, 14426 : 91 - 102
  • [4] Low-Rank Rescaled Vision Transformer Fine-Tuning: A Residual Design Approach
    Dong, Wei
    Zhang, Xing
    Chen, Bihui
    Yang, Dawei
    Lin, Zhijun
    Yang, Qingsen
    Wang, Peng
    Yang, Yang
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 16101 - 16110
  • [5] Low-Rank Continual Pyramid Vision Transformer: Incrementally Segment Whole-Body Organs in CT with Light-Weighted Adaptation
    Zhu, Vince
    Ji, Zhanghexuan
    Guo, Dazhou
    Wang, Puyang
    Xia, Yingda
    Lu, Le
    Ye, Xianghua
    Zhu, Wei
    Jin, Dakai
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT VIII, 2024, 15008 : 371 - 381
  • [6] MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning
    Agiza, Ahmed
    Neseem, Marina
    Reda, Sherief
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 16196 - 16205
  • [7] ViTALiTy: Unifying Low-rank and Sparse Approximation for Vision Transformer Acceleration with a Linear Taylor Attention
    Dass, Jyotikrishna
    Wu, Shang
    Shi, Huihong
    Li, Chaojian
    Ye, Zhifan
    Wang, Zhongfeng
    Lin, Yingyan
    2023 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, HPCA, 2023, : 415 - 428
  • [8] LIGHTWEIGHT AND EFFICIENT END-TO-END SPEECH RECOGNITION USING LOW-RANK TRANSFORMER
    Winata, Genta Indra
    Cahyawijaya, Samuel
    Lin, Zhaojiang
    Liu, Zihan
    Fung, Pascale
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6144 - 6148
  • [9] Optimizing low-rank adaptation with decomposed matrices and adaptive rank allocation
    Zhang, Dacao
    Yang, Fan
    Zhang, Kun
    Li, Xin
    Wei, Si
    Hong, Richang
    Wang, Meng
    FRONTIERS OF COMPUTER SCIENCE, 2025, 19 (05)
  • [10] Low-Rank Correlation Learning for Unsupervised Domain Adaptation
    Lu, Yuwu
    Wong, Wai Keung
    Yuan, Chun
    Lai, Zhihui
    Li, Xuelong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4153 - 4167