Robust Representation Learning via Sparse Attention Mechanism for Similarity Models

Cited by: 0
Authors
Ermilova, Alina [1]
Baramiia, Nikita [1]
Kornilov, Valerii [1]
Petrakov, Sergey [1]
Zaytsev, Alexey [1, 2]
Affiliations
[1] Skolkovo Inst Sci & Technol, Moscow 121205, Russia
[2] Sber, Risk Management, Moscow 121165, Russia
Source
IEEE ACCESS | 2024 / Volume 12
Keywords
Transformers; Oil insulation; Task analysis; Time series analysis; Meteorology; Training; Deep learning; Representation learning; efficient transformer; robust transformer; representation learning; similarity learning; TRANSFORMER;
DOI
10.1109/ACCESS.2024.3418779
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Attention-based models are widely used for time series data. However, because attention has quadratic complexity in the input sequence length, the application of Transformers is limited by high resource demands. Moreover, their modifications for industrial time series need to be robust to missing or noisy values, which complicates extending them to new applications. To address these issues, we introduce a class of efficient Transformers named Regularized Transformers (Reguformers). We implement a regularization technique inspired by dropout to improve robustness and reduce computational expenses without significantly modifying the pipeline. Our experiments focus on oil and gas data. For the well-interval similarity task, our best Reguformer configuration reaches ROC AUC 0.97, which is comparable to Informer (0.978) and outperforms the baselines: the previous LSTM model (0.934), the classical Transformer model (0.967), and three recent promising modifications of the original Transformer, namely Performer (0.949), LRformer (0.955), and DropDim (0.777). We also conduct corresponding experiments on three additional datasets from different domains and obtain superior results. The quality gain of the best Reguformer relative to Transformer varies across datasets from 3.7% to 9.6%, while the gain relative to Informer spans a wider range, from 1.7% to 18.4%.
Pages: 97833 - 97850
Number of pages: 18
Related papers
50 in total
  • [21] Demo: Robust Face Recognition via Sparse Representation
    Wright, John
    Ganesh, Arvind
    Zhou, Zihan
    Wagner, Andrew
    Ma, Yi
    2008 8TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2008), VOLS 1 AND 2, 2008 : 942 - 943
  • [22] Exploring attention mechanism for graph similarity learning
    Tan, Wenhui
    Gao, Xin
    Li, Yiyang
    Wen, Guangqi
    Cao, Peng
    Yang, Jinzhu
    Li, Weiping
    Zaiane, Osmar R.
    KNOWLEDGE-BASED SYSTEMS, 2023, 276
  • [23] Attributed network representation learning via improved graph attention with robust negative sampling
    Huilian Fan
    Yuanchang Zhong
    Guangpu Zeng
    Lili Sun
    Applied Intelligence, 2021, 51 : 416 - 426
  • [24] Attributed network representation learning via improved graph attention with robust negative sampling
    Fan, Huilian
    Zhong, Yuanchang
    Zeng, Guangpu
    Sun, Lili
    APPLIED INTELLIGENCE, 2021, 51 (01) : 416 - 426
  • [25] Robust multi-view learning via M-estimator joint sparse representation
    Hu, Yutao
    Wang, Yulong
    Li, Han
    Chen, Hong
    PATTERN RECOGNITION, 2024, 151
  • [26] Hierarchical Deep Multitask Learning With the Attention Mechanism for Similarity Learning
    Huang, Yan
    Wang, Qicong
    Yang, Wenming
    Liao, Qingmin
    Meng, Hongying
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (04) : 1729 - 1742
  • [27] SIMILARITY-BASED IMAGE CLASSIFICATION VIA KERNELIZED SPARSE REPRESENTATION
    Zeng, Zhi
    Li, Heping
    Liang, Wei
    Zhang, Shuwu
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010 : 277 - 280
  • [28] Robust Visual Tracking and Vehicle Classification via Sparse Representation
    Mei, Xue
    Ling, Haibin
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (11) : 2259 - 2272
  • [29] Robust Face Recognition Via Gabor Feature and Sparse Representation
    Hao, Yu-Juan
    Zhang, Li-Quan
    3RD ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS (ITA 2016), 2016, 7
  • [30] Robust Visual Tracking via Appearance Modeling and Sparse Representation
    Li, Ming
    Ma, Fanglan
    Nian, Fuzhong
    JOURNAL OF COMPUTERS, 2014, 9 (07) : 1612 - 1619