Transformer-Based Cascade U-shaped Network for Action Segmentation

被引:0
|
作者
Bao, Wenxia [1 ]
Lin, An [1 ]
Huang, Hua [2 ]
Yang, Xianjun [3 ]
Chen, Hemu [4 ]
机构
[1] Anhui Univ, Sch Elect & Informat Engn, Hefei, Peoples R China
[2] China Tobacco Zhejiang Ind Co Ltd, Hangzhou, Zhejiang, Peoples R China
[3] Chinese Acad Sci, Hefei Inst Phys Sci, Hefei, Peoples R China
[4] Anhui Med Univ, Affiliated Hosp 1, Hefei, Peoples R China
关键词
Action Segmentation; Transformer; U-net;
D O I
10.1109/ICIPMC62364.2024.10586708
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Action segmentation requires predicting the action that occurs in each frame of the original video, and existing methods tend to focus on the global relationship of the sequence, ignoring the contextual information at different granularities. To address this problem, this paper proposes a Transformer-based cascaded U-network for action segmentation. The proposed method adopts a cascaded transformer structure, where the feature sequences between the encoder-decoder are connected in a U-shape, which fully combines the global context information as well as the local context information between neighboring frames. The extended temporal convolution as well as the local window attention mechanism are used to enhance the model's ability to perceive long-range action interactions. The proposed method outperforms the current mainstream action segmentation methods on two challenging datasets 50 salads and GTEA.
引用
收藏
页码:157 / 161
页数:5
相关论文
共 50 条
  • [1] Parotid Gland Segmentation Using Purely Transformer-Based U-Shaped Network and Multimodal MRI
    Xu, Zi'an
    Dai, Yin
    Liu, Fayu
    Li, Siqi
    Liu, Sheng
    Shi, Lifu
    Fu, Jun
    ANNALS OF BIOMEDICAL ENGINEERING, 2024, 52 (08) : 2101 - 2117
  • [2] Research on pancreas segmentation method based on cascade U-shaped network
    Yan, Songcai
    Hu, Xinjun
    Hu, Shanshan
    Tian, Jianping
    Xue, Qinyuan
    He, Lin
    Peng, Jianheng
    2024 7TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA, ICAIBD 2024, 2024, : 471 - 477
  • [3] RockFormer: A U-Shaped Transformer Network for Martian Rock Segmentation
    Liu, Haiqiang
    Yao, Meibao
    Xiao, Xueming
    Xiong, Yonggang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [4] A Swin Transformer-Based Encoding Booster Integrated in U-Shaped Network for Building Extraction
    Xiao, Xiao
    Guo, Wenliang
    Chen, Rui
    Hui, Yilong
    Wang, Jianing
    Zhao, Hongyu
    REMOTE SENSING, 2022, 14 (11)
  • [5] Collaborative transformer U-shaped network for medical image segmentation
    Gao, Yufei
    Zhang, Shichao
    Shi, Lei
    Zhao, Guohua
    Shi, Yucheng
    APPLIED SOFT COMPUTING, 2025, 173
  • [6] Optimization of U-shaped pure transformer medical image segmentation network
    Dan, Yongping
    Jin, Weishou
    Wang, Zhida
    Sun, Changhao
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [7] U-shaped network based on Transformer for 3D point clouds semantic segmentation
    Zhang, Jiazhe
    Li, Xingwei
    Zhao, Xianfa
    Ge, Yizhi
    Zhang, Zheng
    2021 THE 5TH INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, ICVIP 2021, 2021, : 170 - 176
  • [8] FTUNet: A Feature-Enhanced Network for Medical Image Segmentation Based on the Combination of U-Shaped Network and Vision Transformer
    Wang, Yuefei
    Yu, Xi
    Yang, Yixi
    Zeng, Shijie
    Xu, Yuquan
    Feng, Ronghui
    NEURAL PROCESSING LETTERS, 2024, 56 (02)
  • [9] FTUNet: A Feature-Enhanced Network for Medical Image Segmentation Based on the Combination of U-Shaped Network and Vision Transformer
    Yuefei Wang
    Xi Yu
    Yixi Yang
    Shijie Zeng
    Yuquan Xu
    Ronghui Feng
    Neural Processing Letters, 56
  • [10] A Transformer-based Cascade Network with Boundary Enhancement Loss for Retinal Vessel Segmentation
    Cai, Binke
    Ma, Liyan
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4292 - 4298