Seq-U-Net: A One-Dimensional Causal U-Net for Efficient Sequence Modelling

Citations: 0
Authors
Stoller, Daniel [1]
Tian, Mi [2]
Ewert, Sebastian [2]
Dixon, Simon [1]
Affiliations
[1] Queen Mary Univ London, London, England
[2] Spotify, Stockholm, Sweden
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Convolutional neural networks (CNNs) with dilated filters, such as the Wavenet or the Temporal Convolutional Network (TCN), have shown good results in a variety of sequence modelling tasks. While their receptive field grows exponentially with the number of layers, computing the convolutions over very long sequences of features in each layer is time- and memory-intensive, which prohibits the use of longer receptive fields in practice. To increase efficiency, we make use of the "slow feature" hypothesis, which states that many features of interest vary slowly over time. To this end, we use a U-Net architecture that computes features at multiple time-scales and adapt it to our auto-regressive scenario by making the convolutions causal. We apply our model ("Seq-U-Net") to a variety of tasks, including language and audio generation. In comparison to TCN and Wavenet, our network consistently saves memory and computation time, with speed-ups for training and inference of over 4x in the audio generation experiment in particular, while achieving comparable performance on real-world tasks.
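The two ideas the abstract relies on, causal convolution and a receptive field that grows exponentially with depth, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation and does not reproduce the Seq-U-Net architecture. The function names `causal_conv1d` and `receptive_field` are hypothetical, and the dilation schedule (doubling at each layer) is an assumption chosen to match the exponential growth described for dilated models.

```python
def causal_conv1d(x, weights, dilation=1):
    """Causal 1-D convolution: y[t] = sum_i weights[i] * x[t - i * dilation].

    Inputs before the start of the sequence are treated as zero, so y[t]
    never depends on x[t + 1], x[t + 2], ...
    """
    out = []
    for t in range(len(x)):
        acc = 0.0
        for i, w in enumerate(weights):
            idx = t - i * dilation
            if idx >= 0:  # skip taps that would fall before the sequence start
                acc += w * x[idx]
        out.append(acc)
    return out


def receptive_field(kernel_size, num_layers):
    """Receptive field of a stack of causal convolutions.

    Assumes layer l (counted from 0) uses dilation 2**l; each layer then
    adds (kernel_size - 1) * 2**l time steps of context.
    """
    rf = 1
    for layer in range(num_layers):
        rf += (kernel_size - 1) * 2 ** layer
    return rf
```

Under these assumptions, ten layers with kernel size 2 cover 1024 time steps, yet every layer still processes the sequence at full resolution. That per-layer cost is what the abstract identifies as the bottleneck and what computing features at coarser time-scales is meant to reduce.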
Pages: 2893-2900
Page count: 8
Related papers
  • [1] UIU-Net: U-Net in U-Net for Infrared Small Object Detection
    Wu, Xin
    Hong, Danfeng
    Chanussot, Jocelyn
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 364 - 376
  • [2] Slim U-Net: Efficient Anatomical Feature Preserving U-net Architecture for Ultrasound Image Segmentation
    Raina, Deepak
    Verma, Kashish
    Chandrashekhara, Sheragaru Hanumanthappa
    Saha, Subir Kumar
    2022 9TH INTERNATIONAL CONFERENCE ON BIOMEDICAL AND BIOINFORMATICS ENGINEERING, ICBBE 2022, 2022, : 41 - 48
  • [3] Chimeric U-Net - Modifying the standard U-Net towards explainability
    Schulze, Kenrick
    Peppert, Felix
    Schuette, Christof
    Sunkara, Vikram
    ARTIFICIAL INTELLIGENCE, 2025, 338
  • [4] U-Net vs Transformer: Is U-Net Outdated in Medical Image Registration?
    Jia, Xi
    Bartlett, Joseph
    Zhang, Tianyang
    Lu, Wenqi
    Qiu, Zhaowen
    Duan, Jinming
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2022, 2022, 13583 : 151 - 160
  • [5] Wavelet U-Net: Incorporating Wavelet Transform Into U-Net for Liver Segmentation
    Chang, J.
    Chang, C.
    MEDICAL PHYSICS, 2021, 48 (06)
  • [6] Breast tumor segmentation in ultrasound images: comparing U-net and U-net++
    de Oliveira, Carlos Eduardo Gonçalves
    Vieira, Sílvio Leão
    Paranaiba, Caio Felipe Brito
    Itikawa, Emerson Nobuyuki
    Research on Biomedical Engineering, 2025, 41 (01)
  • [7] Auto-Segmentation On Liver With U-Net And Pixel Deconvolutional U-Net
    Yao, H.
    Chang, J.
    MEDICAL PHYSICS, 2020, 47 (06) : E584 - E584
  • [8] Chaining a U-Net With a Residual U-Net for Retinal Blood Vessels Segmentation
    Alfonso Francia, Gendry
    Pedraza, Carlos
    Aceves, Marco
    Tovar-Arriaga, Saul
    IEEE ACCESS, 2020, 8 : 38493 - 38500
  • [9] An Improved U-Net Method for Sequence Images Segmentation
    Wen, Peizhi
    Sun, Menglong
    Lei, Yongqing
    2019 ELEVENTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI 2019), 2019, : 184 - 189
  • [10] Sharp dense U-Net: an enhanced dense U-Net architecture for nucleus segmentation
    Senapati, Pradip
    Basu, Anusua
    Deb, Mainak
    Dhal, Krishna Gopal
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (06) : 2079 - 2094