Seq-U-Net: A One-Dimensional Causal U-Net for Efficient Sequence Modelling

Authors
Stoller, Daniel [1]
Tian, Mi [2]
Ewert, Sebastian [2]
Dixon, Simon [1]
Affiliations
[1] Queen Mary Univ London, London, England
[2] Spotify, Stockholm, Sweden
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Convolutional neural networks (CNNs) with dilated filters, such as Wavenet or the Temporal Convolutional Network (TCN), have shown good results in a variety of sequence modelling tasks. While their receptive field grows exponentially with the number of layers, computing the convolutions over very long sequences of features in each layer is time- and memory-intensive, which prohibits the use of longer receptive fields in practice. To increase efficiency, we make use of the "slow feature" hypothesis, which states that many features of interest vary slowly over time. To this end, we use a U-Net architecture that computes features at multiple time-scales and adapt it to our auto-regressive scenario by making convolutions causal. We apply our model ("Seq-U-Net") to a variety of tasks including language and audio generation. In comparison to TCN and Wavenet, our network consistently saves memory and computation time, with speed-ups for training and inference of over 4x in the audio generation experiment in particular, while achieving comparable performance on real-world tasks.
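As a rough illustration of two ideas the abstract relies on, the sketch below shows a causal dilated 1-D convolution (each output depends only on current and past inputs) and the receptive field of a stack whose dilation doubles per layer. It is not the authors' implementation: the function names `causal_conv1d` and `receptive_field` are hypothetical, NumPy stands in for a deep learning framework, and the doubling dilation schedule (1, 2, 4, ...) is an assumption modelled on typical Wavenet/TCN setups.

```python
import numpy as np


def causal_conv1d(x, w, dilation=1):
    """Causal dilated convolution: y[t] = sum_k w[k] * x[t - k * dilation].

    Inputs before t = 0 are treated as zeros (left padding only),
    so y[t] never depends on future samples.
    """
    x = np.asarray(x, dtype=float)
    k = len(w)
    pad = (k - 1) * dilation
    xp = np.concatenate([np.zeros(pad), x])
    y = np.zeros(len(x))
    for i in range(k):
        # Tap i looks i * dilation steps into the past.
        start = pad - i * dilation
        y += w[i] * xp[start:start + len(x)]
    return y


def receptive_field(kernel_size, num_layers):
    """Receptive field of stacked causal convolutions.

    Assumes layer l uses dilation 2**l (an illustrative schedule).
    """
    return 1 + (kernel_size - 1) * sum(2 ** l for l in range(num_layers))


if __name__ == "__main__":
    x = np.arange(8)
    print(causal_conv1d(x, [1.0, 1.0], dilation=2))
    print(receptive_field(kernel_size=2, num_layers=10))
```

With kernel size 2, ten such layers cover 1024 time steps, yet every layer still processes the full-length sequence. That per-layer cost is what the abstract describes as the bottleneck the multi-scale U-Net design is meant to reduce.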
Pages: 2893-2900
Page count: 8