Seq-U-Net: A One-Dimensional Causal U-Net for Efficient Sequence Modelling

Cited by: 0

Authors:

Stoller, Daniel [1]

Tian, Mi [2]

Ewert, Sebastian [2]

Dixon, Simon [1]

Affiliations:

[1] Queen Mary Univ London, London, England

[2] Spotify, Stockholm, Sweden

Source:

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence | 2020

Funding:

Engineering and Physical Sciences Research Council (EPSRC), UK;

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Convolutional neural networks (CNNs) with dilated filters such as the Wavenet or the Temporal Convolutional Network (TCN) have shown good results in a variety of sequence modelling tasks. While their receptive field grows exponentially with the number of layers, computing the convolutions over very long sequences of features in each layer is time- and memory-intensive, and prohibits the use of longer receptive fields in practice. To increase efficiency, we make use of the "slow feature" hypothesis stating that many features of interest are slowly varying over time. For this, we use a U-Net architecture that computes features at multiple time-scales and adapt it to our auto-regressive scenario by making convolutions causal. We apply our model ("Seq-U-Net") to a variety of tasks including language and audio generation. In comparison to TCN and Wavenet, our network consistently saves memory and computation time, with speed-ups for training and inference of over 4x in the audio generation experiment in particular, while achieving comparable performance on real-world tasks.
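Two ideas in the abstract, causal convolution and the exponential growth of the receptive field with depth, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `causal_conv1d` and `receptive_field` are hypothetical, the dilation schedule 1, 2, 4, ... is an assumption in the style of Wavenet/TCN, and the multi-scale resampling of the U-Net itself is not shown.

```python
import numpy as np


def causal_conv1d(x, kernel, dilation=1):
    """Causal 1-D convolution: y[t] = sum_k kernel[k] * x[t - k * dilation].

    Inputs before the start of the sequence are treated as zero, so y[t]
    never depends on x[t + 1], x[t + 2], ... (illustrative helper only).
    """
    x = np.asarray(x, dtype=float)
    kernel = np.asarray(kernel, dtype=float)
    pad = (len(kernel) - 1) * dilation
    # Left-pad only: padding on the past side is what makes the filter causal.
    padded = np.concatenate([np.zeros(pad), x])
    y = np.zeros(len(x))
    for k, w in enumerate(kernel):
        start = pad - k * dilation
        y += w * padded[start:start + len(x)]
    return y


def receptive_field(kernel_size, num_layers):
    """Receptive field of a stack whose layer i uses dilation 2**i (assumed)."""
    return 1 + (kernel_size - 1) * (2 ** num_layers - 1)


if __name__ == "__main__":
    signal = np.array([1.0, 2.0, 3.0, 4.0])
    print(causal_conv1d(signal, [1.0, 1.0]))
    print(receptive_field(kernel_size=2, num_layers=3))
```

Under these assumptions, each added layer roughly doubles the receptive field, but every layer still processes the sequence at full length; that per-layer cost is what the abstract says the multi-scale design is meant to reduce.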

Pages: 2893-2900

Page count: 8
