END-TO-END SOURCE SEPARATION WITH ADAPTIVE FRONT-ENDS

Cited by: 0

Authors:

Venkataramani, Shrikant [1]

Casebeer, Jonah [1]

Smaragdis, Paris [2]

Affiliations:

[1] Univ Illinois, Champaign, IL 61820 USA

[2] Univ Illinois, Adobe Res, Champaign, IL USA

Source:

2018 CONFERENCE RECORD OF 52ND ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS | 2018

Keywords:

Auto-encoders; adaptive transforms; source separation; deep learning

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Source separation and other audio applications have traditionally relied on short-time Fourier transforms as a front-end frequency-domain representation step. The unavailability of a neural network equivalent to forward and inverse transforms hinders the implementation of end-to-end learning systems for these applications. We develop an auto-encoder neural network that can act as an equivalent to short-time front-end transforms. We demonstrate the ability of the network to learn optimal, real-valued basis functions directly from the raw waveform of a signal and further show how it can be used as an adaptive front-end for supervised source separation. In terms of separation performance, these transforms significantly outperform their Fourier counterparts. Finally, we also propose and interpret a novel source-to-distortion-ratio-based cost function for end-to-end source separation.

Pages: 684-688

Page count: 5

50 records in total

[31] Wavesplit: End-to-End Speech Separation by Speaker Clustering
Zeghidour, Neil
Grangier, David
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2840 - 2849
[32] Resolution-Adaptive Source-Channel Coding for End-to-End Wireless Image Transmission
Yang, Ke
Wang, Sixian
Tan, Kailin
Dai, Jincheng
Zhou, Dekun
Niu, Kai
2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022 : 1460 - 1465
[33] Intelligent front-ends and product models
Bjoerk, Bo-Christer
International journal for artificial intelligence in engineering, 1991, 6 (01) : 46 - 56
[34] Integrated micromechanical radio front-ends
Nguyen, Clark T. -C.
2008 INTERNATIONAL SYMPOSIUM ON VLSI TECHNOLOGY, SYSTEMS AND APPLICATIONS (VLSI-TSA), PROCEEDINGS OF TECHNICAL PROGRAM, 2008 : 3 - +
[35] MAGNETIC RECORDING CHANNEL FRONT-ENDS
KLAASSEN, KB
IEEE TRANSACTIONS ON MAGNETICS, 1991, 27 (06) : 4503 - 4508
[36] Support for Authoring Service Front-Ends
Paterno, Fabio
Santoro, Carmen
Spano, Lucio Davide
EICS'09: PROCEEDINGS OF THE ACM SIGCHI SYMPOSIUM ON ENGINEERING INTERACTIVE COMPUTING SYSTEMS, 2009 : 85 - 90
[37] An adaptive front end for olfaction
Apsel, A
Stanford, T
Hasler, P
ISCAS '98 - PROCEEDINGS OF THE 1998 INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-6, 1998 : B107 - B110
[38] END-TO-END SPEECH RECOGNITION WITH ADAPTIVE COMPUTATION STEPS
Li, Mohan
Liu, Min
Masanori, Hattori
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019 : 6246 - 6250
[39] Adaptive Feature Selection for End-to-End Speech Translation
Zhang, Biao
Titov, Ivan
Haddow, Barry
Sennrich, Rico
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020 : 2533 - 2544
[40] A system for end-to-end authentication of adaptive multimedia content
Suzuki, T
Ramzan, Z
Fujimoto, H
Gentry, C
Nakayama, T
Jain, R
COMMUNICATIONS AND MULTIMEDIA SECURITY, 2005, 175 : 237 - 249
