END-TO-END SOURCE SEPARATION WITH ADAPTIVE FRONT-ENDS

被引:0
|
作者
Venkataramani, Shrikant [1 ]
Casebeer, Jonah [1 ]
Smaragdis, Paris [2 ]
机构
[1] Univ Illinois, Champaign, IL 61820 USA
[2] Univ Illinois, Adobe Res, Champaign, IL USA
关键词
Auto-encoders; adaptive transforms; source separation; deep learning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Source separation and other audio applications have traditionally relied on the use of short-time Fourier transforms as a front-end frequency domain representation step. The unavailability of a neural network equivalent to forward and inverse transforms hinders the implementation of end-to-end learning systems for these applications. We develop an auto-encoder neural network that can act as an equivalent to short-time front-end transforms. We demonstrate the ability of the network to learn optimal, real-valued basis functions directly from the raw waveform of a signal and further show how it can be used as an adaptive front-end for supervised source separation. In terms of separation performance, these transforms significantly outperform their Fourier counterparts. Finally, we also propose and interpret a novel source to distortion ratio based cost function for end-to-end source separation.
引用
收藏
页码:684 / 688
页数:5
相关论文
共 50 条
  • [31] Wavesplit: End-to-End Speech Separation by Speaker Clustering
    Zeghidour, Neil
    Grangier, David
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2840 - 2849
  • [32] Resolution-Adaptive Source-Channel Coding for End-to-End Wireless Image Transmission
    Yang, Ke
    Wang, Sixian
    Tan, Kailin
    Dai, Jincheng
    Zhou, Dekun
    Niu, Kai
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 1460 - 1465
  • [33] Intelligent front-ends and product models
    Bjoerk, Bo-Christer
    International journal for artificial intelligence in engineering, 1991, 6 (01): : 46 - 56
  • [34] Integrated micromechanical radio front-ends
    Nguyen, Clark T. -C.
    2008 INTERNATIONAL SYMPOSIUM ON VLSI TECHNOLOGY, SYSTEMS AND APPLICATIONS (VLSI-TSA), PROCEEDINGS OF TECHNICAL PROGRAM, 2008, : 3 - +
  • [35] MAGNETIC RECORDING CHANNEL FRONT-ENDS
    KLAASSEN, KB
    IEEE TRANSACTIONS ON MAGNETICS, 1991, 27 (06) : 4503 - 4508
  • [36] Support for Authoring Service Front-Ends
    Paterno, Fabio
    Santoro, Carmen
    Spano, Lucio Davide
    EICS'09: PROCEEDINGS OF THE ACM SIGCHI SYMPOSIUM ON ENGINEERING INTERACTIVE COMPUTING SYSTEMS, 2009, : 85 - 90
  • [37] An adaptive front end for olfaction
    Apsel, A
    Stanford, T
    Hasler, P
    ISCAS '98 - PROCEEDINGS OF THE 1998 INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-6, 1998, : B107 - B110
  • [38] END-TO-END SPEECH RECOGNITION WITH ADAPTIVE COMPUTATION STEPS
    Li, Mohan
    Liu, Min
    Masanori, Hattori
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6246 - 6250
  • [39] Adaptive Feature Selection for End-to-End Speech Translation
    Zhang, Biao
    Titov, Ivan
    Haddow, Barry
    Sennrich, Rico
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 2533 - 2544
  • [40] A system for end-to-end authentication of adaptive multimedia content
    Suzuki, T
    Ramzan, Z
    Fujimoto, H
    Gentry, C
    Nakayama, T
    Jain, R
    COMMUNICATIONS AND MULTIMEDIA SECURITY, 2005, 175 : 237 - 249