END-TO-END SOURCE SEPARATION WITH ADAPTIVE FRONT-ENDS

Cited by: 0
Authors
Venkataramani, Shrikant [1]
Casebeer, Jonah [1]
Smaragdis, Paris [2]
Affiliations
[1] Univ Illinois, Champaign, IL 61820 USA
[2] Univ Illinois, Adobe Res, Champaign, IL USA
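Keywords
Auto-encoders; adaptive transforms; source separation; deep learning
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology; computer technology]
Discipline classification code
0812
Abstract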
Source separation and other audio applications have traditionally relied on the use of short-time Fourier transforms as a front-end frequency domain representation step. The unavailability of a neural network equivalent to forward and inverse transforms hinders the implementation of end-to-end learning systems for these applications. We develop an auto-encoder neural network that can act as an equivalent to short-time front-end transforms. We demonstrate the ability of the network to learn optimal, real-valued basis functions directly from the raw waveform of a signal and further show how it can be used as an adaptive front-end for supervised source separation. In terms of separation performance, these transforms significantly outperform their Fourier counterparts. Finally, we also propose and interpret a novel source to distortion ratio based cost function for end-to-end source separation.
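As a rough illustration of what a source-to-distortion-ratio cost can look like, the sketch below uses the common textbook definition SDR = 10 log10(||s||^2 / ||s - s_hat||^2) and negates it so that a lower value means better separation. This is an assumption made for illustration: the abstract does not give the exact form of the proposed cost function, and the names `sdr_db`, `neg_sdr_loss`, and the `eps` stabilizer are hypothetical.

```python
import numpy as np


def sdr_db(reference, estimate, eps=1e-12):
    """Source-to-distortion ratio in dB (common definition, assumed here).

    SDR = 10 * log10(||s||^2 / ||s - s_hat||^2)
    `eps` is a small stabilizer (an assumption) that avoids division by zero.
    """
    reference = np.asarray(reference, dtype=float)
    estimate = np.asarray(estimate, dtype=float)
    signal_power = np.sum(reference ** 2)
    error_power = np.sum((reference - estimate) ** 2)
    return 10.0 * np.log10((signal_power + eps) / (error_power + eps))


def neg_sdr_loss(reference, estimate):
    """Negated SDR, so that minimizing the loss maximizes SDR."""
    return -sdr_db(reference, estimate)


if __name__ == "__main__":
    # Toy example: a sine "source" and an estimate with a 10% amplitude error.
    t = np.arange(8000) / 8000.0
    source = np.sin(2.0 * np.pi * 440.0 * t)
    estimate = 1.1 * source
    print(f"SDR: {sdr_db(source, estimate):.2f} dB")
    print(f"loss: {neg_sdr_loss(source, estimate):.2f}")
```

Because the loss is computed on time-domain waveforms, it can in principle sit at the output of a network that maps raw audio to raw audio, which is the setting the abstract describes.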
Pages: 684-688
Page count: 5