Group Communication With Context Codec for Lightweight Source Separation

Cited by: 19
Authors
Luo, Yi [1]
Han, Cong [1]
Mesgarani, Nima [1]
Affiliations
[1] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
Keywords
Codecs; Context modeling; Decoding; Complexity theory; Pipelines; Neural networks; Convolutional codes; Source separation; deep learning; lightweight; group communication; context codec; SPEECH ENHANCEMENT; PATH RNN
DOI
10.1109/TASLP.2021.3078640
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Despite recent progress on neural network architectures for speech separation, balancing model size, model complexity, and model performance remains an important and challenging problem when deploying such models on low-resource platforms. In this paper, we propose two simple modules, group communication and context codec, that can be easily applied to a wide range of architectures to jointly decrease model size and complexity without sacrificing performance. A group communication module splits a high-dimensional feature into groups of low-dimensional features and captures the inter-group dependency. A separation module with a significantly smaller model size can then be shared by all the groups. A context codec module, containing a context encoder and a context decoder, is designed as a learnable downsampling and upsampling module that decreases the length of the sequential feature processed by the separation module. The combination of the group communication and context codec modules is referred to as the GC3 design. Experimental results show that applying GC3 to multiple network architectures for speech separation can achieve on-par or better performance with as little as 2.5% of the model size and 17.6% of the model complexity.
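The data flow described in the abstract can be illustrated with a minimal, dependency-free Python sketch. It is not the authors' implementation: the learned modules are replaced by fixed toy operations (mean pooling for the context encoder, frame repetition for the context decoder, adding the across-group mean for group communication), and all function names and parameters (`gc3_sketch`, `num_groups`, `context`, `shared_module`) are hypothetical, chosen only for illustration.

```python
from typing import Callable, List

Vector = List[float]
Sequence = List[Vector]  # time steps x feature dimension


def split_into_groups(frame: Vector, num_groups: int) -> List[Vector]:
    """Split one high-dimensional frame into equal low-dimensional groups."""
    if len(frame) % num_groups != 0:
        raise ValueError("feature dimension must be divisible by num_groups")
    size = len(frame) // num_groups
    return [frame[i * size:(i + 1) * size] for i in range(num_groups)]


def group_communication(groups: List[Vector]) -> List[Vector]:
    """Toy inter-group mixing (assumption): add the across-group mean to each group.

    The paper's module is learned; this fixed rule only shows where
    information is exchanged between groups.
    """
    size = len(groups[0])
    mean = [sum(g[d] for g in groups) / len(groups) for d in range(size)]
    return [[g[d] + mean[d] for d in range(size)] for g in groups]


def context_encode(seq: Sequence, context: int) -> Sequence:
    """Toy downsampling (assumption): average every `context` consecutive frames."""
    out: Sequence = []
    for start in range(0, len(seq), context):
        block = seq[start:start + context]
        dim = len(block[0])
        out.append([sum(f[d] for f in block) / len(block) for d in range(dim)])
    return out


def context_decode(seq: Sequence, context: int, length: int) -> Sequence:
    """Toy upsampling (assumption): repeat each frame `context` times, then trim."""
    upsampled = [list(frame) for frame in seq for _ in range(context)]
    return upsampled[:length]


def gc3_sketch(
    seq: Sequence,
    num_groups: int,
    context: int,
    shared_module: Callable[[Vector], Vector],
) -> Sequence:
    """Shorten the sequence, process groups with one shared module, restore length."""
    short = context_encode(seq, context)
    processed: Sequence = []
    for frame in short:
        groups = group_communication(split_into_groups(frame, num_groups))
        # The same small module is applied to every group (weight sharing).
        groups = [shared_module(g) for g in groups]
        processed.append([x for g in groups for x in g])
    return context_decode(processed, context, len(seq))
```

For example, calling `gc3_sketch(features, num_groups=4, context=3, shared_module=f)` on a 6-frame, 8-dimensional input runs `f` on 2-dimensional groups over only 2 frames, which is the source of the size and complexity savings the abstract refers to.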
Pages: 1752-1761
Page count: 10
Related papers
50 in total
  • [1] ULTRA-LIGHTWEIGHT SPEECH SEPARATION VIA GROUP COMMUNICATION
    Luo, Yi
    Han, Cong
    Mesgarani, Nima
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 16 - 20
  • [2] Lightweight Membership Management Scheme for Lightweight Group Communication Platforms
    Alrashed, Saleh
    Abuelyaman, Eltayeb
    ICCMB 2019 - THE 2ND INTERNATIONAL CONFERENCE ON COMPUTERS IN MANAGEMENT AND BUSINESS, 2019, : 81 - 86
  • [3] A Lightweight Certificate-based Source Authentication Protocol for Group Communication in Hybrid Wireless/Satellite Networks
    Roy-Chowdhury, Ayan
    Baras, John S.
    GLOBECOM 2008 - 2008 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, 2008,
  • [4] Provable secure dynamic lightweight group communication in VANETs
    Naresh, Vankamamidi Srinivasa
    Reddi, Sivaranjani
    Allavarpu, V. V. L. Divakar
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2024, 35 (04)
  • [5] Group Blind Source Separation (GBSS)
    Wang, Dong
    Shen, Haipeng
    Truong, Young K.
    INTERNATIONAL WORK-CONFERENCE ON TIME SERIES (ITISE 2014), 2014, : 28 - 39
  • [6] Separation of Communication Signals Based on Underdetermined Blind Source Separation
    Guo, Xiaotao
    Wang, Xing
    Zhang, Ying
    PROCEEDINGS OF THE 2016 5TH INTERNATIONAL CONFERENCE ON ENVIRONMENT, MATERIALS, CHEMISTRY AND POWER ELECTRONICS, 2016, 84 : 258 - 263
  • [7] Source authentication in group communication systems
    Zhao, X
    Prakash, A
    14TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2003, : 455 - 459
  • [8] AMOUN: Asymmetric lightweight cryptographic scheme for wireless group communication
    Mansour, Ahmad
    Malik, Khalid M.
    Kaso, Niko
    COMPUTER COMMUNICATIONS, 2021, 169 : 154 - 167
  • [9] Blind source separation algorithm for communication complex signals in communication reconnaissance
    State Key Laboratory of Integrated Service Network, Xidian University, Xi'an 710071, China
    Unknown
    Huazhong Ligong Daxue Xuebao, 2007, 4 (33-36):
  • [10] Designing a unified speech/audio codec by adopting a single channel harmonic source separation module
    Shin, Sang-Wook
    Lee, Chang-Heon
    Oh, Hyen-O
    Kang, Hong-Goo
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 185 - +