End-to-End Learning for Physics-Based Acoustic Modeling

被引:6
|
作者
Gabrielli, Leonardo [1 ]
Tomassetti, Stefano [1 ]
Zinato, Carlo [2 ]
Piazza, Francesco [1 ]
机构
[1] Univ Politecn Marche, Dept Informat Engn, I-60121 Ancona, Italy
[2] Viscount Int SpA, I-47836 Mondaino, Italy
关键词
Physics-based acoustic modeling; end-to-end learning; convolutional neural networks; SOUND; ALGORITHM;
D O I
10.1109/TETCI.2017.2787125
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In past years, physics-based acoustic modeling developed theoretically to the point of yielding accurate understanding and description of a large number of acoustic phenomena, such as those involved in sound generation. Numerical algorithms have been proposed that are able to simulate these phenomena in real time with an acceptable computational cost, indeed reaching the market with commercial products. Sound synthesis based on physical models could benefit greatly from automated methods that require less specific know-how and save the sound-designer valuable time. This paper introduces a novel approach to parameter estimation in physics-based sound synthesis that is general and obtains good results based on an end-to-end computational intelligence paradigm. The approach is presented in a formal way and application to a practical use case is reported. Methodological issues, such as dataset generation, are investigated.
引用
收藏
页码:160 / 170
页数:11
相关论文
共 50 条
  • [41] End-to-end representation learning for Correlation Filter based tracking
    Valmadre, Jack
    Bertinetto, Luca
    Henriques, Joao
    Vedaldi, Andrea
    Torr, Philip H. S.
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5000 - 5008
  • [42] End-to-end Deep Reinforcement Learning Based Coreference Resolution
    Fei, Hongliang
    Li, Xu
    Li, Dingcheng
    Li, Ping
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 660 - 665
  • [43] An End-to-End Image Dehazing Method Based on Deep Learning
    Zhang, Yi
    Huang, Hongbing
    Liu, Junyi
    Fan, Chao
    Wang, Yanyan
    Cai, Qing
    Ruan, Yingying
    Gong, Xiaojin
    2018 3RD INTERNATIONAL CONFERENCE ON COMMUNICATION, IMAGE AND SIGNAL PROCESSING, 2019, 1169
  • [44] End-to-end acoustic modeling using convolutional neural networks for HMM-based automatic speech recognition
    Palaz, Dimitri
    Magimai-Doss, Mathew
    Collobert, Ronan
    SPEECH COMMUNICATION, 2019, 108 : 15 - 32
  • [45] STATISTICAL LEARNING FOR END-TO-END SIMULATIONS
    Vicent, J.
    Verrelst, J.
    Rivera-Caicedo, J. P.
    Sabater, N.
    Munoz-Mari, J.
    Camps-Valls, G.
    Moreno, J.
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 1699 - 1702
  • [46] END-TO-END LEARNING FOR MUSIC AUDIO
    Dieleman, Sander
    Schrauwen, Benjamin
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [47] End-to-end Learning for Graph Decomposition
    Song, Jie
    Andres, Bjoern
    Black, Michael J.
    Hilliges, Otmar
    Tang, Siyu
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 10092 - 10101
  • [48] Amharic OCR: An End-to-End Learning
    Belay, Birhanu
    Habtegebrial, Tewodros
    Meshesha, Million
    Liwicki, Marcus
    Belay, Gebeyehu
    Stricker, Didier
    APPLIED SCIENCES-BASEL, 2020, 10 (03):
  • [49] The Predictron: End-To-End Learning and Planning
    Silver, David
    van Hasselt, Hado
    Hessel, Matteo
    Schaul, Tom
    Guez, Arthur
    Harley, Tim
    Dulac-Arnold, Gabriel
    Reichert, David
    Rabinowitz, Neil
    Barret, Andre
    Degris, Thomas
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [50] END-TO-END LEARNING OF COMPRESSIBLE FEATURES
    Singh, Saurabh
    Abu-El-Haija, Sami
    Johnston, Nick
    Balle, Johannes
    Shrivastava, Abhinav
    Toderici, George
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3349 - 3353