A Perceptual Evaluation of Generative Adversarial Network Real-time Synthesized Drum Sounds in a Virtual Environment

被引:2
|
作者
Chang, Minwook [1 ]
Kim, Youngwon Ryan [1 ]
Kim, Gerard Jounghyun [1 ]
机构
[1] Korea Univ, Seoul, South Korea
关键词
Generation of immersive environments and virtual worlds; Multimodal interaction and experiences in VR/AR; Machine learning for multimodal interaction;
D O I
10.1109/AIVR.2018.00030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conventional methods of real time sound effects in 3D graphical and virtual environments relied upon preparing all the needed samples ahead of time and simply replaying them as needed, or parametrically modifying a basic set of samples using physically based techniques such as the spring-damper simulation and modal analysis/synthesis. In this work, we propose to apply the generative adversarial network (GAN) approach to the problem at hand, with which only one generator is trained to produce the needed sounds fast with perceptually indifferent quality. Otherwise, with the conventional methods, separate and approximate models would be needed to deal with different material properties and contact types, and manage real time performance. We demonstrate our claim by training a GAN (more specifically WaveGAN) with sounds of different drums and synthesizing the sounds on the fly for a virtual drum playing environment. The perceptual test revealed that the subjects could not discern the synthesized sounds from the ground truth nor perceived any noticeable delay upon the corresponding physical event.
引用
收藏
页码:144 / 148
页数:5
相关论文
共 50 条
  • [41] A Real-Time Virtual Integration Environment for Neuroprosthetics and Rehabilitation
    Armiger, Robert S.
    Tenore, Francesco V.
    Bishop, William E.
    Beaty, James D.
    Bridges, Michael M.
    Burck, James M.
    Vogelstein, R. Jacob
    Harshbarger, Stuart D.
    JOHNS HOPKINS APL TECHNICAL DIGEST, 2011, 30 (03): : 198 - 206
  • [42] Real-time fluid simulation in a dynamic virtual environment
    Chen, JX
    Lobo, ND
    Hughes, CE
    Moshell, JM
    IEEE COMPUTER GRAPHICS AND APPLICATIONS, 1997, 17 (03) : 52 - 61
  • [43] EVALUATION OF REAL-TIME CEPSTRA FOR FAST DETECTION OF VOICED SOUNDS
    TIMME, M
    IDLER, H
    LAY, T
    NACHRICHTENTECHNISCHE ZEITSCHRIFT, 1973, 26 (07): : 312 - 316
  • [44] THE NETWORK ADVISER ANALYSIS AND REAL-TIME ENVIRONMENT
    BHAT, S
    HEWLETT-PACKARD JOURNAL, 1992, 43 (05): : 29 - 33
  • [45] Network advisor analysis and real-time environment
    Bhat, Sunil
    Hewlett-Packard Journal, 1992, 43 (05):
  • [46] Multi-style Generative Network for Real-Time Transfer
    Zhang, Hang
    Dana, Kristin
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 349 - 365
  • [47] A Robust Technique for Real-Time Face Verification with a Generative Network
    Akkaya, Ibrahim Batuhan
    Karaman, Kaan
    REAL-TIME IMAGE PROCESSING AND DEEP LEARNING 2020, 2020, 11401
  • [48] User-centered design and evaluation of a real-time battlefield visualization virtual environment
    Hix, D
    Swan, JE
    Gabbard, JL
    McGee, M
    Durbin, J
    King, T
    IEEE VIRTUAL REALITY - PROCEEDINGS, 1999, : 96 - 103
  • [49] User-centered design and evaluation of a real-time battlefield visualization virtual environment
    Hix, Deborah
    Swan II, J.Edward
    Gabbard, Joseph L.
    McGee, Mike
    Durbin, Jim
    King, Tony
    Proceedings - Virtual Reality Annual International Symposium, 1999, : 96 - 103
  • [50] Generative adversarial network-based semi-supervised learning for real-time risk warning of process industries
    He, Rui
    Li, Xinhong
    Chen, Guoming
    Chen, Guoxing
    Liu, Yiwei
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 150