Direction Estimation of Instrumental Sound Sources Using Regression Analysis by Convolutional Neural Network

被引:0
|
作者
Yamamoto, Kaho [1 ]
Ogihara, Akio [2 ]
Murata, Harumi [3 ]
机构
[1] Shonan Inst Technol, Fac Informat, 1-1-25 Tsujido Nishikaigan, Fujisawa, Kanagawa 2518511, Japan
[2] Kindai Univ, Fac Engn, 1 Takaya Umenobe, Higashihiroshima, Hiroshima 7392116, Japan
[3] Chukyo Univ, Sch Engn, 101 Tokodachi,Kaizu Cho, Toyota 4700393, Japan
关键词
Direction estimation of sound source; Instrument sound; Overtone structure; MUSIC spectrum; CNN; Regression analysis;
D O I
10.1007/s00034-023-02433-z
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
There has been much research on estimating noise and speech source direction, but there have not been many studies on estimating the source direction of instrumental sound sources. In this study, we considered the source direction estimation of a single instrumental sound. Direction estimation of sound sources by the multiple signal classification (MUSIC) method often causes large estimation errors. Then, we propose a technique for estimating the direction of musical instrument sound sources by applying regression analysis using a convolutional neural network (CNN), a type of neural network. We calculated the MUSIC spectrum obtained using MUSIC that uses the fundamental and harmonic components, which have relatively large amplitudes, and we estimated the direction of the sound source using the CNN with these components as input. We achieved this by focusing on the overtone structure of the instrumental sound source. This study demonstrated the effectiveness of this method using simulations in a monaural environment.
引用
收藏
页码:7004 / 7021
页数:18
相关论文
共 50 条
  • [21] Estimation of human metabolic age using regression and neural network analysis
    Korkushko, O., V
    Pysaruk, A., V
    Chyzhova, V. P.
    ZAPOROZHYE MEDICAL JOURNAL, 2021, 23 (01) : 60 - 64
  • [22] Direction Estimation of Multiple Sound Sources Using Circular Probability Distributions
    Nam, Seung-Hyon
    Kim, Yong-Hoh
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2011, 30 (06): : 308 - 314
  • [23] Direction of Arrival Estimation of Moving Sound Sources using Deep Learning
    Rusrus, Jana
    Bouchard, Martin
    Shirmohammadi, Shervin
    2022 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC 2022), 2022,
  • [24] A modular neural network for direction-of-arrival estimation of two sources
    Ofek, Gal
    Tabrikian, Joseph
    Aladjem, Mayer
    NEUROCOMPUTING, 2011, 74 (17) : 3092 - 3102
  • [25] Multi-speaker Direction of Arrival Estimation Using Audio and Visual Modalities with Convolutional Neural Network
    Wu, Yulin
    Hu, Ruimin
    Wang, Xiaochen
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 636 - 641
  • [26] Convolutional Neural Network- based Direction-of-Arrival Estimation using Stereo Microphones for Drone
    Choi, Jeonghwan
    Chang, Joon-Hyuk
    2020 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2020,
  • [27] Direction of Arrival Estimation in Terahertz Communications using Convolutional Neural Networks
    Abdullah, Mariam
    Li, Mingxiang Stephen
    He, Jiayuan
    Wang, Ke
    Fumeaux, Christophe
    Withayachumnankul, Withawat
    2024 49TH INTERNATIONAL CONFERENCE ON INFRARED, MILLIMETER, AND TERAHERTZ WAVES, IRMMW-THZ 2024, 2024,
  • [28] Head Pose Estimation Using Convolutional Neural Network
    Lee, Seungsu
    Saitoh, Takeshi
    IT CONVERGENCE AND SECURITY 2017, VOL 1, 2018, 449 : 164 - 171
  • [29] Improving Precipitation Estimation Using Convolutional Neural Network
    Pan, Baoxiang
    Hsu, Kuolin
    AghaKouchak, Amir
    Sorooshian, Soroosh
    WATER RESOURCES RESEARCH, 2019, 55 (03) : 2301 - 2321
  • [30] Food Calorie Estimation using Convolutional Neural Network
    Kasyap, V. Balaji
    Jayapandian, N.
    ICSPC'21: 2021 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICPSC), 2021, : 666 - 670