Attention-Inspired Artificial Neural Networks for Speech Processing: A Systematic Review

被引:12
|
作者
Zacarias-Morales, Noel [1 ]
Pancardo, Pablo [1 ]
Hernandez-Nolasco, Jose Adan [1 ]
Garcia-Constantino, Matias [2 ]
机构
[1] Juarez Autonomous Univ Tabasco, Acad Div Sci & Informat Technol, Cunduacan 86690, Tabasco, Mexico
[2] Ulster Univ, Sch Comp, Jordanstown BT37 0QB, North Ireland
来源
SYMMETRY-BASEL | 2021年 / 13卷 / 02期
关键词
artificial neural networks; deep learning; attention; speech; systematic review;
D O I
10.3390/sym13020214
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Artificial Neural Networks (ANNs) were created inspired by the neural networks in the human brain and have been widely applied in speech processing. The application areas of ANN include: Speech recognition, speech emotion recognition, language identification, speech enhancement, and speech separation, amongst others. Likewise, given that speech processing performed by humans involves complex cognitive processes known as auditory attention, there has been a growing amount of papers proposing ANNs supported by deep learning algorithms in conjunction with some mechanism to achieve symmetry with the human attention process. However, while these ANN approaches include attention, there is no categorization of attention integrated into the deep learning algorithms and their relation with human auditory attention. Therefore, we consider it necessary to have a review of the different ANN approaches inspired in attention to show both academic and industry experts the available models for a wide variety of applications. Based on the PRISMA methodology, we present a systematic review of the literature published since 2000, in which deep learning algorithms are applied to diverse problems related to speech processing. In this paper 133 research works are selected and the following aspects are described: (i) Most relevant features, (ii) ways in which attention has been implemented, (iii) their hypothetical relationship with human attention, and (iv) the evaluation metrics used. Additionally, the four publications most related with human attention were analyzed and their strengths and weaknesses were determined.
引用
收藏
页码:1 / 43
页数:46
相关论文
共 50 条
  • [41] Editorial: Artificial Neural Networks as Models of Neural Information Processing
    van Gerven, Marcel
    Bohte, Sander
    FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2017, 11
  • [42] SYNAPTIC DEPRESSION IN DEEP NEURAL NETWORKS FOR SPEECH PROCESSING
    Zhang, Wenhao
    Li, Hanyu
    Yang, Minda
    Mesgarani, Nima
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5865 - 5869
  • [43] Introduction to the Special Issue on Neural Networks for Speech Processing
    Gorin, A. L.
    Mammone, R. J.
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (01): : 113 - 114
  • [44] Recognition and Processing of Speech Signals Using Neural Networks
    Douglas O’Shaughnessy
    Circuits, Systems, and Signal Processing, 2019, 38 : 3454 - 3481
  • [45] Generative adversarial networks for speech processing: A review
    Wali, Aamir
    Alamgir, Zareen
    Karim, Saira
    Fawaz, Ather
    Ali, Mubariz Barkat
    Adan, Muhammad
    Mujtaba, Malik
    COMPUTER SPEECH AND LANGUAGE, 2022, 72
  • [46] Applications of neural networks in speech processing for Romanian language
    Gavat, I
    2002 6TH SEMINAR ON NEURAL NETWORK APPLICATIONS IN ELECTRICAL ENGINEERING, PROCEEDINGS, 2002, : 65 - 70
  • [47] Recognition and Processing of Speech Signals Using Neural Networks
    O'Shaughnessy, Douglas
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (08) : 3454 - 3481
  • [48] Optimizing information processing in brain-inspired neural networks
    Paprocki, B.
    Pregowska, A.
    Szczepanski, J.
    BULLETIN OF THE POLISH ACADEMY OF SCIENCES-TECHNICAL SCIENCES, 2020, 68 (02) : 225 - 233
  • [49] Image processing with neural networks - a review
    Egmont-Petersen, M
    de Ridder, D
    Handels, H
    PATTERN RECOGNITION, 2002, 35 (10) : 2279 - 2301
  • [50] Artificial neural networks applied for solidified soils data prediction: a bibliometric and systematic review
    Pacheco, Vinicius Luiz
    Bragagnolo, Lucimara
    Thome, Antonio
    ENGINEERING COMPUTATIONS, 2021, 38 (07) : 3104 - 3131