Automatic Pitch Accent Detection Using Long Short-Term Memory Neural Networks

被引:2
|
作者
Wu, Yizhi [1 ]
Li, Sha [1 ]
Li, Hongyan [1 ]
机构
[1] Donghua Univ, Coll Informat Sci & Technol, 2999 Renmin Rd North, Shanghai, Peoples R China
关键词
Pitch accent detection; LSTM; lexical and syntactic features; acoustic features;
D O I
10.1145/3364908.3365291
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prosody detection is gaining increasingly popularity in the domain of prosody research because of its significance in Text to Sound, Computer-aided pronunciation training (CAPT), etc. Pitch accent is an important part of prosody and many recognition models of both static and dynamic have been investigated for automatic labeling it. Recently, artificial neural networks, especially Recurrent Neural Networks (RNNs) have been applied in pitch accent detection. However, traditional recurrent neural networks are unable to learn and remember over long sequences due to the issue of back-propagated error decay. To solve this problem, this paper investigates the use of Long Short-Term Memory (LSTM) neural networks for automatic pitch accent detection. This paper encodes lexical and syntactic features as binary variables and uses syllable-based acoustic features including syllable duration, syllable energy, features related to the fundamental frequency. Our experimental results show that LSTM-RNNs for pitch accent detection achieves an accuracy of 89.0%, which is better than the results of using classical detection methods by about 83.2%.
引用
收藏
页码:41 / 45
页数:5
相关论文
共 50 条
  • [1] Short-Term Traffic Prediction Using Long Short-Term Memory Neural Networks
    Abbas, Zainab
    Al-Shishtawy, Ahmad
    Girdzijauskas, Sarunas
    Vlassov, Vladimir
    2018 IEEE INTERNATIONAL CONGRESS ON BIG DATA (IEEE BIGDATA CONGRESS), 2018, : 57 - 65
  • [2] Intrusion Detection Using Multilayer Perceptron and Neural Networks with Long Short-Term Memory
    Borisenko, B. B.
    Erokhin, S. D.
    Fadeev, A. S.
    Martishin, I. D.
    2021 SYSTEMS OF SIGNAL SYNCHRONIZATION, GENERATING AND PROCESSING IN TELECOMMUNICATIONS (SYNCHROINFO), 2021,
  • [3] Automatic Cause Inference of Construction Accident Using Long Short-Term Memory Neural Networks
    Wu, Hengqin
    Shen, Geoffrey Qiping
    Zhou, Zhenzong
    Li, Wenpeng
    Li, Xin
    CARBON PEAK AND NEUTRALITY STRATEGIES OF THE CONSTRUCTION INDUSTRY (ICCREM 2022), 2022, : 269 - 275
  • [4] Automatic temporal segment detection via bilateral long short-term memory recurrent neural networks
    Sun, Bo
    Cao, Siming
    He, Jun
    Yu, Lejun
    Li, Liandong
    JOURNAL OF ELECTRONIC IMAGING, 2017, 26 (02)
  • [5] Automatic Fall Detection Using Long Short-Term Memory Network
    Magalhaes, Carlos
    Ribeiro, Joao
    Leite, Argentina
    Pires, E. J. Solteiro
    Pavao, Joao
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2021, PT I, 2021, 12861 : 359 - 371
  • [6] Deepfake Detection using Capsule Networks and Long Short-Term Memory Networks
    Mehra, Akul
    Spreeuwers, Luuk
    Strisciuglio, Nicola
    VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 4: VISAPP, 2021, : 407 - 414
  • [7] Long Short-Term Memory Networks for Automatic Generation of Conversations
    Fujita, Tomohiro
    Bai, Wenjun
    Quan, Changqin
    2017 18TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNDP 2017), 2017, : 483 - 487
  • [8] Dialog State Tracking Using Long Short-term Memory Neural Networks
    Yang, Xiaohao
    Liu, Jia
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1800 - 1804
  • [9] Deflated reputation using multiplicative long short-term memory neural networks
    Ma, Yixuan
    Zhang, Zhenji
    Li, Deming
    Tang, Mincong
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2021, 118 : 198 - 207
  • [10] An Incremental Learning Approach Using Long Short-Term Memory Neural Networks
    Lemos Neto, Alvaro C.
    Coelho, Rodrigo A.
    de Castro, Cristiano L.
    JOURNAL OF CONTROL AUTOMATION AND ELECTRICAL SYSTEMS, 2022, 33 (05) : 1457 - 1465