A computer-aided speech analytics approach for pronunciation feedback using deep feature clustering

被引:0
|
作者
Faria Nazir
Muhammad Nadeem Majeed
Mustansar Ali Ghazanfar
Muazzam Maqsood
机构
[1] University of Engineering and Technology Taxila,Department of Software Engineering
[2] University of the Punjab,Department of Data Science
[3] University of East London,School of Architecture, Computing and Engineering
[4] COMSATS University Islamabad,Department of Computer Science
来源
Multimedia Systems | 2023年 / 29卷
关键词
Speech analytics; Deep convolutional neural network; Multimedia tools; Deep clustering; Phone variation model;
D O I
暂无
中图分类号
学科分类号
摘要
Nowadays, the demand for language learning is increasing because people need to communicate with other people belonging to different regions for their business deals, study, etc. During language learning, a lot of pronunciation mistakes occur due to unfamiliarity with a new language and differences in accent. In this paper, we perform speech mistakes analysis using deep feature-based clustering. We proposed two novel methods for speech analysis, one to deal with phonemic errors (confusing phonemes) and the other to deal with the prosodic errors (partially changed pronunciation variation of phones). For accurate and efficient language learning, it is important to learn both phonemic as well as prosodic error corrections. In our first method, we perform speech analysis by combining deep CNN features and clustering algorithm to detect the phonemic errors. We classify the phonemes using K-nearest neighbor, Naïve Bayes, and support vector machine (SVM). We perform experiments on the six most frequently mispronounced confusing pairs of Arabic to handle phonemic errors and achieve an accuracy of 94%. In our second method, we proposed the unsupervised phone variation model (PVM) to detect prosodic errors. In PVM, each phone is extended to represent the different types of pronunciation variation of that phone with different proficiency levels. We use an Arabic dataset of 28 individual phones for speech analysis and provide feedback based on the variation of each phone and achieves an accuracy of 97%.
引用
收藏
页码:1699 / 1715
页数:16
相关论文
共 50 条
  • [41] Using Computer-aided Landscaping
    Range, Kurt
    HORTSCIENCE, 2009, 44 (04) : 986 - 986
  • [42] SYSTEM APPROACH TO COMPUTER-AIDED DESIGN
    KOSTELIC, A
    STROJARSTVO, 1977, 19 (03): : 119 - 126
  • [43] COMPUTER-AIDED APPROACH TO ROUTE LOCATION
    ROY, GG
    COMPUTER-AIDED DESIGN, 1979, 11 (01) : 23 - 26
  • [44] AN APPROACH TO COMPUTER-AIDED PARAMETRIC DESIGN
    ROLLER, D
    COMPUTER-AIDED DESIGN, 1991, 23 (05) : 385 - 391
  • [45] An evolutionary approach to computer-aided orchestration
    Carpentier, Gregoire
    Tardieu, Damien
    Assayag, Gerard
    Rodet, Xavier
    Saint-James, Emmanuel
    APPLICATIONS OF EVOLUTIONARY COMPUTING, PROCEEDINGS, 2007, 4448 : 488 - +
  • [46] A COMPUTER-AIDED APPROACH TO MANUFACTURING QUALITY
    GETTINGS, M
    INDUSTRIAL ENGINEERING, 1990, 22 (03): : 18 - 21
  • [47] THE COMPUTER-AIDED APPROACH TO SUSCEPTIBILITY TESTING
    AUDONE, B
    GERBI, G
    IEEE TRANSACTIONS ON ELECTROMAGNETIC COMPATIBILITY, 1980, 22 (02) : 130 - 135
  • [48] AN APPROACH TO COMPUTER-AIDED DOCUMENT EXAMINATION
    BRZAKOVIC, D
    TOU, JT
    INTERNATIONAL JOURNAL OF COMPUTER & INFORMATION SCIENCES, 1985, 14 (06): : 365 - 385
  • [49] A COMPUTER-AIDED APPROACH FOR CONSTRUCTION BRIEFING
    Luo, X. C.
    Shen, Q. P.
    PROCEEDINGS OF THE SECOND INTERNATIONAL POSTGRADUATE CONFERENCE ON INFRASTRUCTURE AND ENVIRONMENT, VOL 2, 2010, : 143 - 154
  • [50] COMPUTER-AIDED PROTOTYPING - TRANSFORMATIONAL APPROACH
    HABRA, N
    INFORMATION AND SOFTWARE TECHNOLOGY, 1991, 33 (09) : 685 - 697