EVALUATION OF MIMICKED SPEECH USING PROSODIC FEATURES

被引:0
|
作者
Mary, Leena [1 ]
Babu, Anish K. K. [1 ]
Joseph, Aju [1 ]
George, Gibin M. [1 ]
机构
[1] Rajiv Gandhi Inst Technol, Kottayam 686501, Kerala, India
关键词
Prosody; intonation; mimicked speech; legendre coefficients; dynamic time warping; LANGUAGE;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we describe a technique for evaluating the quality of mimicked speech. In other words, mimicry artists are evaluated based on their competences to mimic a particular person. This evaluation is done based on prosodic characteristics for the text dependent cases. Prosodic characteristics are represented using features derived from pitch contour, duration and energy. In this work, prosodic features are extracted from speech after automatically segmenting into intonational phrases. Pitch contour corresponding to each phrase is approximated using weighted sum of legendre polynomials. Prosodic feature set includes weights of first four legendre polynomials (w(0k), w(1k), w(2k), w(3k)), average jitter, average shimmer, voiced duration, total duration and change in energy of each intonation phrase. The effectiveness of the technique is demonstrated using a text dependent database of mimicked speeches. Evaluation is done by dynamic time warping of prosodic features derived from the mimicked speech and the original speech. The scores obtained from this evaluation is compared with the results of manual perception/listening tests, which clearly indicate the effectiveness of the proposed technique.
引用
收藏
页码:7189 / 7193
页数:5
相关论文
共 50 条
  • [41] Hierarchical emotion recognition from speech using source, power spectral and prosodic features
    Haque, Arijul
    Rao, K. Sreenivasa
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 19629 - 19661
  • [42] Integrating Disfluency-based and Prosodic Features with Acoustics in Automatic Fluency Evaluation of Spontaneous Speech
    Deng, Huaijin
    Lin, Youchao
    Utsuro, Takehito
    Kobayashi, Akio
    Nishizaki, Hiromitsu
    Hoshino, Junichi
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6429 - 6437
  • [43] Speech/Non-Speech Segments Detection Based On Chaotic and Prosodic Features
    Shafiee, Soheil
    Almasganj, Farshad
    Jafari, Ayyoob
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 111 - 114
  • [44] SPEECH REHABILITATION: RESPIRATORY AND PROSODIC FEATURES IN READING LITERATURE IN MANDARIN
    Yang, Feng
    MEDICINE, 2023, 102 (30) : 65 - 65
  • [45] CHANGES IN PROSODIC FEATURES OF SPEECH DUE TO ENVIRONMENTAL-FACTORS
    VILKMAN, E
    MANNINEN, O
    SPEECH COMMUNICATION, 1986, 5 (3-4) : 331 - 345
  • [46] Spectral and prosodic features-based speech pattern classification
    Sinha, Shweta
    Jain, Aruna
    Agrawal, S. S.
    INTERNATIONAL JOURNAL OF APPLIED PATTERN RECOGNITION, 2015, 2 (01) : 96 - 110
  • [47] Effects of the prosodic features of speech sound upon the personality impressions
    Uchida, Teruhisa
    Uchida, Chiharu
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2008, 43 (3-4) : 588 - 588
  • [48] Speech emotion recognition based on prosodic segment level features
    Han, Wenjing
    Li, Haifeng
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2009, 49 (SUPPL. 1): : 1363 - 1368
  • [49] Discrimination Capability of Prosodic and Spectral Features for Emotional Speech Recognition
    Delic, V.
    Bojanic, M.
    Gnjatovic, M.
    Secujski, M.
    Jovicic, S. T.
    ELEKTRONIKA IR ELEKTROTECHNIKA, 2012, 18 (09) : 51 - 54
  • [50] SPEECH SYNTHESIS USING SEGMENTAL AND PROSODIC PHONEMES
    MANDURAH, MM
    JOURNAL OF ENGINEERING SCIENCES, 1985, 11 (01): : 79 - 90