METHODS AND CHALLENGES FOR CREATING AN EMOTIONAL AUDIO-VISUAL DATABASE

Cited by: 0
Authors
Pandharipande, Meghna A. [1 ]
Chakraborty, Rupayan [1 ]
Kopparapu, Sunil Kumar [1 ]
Affiliation
[1] TCS Innovat Labs Mumbai, Yantra Pk, Thane West 400601, India
Source
2017 20TH CONFERENCE OF THE ORIENTAL CHAPTER OF THE INTERNATIONAL COORDINATING COMMITTEE ON SPEECH DATABASES AND SPEECH I/O SYSTEMS AND ASSESSMENT (O-COCOSDA) | 2017
Keywords
emotion database; speech emotion; visual expression; acted; spontaneous; induced; SPEECH;
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Emotion plays a very important role in human communication and can be expressed either verbally through speech (e.g., pitch, intonation, prosody) or through facial expressions, gestures, etc. Most contemporary human-computer interaction systems are deficient in interpreting this information and hence suffer from a lack of emotional intelligence. In other words, these systems are unable to identify a human's emotional state and hence are not able to react properly. To overcome these limitations, machines need to be trained using annotated emotional data samples. Motivated by this fact, we have attempted to collect and create an audio-visual emotional corpus. Audio-visual signals of multiple subjects were recorded while they watched either a presentation (with background music) or emotional video clips. After recording, subjects were asked to express how they felt and to read out sentences that appeared on the screen. The recorded data were annotated both by the subjects themselves (self-annotation) and by others.
Pages: 183-188
Number of pages: 6