A Large Scale Speech Sentiment Corpus

被引:0
|
作者
Chen, Eric Y. [1 ]
Lu, Zhiyun [2 ]
Xu, Hao [1 ]
Cao, Liangliang [1 ]
Zhang, Yu [1 ]
Fan, James [1 ]
机构
[1] Google Inc, New York, NY 10011 USA
[2] Univ Southern Calif, Los Angeles, CA 90007 USA
关键词
sentiment; switchboard; multimodal; speech;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We present a multimodal corpus for sentiment analysis based on the existing Switchboard-1 Telephone Speech Corpus released by the Linguistic Data Consortium. This corpus extends the Switchboard-1 Telephone Speech Corpus by adding sentiment labels from 3 different human annotators for every transcript segment. Each sentiment label can be one of three options: positive, negative, and neutral. Annotators are recruited using Google Cloud's data labeling service and the labeling task was conducted over the internet. The corpus contains a total of 49500 labeled utterances covering 140 hours of audio. To the best of our knowledge, this is the largest multimodal Corpus for sentiment analysis that includes both speech and text features.
引用
收藏
页码:6549 / 6555
页数:7
相关论文
共 50 条
  • [31] Morphological analysis of a large spontaneous speech corpus in Japanese
    Uchimoto, K
    Nobata, C
    Yamada, A
    Sekine, S
    Isahara, H
    41ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2003, : 479 - 488
  • [32] Morphological Annotation of a Large Spontaneous Speech Corpus in Japanese
    Uchimoto, Kiyotaka
    Isahara, Hitoshi
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 1731 - 1737
  • [33] A study on time-dependent voice quality variation in a large-scale single speaker speech corpus used for speech synthesis
    Kawai, H
    Tsuzaki, M
    PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 15 - 18
  • [34] JNAS: Japanese speech corpus for large vocabulary continuous speech recognition research
    Itou, Katunobu
    Yamamoto, Mikio
    Takeda, Kazuya
    Takezawa, Toshiyuki
    Matsuoka, Tatsuo
    Kobayashi, Tetsunori
    Shikano, Kiyohiro
    Itahashi, Shuichi
    Journal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi), 1999, 20 (03): : 199 - 206
  • [35] ATTENTION BASED ON-DEVICE STREAMING SPEECH RECOGNITION WITH LARGE SPEECH CORPUS
    Kim, Kwangyoun
    Lee, Kyungmin
    Gowda, Dhananjaya
    Park, Junmo
    Kim, Sungsoo
    Jin, Sichen
    Lee, Young-Yoon
    Yeo, Jinsu
    Kim, Daehyun
    Jung, Seokyeong
    Lee, Jungin
    Han, Myoungji
    Kim, Chanwoo
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 956 - 963
  • [36] A Large-Scale Corpus for Conversation Disentanglement
    Kummerfeld, Jonathan K.
    Athreya, Vignesh
    Patel, Siva Sankalp
    Gouravajhala, Sai R.
    Gunasekara, Chulaka
    Polymenakos, Lazaros
    Peper, Joseph J.
    Ganhotra, Jatin
    Lasecki, Walter S.
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3846 - 3856
  • [37] A Corpus for Large-Scale Phonetic Typology
    Salesky, Elizabeth
    Chodroff, Eleanor
    Pimentel, Tiago
    Wiesner, Matthew
    Cotterell, Ryan
    Black, Alan W.
    Eisner, Jason
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 4526 - 4546
  • [38] VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
    Wang, Changhan
    Riviere, Morgane
    Lee, Ann
    Wu, Anne
    Talnikar, Chaitanya
    Haziza, Daniel
    Williamson, Mary
    Pino, Juan
    Dupoux, Emmanuel
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 993 - 1003
  • [39] CSTD-Telugu Corpus: Crowd-Sourced Approach for Large-Scale Speech data collection
    Mirishkar, Ganesh S.
    Raju, Vishnu Vidyadhara V.
    Naroju, Meher Dinesh
    Maity, Sudhamay
    Yalla, Prakash
    Vuppala, Anil Kumar
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 511 - 517
  • [40] Large Scale Sentiment Analysis with Locality Sensitive BitHash
    Zhang, Wenhao
    Ji, Jianqiu
    Zhu, Jun
    Xu, Hua
    Zhang, Bo
    INFORMATION RETRIEVAL TECHNOLOGY, AIRS 2015, 2015, 9460 : 29 - 40