Utilizing Large Twitter Corpora to Create Sentiment Lexica

被引:0
|
作者
Fredriksen, Valerij [1 ]
Jahren, Brage [1 ]
Gamback, Bjorn [1 ]
机构
[1] Norwegian Univ Sci & Technol, Dept Comp Sci, Trondheim, Norway
关键词
Pointwise Mutual Information; Sentiment lexica; Lexicon-based sentiment analysis;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The paper describes an automatic Twitter sentiment lexicon creator and a lexicon-based sentiment analysis system. The lexicon creator is based on a Pointwise Mutual Information approach, utilizing 6.25 million automatically labeled tweets and 103 million unlabeled, with the created lexicon consisting of about 3 000 entries. In a comparison experiment, this lexicon beat a manually annotated lexicon. A sentiment analysis system utilizing the created lexicon, and handling both negation and intensification, produces results almost on par with sophisticated machine learning-based systems, while significantly outperforming those in terms of run-time.
引用
收藏
页码:2829 / 2836
页数:8
相关论文
共 50 条
  • [1] Sentiment Lexica from Paired Comparisons
    Dalitz, Christoph
    Bednarek, Katrin E.
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 924 - 930
  • [2] Semi-supervised Sentiment Annotation of Large Corpora
    Brum, Henrico Bertini
    Volpe Nunes, Maria das Gracas
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2018, 2018, 11122 : 385 - 395
  • [3] Automatic generation of lexica for sentiment polarity shifters
    Schulder, Marc
    Wiegand, Michael
    Ruppenhofer, Josef
    NATURAL LANGUAGE ENGINEERING, 2021, 27 (02) : 153 - 179
  • [4] Twitter Sentiment Polarity Analysis: A Novel Approach for Improving the Automated Labeling in a Text Corpora
    Tapia, Pablo A.
    Velasquez, Juan D.
    ACTIVE MEDIA TECHNOLOGY, AMT 2014, 2014, 8610 : 274 - 285
  • [5] Utilizing Deep Learning in Arabic Text Classification Sentiment Analysis of Twitter
    Ibrahim, Nehad M.
    Yafooz, Wael M. S.
    Emara, Abdel-Hamid M.
    Abdel-Wahab, Ahmed
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (12) : 830 - 838
  • [6] HYBRID FEATURE SELECTION FRAMEWORK FOR SENTIMENT ANALYSIS ON LARGE CORPORA
    Adewole, Kayode S.
    Balogun, Abdullateef O.
    Raheem, Muiz O.
    Jimoh, Muhammed K.
    Jimoh, Rasheed G.
    Mabayoje, Modinat A.
    Usman-Hamza, Fatima E.
    Akintola, Abimbola G.
    Asaju-Gbolagade, Ayisat W.
    JORDANIAN JOURNAL OF COMPUTERS AND INFORMATION TECHNOLOGY, 2021, 7 (02): : 130 - 151
  • [7] Towards Producing Bilingual Lexica from Monolingual Corpora
    Han, Jingyi
    Bel, Nuria
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 2222 - 2227
  • [8] AffinityFinder: A System for Deriving Hidden Affinity Relationships on Twitter Utilizing Sentiment Analysis
    Rezgui, Abdelmounaam
    Fahey, Daniel
    Smith, Ian
    2016 IEEE 4TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD WORKSHOPS (FICLOUDW), 2016, : 212 - 215
  • [9] Utilizing Tweet Content for the Detection of Sentiment-based Interaction Communities on Twitter
    Jan Lam, Alron
    Cheng, Charibeth
    2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018, : 682 - 691
  • [10] The arrangement of sentiment lexica in the space of distributed word representations
    Razova, Elena, V
    Kotelnikov, Evgeny, V
    PROCEEDINGS OF 2018 IEEE 17TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC 2018), 2018, : 240 - 245