Automatically Constructing a Fine-Grained Sentiment Lexicon for Sentiment Analysis

被引:8
|
作者
Wang, Yabing [1 ]
Huang, Guimin [1 ,2 ]
Li, Maolin [1 ]
Li, Yiqun [1 ]
Zhang, Xiaowei [1 ]
Li, Hui [1 ]
机构
[1] Guilin Univ Elect Technol, Sch Comp Sci & Informat Secur, Guilin 541004, Guangxi, Peoples R China
[2] Guilin Univ Elect Technol, Guangxi Key Lab Image & Graph Intelligent Proc, Guilin 541004, Guangxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Sentiment lexicon; Sentiment analysis; Neural network model; Lexicon construction; STRENGTH DETECTION; EMOTION;
D O I
10.1007/s12559-022-10043-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis is an important research area in natural language processing (NLP), and the performance of sentiment analysis models is largely influenced by the quality of sentiment lexicons. Existing sentiment lexicons contain only the sentiment information of words. In this paper, we propose an approach for automatically constructing a fine-grained sentiment lexicon that contains both emotion information and sentiment information to solve the problem that the emotion and sentiment of texts cannot be jointly analyzed. We design an emotion-sentiment transfer method and construct a fine-grained sentiment seed lexicon, and we then expand the sentiment seed lexicon by applying the graph dissemination method to the synonym set. Subsequently, we propose a multi-information fusion method based on neural network to expand the sentiment lexicon based on a corpus. Finally, we generate the Fine-Grained Sentiment Lexicon (FGSL), which contains 40,554 words. FGSL achieves F1 values of 61.97%, 69.58%, and 66.99% on three emotion datasets and 88.19%, 89.31%, and 86.88% on three sentiment datasets. Experimental results on multiple public benchmark datasets illustrate that FGSL achieves significantly better performance in both emotion analysis and sentiment analysis tasks.
引用
收藏
页码:254 / 271
页数:18
相关论文
共 50 条
  • [41] Fine-grained emoji sentiment analysis based on attributes of Twitter users
    Sun, Xiaoyu
    Li, Huakang
    Sun, Guozi
    Zhu, Ming
    2020 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD 2020), 2020, : 134 - 139
  • [42] Fine-grained analysis of explicit and implicit sentiment in financial news articles
    Van de Kauter, Marjan
    Breesch, Diane
    Hoste, Veronique
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (11) : 4999 - 5010
  • [43] Fine-Grained Sentiment Analysis of Multi-domain Online Reviews
    Theodoropoulos, Panagiotis
    Alexandris, Christina
    HUMAN-COMPUTER INTERACTION: TECHNOLOGICAL INNOVATION, PT II, 2022, 13303 : 264 - 278
  • [44] Unsupervised Fine-Grained Sentiment Analysis System Using Lexicons and Concepts
    Ofek, Nir
    Rokach, Lior
    SEMANTIC WEB EVALUATION CHALLENGE, 2014, 475 : 28 - 33
  • [45] Coarse- and Fine-Grained Sentiment Analysis of Social Media Text
    Fink, Clayton R.
    Chou, Danielle S.
    Kopecky, Jonathon J.
    Llorens, Ashley J.
    JOHNS HOPKINS APL TECHNICAL DIGEST, 2011, 30 (01): : 22 - 30
  • [46] An Unsupervised Fine-grained Sentiment Analysis Model for Chinese Online Reviews
    Shi, Hanxiao
    Zhou, Guodong
    Qian, Peide
    Li, Xiaojun
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2012, 15 (10): : 4277 - 4294
  • [47] Fine-Grained Sentiment Analysis Based on SSFF-GCN Model
    Zhao, Yuexu
    Fang, Junjie
    Jin, Shaolong
    SYSTEMS, 2025, 13 (02):
  • [48] A fine-grained labeled dataset for textual sentiment analysis in technical education
    Singh, Manoj
    Panwar, Subhash
    Choudhary, Sanju
    DATA IN BRIEF, 2024, 57
  • [49] EmoTweet-28: A Fine-Grained Emotion Corpus for Sentiment Analysis
    Yan, Jasy Liew Suet
    Turtle, Howard R.
    Liddy, Elizabeth D.
    LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 1149 - 1156
  • [50] Design and Evaluation of SentiEcon: a fine-grained Economic/Financial Sentiment Lexicon from a Corpus of Business News
    Moreno-Ortiz, Antonio
    Fernandez-Cruz, Javier
    Perez-Hernandez, Chantal
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 5065 - 5072