Generating Word Embeddings from an Extreme Learning Machine for Sentiment Analysis and Sequence Labeling Tasks

被引:36
|
作者
Lauren, Paula [1 ]
Qu, Guangzhi [1 ]
Yang, Jucheng [2 ]
Watta, Paul [3 ]
Huang, Guang-Bin [4 ]
Lendasse, Amaury [5 ]
机构
[1] Oakland Univ, Dept Comp Sci & Engn, Rochester, MI 48309 USA
[2] Tianjin Univ Sci & Technol, Coll Comp Sci & Informat Engn, Tianjin, Peoples R China
[3] Univ Michigan, Dept Elect & Comp Engn, Dearborn, MI 48128 USA
[4] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
[5] Univ Iowa, Dept Ind & Syst Engn, Iowa City, IA USA
基金
中国国家自然科学基金;
关键词
Word embeddings; Extreme learning machine (ELM); Word2Vec; Global vectors (GloVe); Text categorization; Sentiment analysis; Sequence labeling; REPRESENTATIONS;
D O I
10.1007/s12559-018-9548-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Word Embeddings are low-dimensional distributed representations that encompass a set of language modeling and feature learning techniques from Natural Language Processing (NLP). Words or phrases from the vocabulary are mapped to vectors of real numbers in a low-dimensional space. In previous work, we proposed using an Extreme Learning Machine (ELM) for generating word embeddings. In this research, we apply the ELM-based Word Embeddings to the NLP task of Text Categorization, specifically Sentiment Analysis and Sequence Labeling. The ELM-based Word Embeddings utilizes a count-based approach similar to the Global Vectors (GloVe) model, where the word-context matrix is computed then matrix factorization is applied. A comparative study is done with Word2Vec and GloVe, which are the two popular state-of-the-art models. The results show that ELM-based Word Embeddings slightly outperforms the aforementioned two methods in the Sentiment Analysis and Sequence Labeling tasks.In addition, only one hyperparameter is needed using ELM whereas several are utilized for the other methods. ELM-based Word Embeddings are comparable to the state-of-the-art methods: Word2Vec and GloVe models. In addition, the count-based ELM model have word similarities to both the count-based GloVe and the predict-based Word2Vec models, with subtle differences.
引用
收藏
页码:625 / 638
页数:14
相关论文
共 50 条
  • [21] Multi-channel word embeddings for sentiment analysis
    Jhe-Wei Lin
    Tran Duy Thanh
    Rong-Guey Chang
    Soft Computing, 2022, 26 : 12703 - 12715
  • [22] Sentiment classification and aspect-based sentiment analysis on yelp reviews using deep learning and word embeddings
    Alamoudi, Eman Saeed
    Alghamdi, Norah Saleh
    JOURNAL OF DECISION SYSTEMS, 2021, 30 (2-3) : 259 - 281
  • [23] Debiasing Word Embeddings from Sentiment Associations in Names
    Hube, Christoph
    Idahl, Maximilian
    Fetahu, Besnik
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, : 259 - 267
  • [24] A Comprehensive Survey for Sentiment Analysis Tasks Using Machine Learning Techniques
    Aydogan, Ebru
    Akcayol, M. Ali
    PROCEEDINGS OF THE 2016 INTERNATIONAL SYMPOSIUM ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA), 2016,
  • [25] Cross-domain sentiment aware word embeddings for review sentiment analysis
    Jun Liu
    Shuang Zheng
    Guangxia Xu
    Mingwei Lin
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 343 - 354
  • [26] Cross-domain sentiment aware word embeddings for review sentiment analysis
    Liu, Jun
    Zheng, Shuang
    Xu, Guangxia
    Lin, Mingwei
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (02) : 343 - 354
  • [28] Learning Domain-Sensitive and Sentiment-Aware Word Embeddings
    Shi, Bei
    Fu, Zihao
    Bing, Lidong
    Lam, Wai
    PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 2494 - 2504
  • [29] A Comparative Evaluation of Word Embeddings Techniques for Twitter Sentiment Analysis
    Kaibi, Ibrahim
    Nfaoui, El Habib
    Satori, Hassan
    2019 INTERNATIONAL CONFERENCE ON WIRELESS TECHNOLOGIES, EMBEDDED AND INTELLIGENT SYSTEMS (WITS), 2019,
  • [30] More than Bags of Words: Sentiment Analysis with Word Embeddings
    Rudkowsky, Elena
    Haselmayer, Martin
    Wastian, Matthias
    Jenny, Marcelo
    Emrich, Stefan
    Sedlmair, Michael
    COMMUNICATION METHODS AND MEASURES, 2018, 12 (2-3) : 140 - 157