Generating Word Embeddings from an Extreme Learning Machine for Sentiment Analysis and Sequence Labeling Tasks

被引:36
|
作者
Lauren, Paula [1 ]
Qu, Guangzhi [1 ]
Yang, Jucheng [2 ]
Watta, Paul [3 ]
Huang, Guang-Bin [4 ]
Lendasse, Amaury [5 ]
机构
[1] Oakland Univ, Dept Comp Sci & Engn, Rochester, MI 48309 USA
[2] Tianjin Univ Sci & Technol, Coll Comp Sci & Informat Engn, Tianjin, Peoples R China
[3] Univ Michigan, Dept Elect & Comp Engn, Dearborn, MI 48128 USA
[4] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
[5] Univ Iowa, Dept Ind & Syst Engn, Iowa City, IA USA
基金
中国国家自然科学基金;
关键词
Word embeddings; Extreme learning machine (ELM); Word2Vec; Global vectors (GloVe); Text categorization; Sentiment analysis; Sequence labeling; REPRESENTATIONS;
D O I
10.1007/s12559-018-9548-y
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Word Embeddings are low-dimensional distributed representations that encompass a set of language modeling and feature learning techniques from Natural Language Processing (NLP). Words or phrases from the vocabulary are mapped to vectors of real numbers in a low-dimensional space. In previous work, we proposed using an Extreme Learning Machine (ELM) for generating word embeddings. In this research, we apply the ELM-based Word Embeddings to the NLP task of Text Categorization, specifically Sentiment Analysis and Sequence Labeling. The ELM-based Word Embeddings utilizes a count-based approach similar to the Global Vectors (GloVe) model, where the word-context matrix is computed then matrix factorization is applied. A comparative study is done with Word2Vec and GloVe, which are the two popular state-of-the-art models. The results show that ELM-based Word Embeddings slightly outperforms the aforementioned two methods in the Sentiment Analysis and Sequence Labeling tasks.In addition, only one hyperparameter is needed using ELM whereas several are utilized for the other methods. ELM-based Word Embeddings are comparable to the state-of-the-art methods: Word2Vec and GloVe models. In addition, the count-based ELM model have word similarities to both the count-based GloVe and the predict-based Word2Vec models, with subtle differences.
引用
收藏
页码:625 / 638
页数:14
相关论文
共 50 条
  • [31] THE JOINT EFFECT OF SEMANTIC AND SYNTACTIC WORD EMBEDDINGS ON SENTIMENT ANALYSIS
    Chen, Shu
    Chen, Guang
    Wang, Wei
    PROCEEDINGS OF 2016 5TH IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC 2016), 2016, : 366 - 370
  • [32] Generating Bags of Words from the Sums of Their Word Embeddings
    White, Lyndon
    Togneri, Roberto
    Liu, Wei
    Bennamoun, Mohammed
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, (CICLING 2016), PT I, 2018, 9623 : 91 - 102
  • [33] Fine-Tuning of Word Embeddings for Semantic Sentiment Analysis
    Atzeni, Mattia
    Recupero, Diego Reforgiato
    SEMANTIC WEB CHALLENGES, SEMWEBEVAL 2018, 2018, 927 : 140 - 150
  • [34] Refining Word Embeddings Using Intensity Scores for Sentiment Analysis
    Yu, Liang-Chih
    Wang, Jin
    Lai, K. Robert
    Zhang, Xuejie
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (03) : 671 - 681
  • [35] Evaluating Neural Word Embeddings Created from Online Course Reviews for Sentiment Analysis
    Dessi, Danilo
    Dragoni, Mauro
    Fenu, Gianni
    Marras, Mirko
    Recupero, Diego Reforgiato
    SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 2124 - 2127
  • [36] Sentence-Level Sentiment Analysis Using Feature Vectors from Word Embeddings
    Hayashi, Toshitaka
    Fujita, Hamido
    NEW TRENDS IN INTELLIGENT SOFTWARE METHODOLOGIES, TOOLS AND TECHNIQUES (SOMET_18), 2018, 303 : 749 - 758
  • [37] Adversarial learning of sentiment word representations for sentiment analysis
    Peng, Bo
    Wang, Jin
    Zhang, Xuejie
    INFORMATION SCIENCES, 2020, 541 : 426 - 441
  • [38] Training Word Embeddings for Deep Learning in Biomedical Text Mining Tasks
    Jiang, Zhenchao
    Li, Lishuang
    Huang, Degen
    Jin, Liuke
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 625 - 628
  • [39] Software Sentiment Analysis Using Machine Learning with Different Word-Embedding
    Mula, Venkata Krishna Chandra
    Vijayvargiya, Sanidhya
    Kumar, Lov
    Samant, Surender Singh
    Murthy, Lalita Bhanu
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2022 WORKSHOPS, PART V, 2022, 13381 : 396 - 410
  • [40] Learning Word Representations for Sentiment Analysis
    Li, Yang
    Pan, Quan
    Yang, Tao
    Wang, Suhang
    Tang, Jiliang
    Cambria, Erik
    COGNITIVE COMPUTATION, 2017, 9 (06) : 843 - 851