Sentiment Analysis for Multilingual Corpora

被引:0
|
作者
Galeshchuk, Svitlana [1 ]
Qiu, Ju [2 ]
Jourdan, Julien [1 ]
机构
[1] PSL Res Univ, Governance Analyt, Univ Paris Dauphine, Pl Marechal Lattre Tassigny, F-75016 Paris, France
[2] PSL Res Univ, Univ Paris Dauphine, Pl Marechal Lattre Tassigny, F-75016 Paris, France
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper presents a generic approach to the supervised sentiment analysis of social media content in foreign languages. The method proposes translating documents from the original language to English with Google's Neural Translation Model. The resulted texts are then converted to vectors by averaging the vectorial representation of words derived from a pretrained Word2Vec English model. Testing the approach with several machine learning methods on Polish, Slovenian and Croatian Twitter corpora returns up to 86 % of classification accuracy on the out-of-sample data.
引用
收藏
页码:120 / 125
页数:6
相关论文
共 50 条
  • [31] Multilingual emoji prediction using BERT for sentiment analysis
    Tomihira, Toshiki
    Otsuka, Atsushi
    Yamashita, Akihiro
    Satoh, Tetsuji
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2020, 16 (03) : 265 - 280
  • [32] A Sentiment Analysis Service Platform for Streamed Multilingual Tweets
    Karageorgou, Ioanna
    Liakos, Panagiotis
    Delis, Alex
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 3262 - 3271
  • [33] MSATS: Multilingual Sentiment Analysis via Text Summarization
    Bhargava, Rupal
    Sharma, Yashvardhan
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE AND ENGINEERING (CONFLUENCE 2017), 2017, : 71 - 76
  • [34] Multilingual Sentiment Analysis on Social Media Disaster Data
    Fuadvy, Muhammad Jauharul
    Ibrahim, Roliana
    2019 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS AND INFORMATION ENGINEERING (ICEEIE), 2019, : 269 - 272
  • [35] BPA: A Multilingual Sentiment Analysis Approach based on BiLSTM
    Chaves, Iago C.
    Martins, Antonio Diogo F.
    Praciano, Francisco D. B. S.
    Brito, Felipe T.
    Monteiro, Jose Maria
    Machado, Javam C.
    ICEIS: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS - VOL 1, 2022, : 553 - 560
  • [36] Seeing through multilingual corpora: On the use of corpora in contrastive studies
    Viberg, Ake
    LANGUAGE, 2009, 85 (02) : 476 - 480
  • [37] Multilingual text categorization and sentiment analysis: a comparative analysis of the utilization of multilingual approaches for classifying twitter data
    Manias, George
    Mavrogiorgou, Argyro
    Kiourtis, Athanasios
    Symvoulidis, Chrysostomos
    Kyriazis, Dimosthenis
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (29): : 21415 - 21431
  • [38] Multilingual text categorization and sentiment analysis: a comparative analysis of the utilization of multilingual approaches for classifying twitter data
    George Manias
    Argyro Mavrogiorgou
    Athanasios Kiourtis
    Chrysostomos Symvoulidis
    Dimosthenis Kyriazis
    Neural Computing and Applications, 2023, 35 : 21415 - 21431
  • [39] Sentiment Analysis for Brazilian Portuguese over a Skewed Class Corpora
    Brum, Henrico
    Araujo, Filipe
    Kepler, Fabio
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE (PROPOR 2016), 2016, 9727 : 134 - 138
  • [40] HYBRID FEATURE SELECTION FRAMEWORK FOR SENTIMENT ANALYSIS ON LARGE CORPORA
    Adewole, Kayode S.
    Balogun, Abdullateef O.
    Raheem, Muiz O.
    Jimoh, Muhammed K.
    Jimoh, Rasheed G.
    Mabayoje, Modinat A.
    Usman-Hamza, Fatima E.
    Akintola, Abimbola G.
    Asaju-Gbolagade, Ayisat W.
    JORDANIAN JOURNAL OF COMPUTERS AND INFORMATION TECHNOLOGY, 2021, 7 (02): : 130 - 151