Aspect Analysis of Cebu Establishments' Online Reviews using k-means Clustering and word2vec

被引:0
|
作者
Capao, Kris [1 ]
Gorro, Ken D. [1 ]
Gorro, Kim D. [1 ]
Sabellano, Mary Jane [1 ]
Militante, Cris Lawrence Adrian G. [1 ]
Manalili, Justin Paul C. [1 ]
机构
[1] Univ San Carlos, Sch Arts & Sci, Dept Comp & Informat Sci, Cebu, Cebu, Philippines
关键词
K-means clustering; Word2vec; open coding; ALGORITHM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Customer reviews are important part to any business. With the development of the technology, customer reviews are usually found on the internet. In this study, online reviews from different Cebu establishments were gathered using selenium as web scraper. A total of 3776 online reviews were gathered. Word2vec and k-means clustering were utilized to analyze and discover different online review corpora. To identify the best number of clusters, a series of experiments were conducted to find for the best Silhouette coefficient. For better analysis of k-means clustering, open coding was used to understand the significant qualitative codes. Based on the k-means clustering results, the following qualitative codes were identified: time, staff friendly, service, affordable, love, food, price, ambiance, good, great, relax. Analyses of the clusters show that quality service, tasty and affordable food and good atmosphere are the significant aspect that the online reviews are concerned. Based on the word2vec results, the researchers focused on the following words: Waiters, relax, great, ambiance, service and tasty. The results of the study provide meaningful insights on the group of words obtained using the analogy to word2vec model, as well as the subject focus of the categories.
引用
收藏
页码:61 / 66
页数:6
相关论文
共 50 条
  • [1] Automatic Text Summarization Using Gensim Word2Vec and K-Means Clustering Algorithm
    Haider, Mofiz Mojib
    Hossin, Md Arman
    Mahi, Hasibur Rashid
    Arif, Hossain
    2020 IEEE REGION 10 SYMPOSIUM (TENSYMP) - TECHNOLOGY FOR IMPACTFUL SUSTAINABLE DEVELOPMENT, 2020, : 283 - 286
  • [2] Research on the Validity of Online Commodity Reviews Based on Word2vec
    Wang, Haiting
    Ren, Junling
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND ELECTRICAL ENGINEERING 2018 (ICITEE '18), 2018,
  • [3] Word2vec and Clustering based Twitter Sentiment Analysis
    Coban, Onder
    Ozyer, Gulsah Tumuklu
    2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP), 2018,
  • [4] 基于K-means与Word2vec的哺乳文胸评论主题挖掘研究
    刘妍
    刘驰
    人类工效学, 2024, (02) : 40 - 45
  • [5] Improvement of Sentiment Analysis based on Clustering of Word2Vec Features
    Alshari, Eissa M.
    Azman, Azreen
    Doraisamy, Shyamala
    Mustapha, Norwati
    Alkeshr, Mustafa
    2017 28TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2017, : 123 - 126
  • [6] Automatic Synonym Extraction Using Word2Vec and Spectral Clustering
    Zhang, Li
    Li, Jun
    Wang, Chao
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 5629 - 5632
  • [7] Weighted aspect based sentiment analysis using extended OWA operators and Word2Vec for tourism
    Ghosal, Sayani
    Jain, Amita
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (12) : 18353 - 18380
  • [8] Weighted aspect based sentiment analysis using extended OWA operators and Word2Vec for tourism
    Sayani Ghosal
    Amita Jain
    Multimedia Tools and Applications, 2023, 82 : 18353 - 18380
  • [9] 基于word2vec与K-means算法食品安全事件自动聚类研究
    沈思
    梁晓静
    信息通信, 2018, (11) : 8 - 10
  • [10] 基于Word2vec与K-means的高校图书馆在线评论主题分析
    刘伟
    李秀霞
    图书馆学刊, 2022, 44 (10) : 88 - 94