Aspect Analysis of Cebu Establishments' Online Reviews using k-means Clustering and word2vec

被引:0
|
作者
Capao, Kris [1 ]
Gorro, Ken D. [1 ]
Gorro, Kim D. [1 ]
Sabellano, Mary Jane [1 ]
Militante, Cris Lawrence Adrian G. [1 ]
Manalili, Justin Paul C. [1 ]
机构
[1] Univ San Carlos, Sch Arts & Sci, Dept Comp & Informat Sci, Cebu, Cebu, Philippines
关键词
K-means clustering; Word2vec; open coding; ALGORITHM;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Customer reviews are important part to any business. With the development of the technology, customer reviews are usually found on the internet. In this study, online reviews from different Cebu establishments were gathered using selenium as web scraper. A total of 3776 online reviews were gathered. Word2vec and k-means clustering were utilized to analyze and discover different online review corpora. To identify the best number of clusters, a series of experiments were conducted to find for the best Silhouette coefficient. For better analysis of k-means clustering, open coding was used to understand the significant qualitative codes. Based on the k-means clustering results, the following qualitative codes were identified: time, staff friendly, service, affordable, love, food, price, ambiance, good, great, relax. Analyses of the clusters show that quality service, tasty and affordable food and good atmosphere are the significant aspect that the online reviews are concerned. Based on the word2vec results, the researchers focused on the following words: Waiters, relax, great, ambiance, service and tasty. The results of the study provide meaningful insights on the group of words obtained using the analogy to word2vec model, as well as the subject focus of the categories.
引用
收藏
页码:61 / 66
页数:6
相关论文
共 50 条
  • [41] Analysis Clustering of Electricity Usage Profile Using K-Means Algorithm
    Amri, Yasirli
    Fadhilah, Amanda Lailatul
    Fatmawati
    Setiani, Novi
    Rani, Septia
    INTERNATIONAL CONFERENCE ON ENGINEERING AND TECHNOLOGY FOR SUSTAINABLE DEVELOPMENT (ICET4SD) 2015, 2016, 105
  • [42] NMR metabolic analysis of samples using fuzzy K-means clustering
    Cuperlovic-Culf, Miroslava
    Belacel, Nabil
    Cuif, Adrian S.
    Chute, Ian C.
    Ouellette, Rodney J.
    Burton, Ian W.
    Karakach, Tobias K.
    Walter, John A.
    MAGNETIC RESONANCE IN CHEMISTRY, 2009, 47 : S96 - S104
  • [43] Data Analysis of Educational Evaluation Using K-Means Clustering Method
    Liu, Rui
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [45] Analysis of Electricity Consumption at Home Using K-means Clustering Algorithm
    Choi, Hyun Wong
    Qureshi, Nawab Muhammad Faseeh
    Shin, Dong Ryeol
    2019 21ST INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): ICT FOR 4TH INDUSTRIAL REVOLUTION, 2019, : 639 - 643
  • [46] An Analysis of Students' Academic Performance Using K-Means Clustering Algorithm
    Ahmad, Maryam
    Arshad, Noreen Izza Bt
    Sarlan, Aliza Bt
    ADVANCES ON INTELLIGENT INFORMATICS AND COMPUTING: HEALTH INFORMATICS, INTELLIGENT SYSTEMS, DATA SCIENCE AND SMART COMPUTING, 2022, 127 : 309 - 318
  • [47] A study on topic models using LDA and Word2Vec in travel route recommendation: focus on convergence travel and tours reviews
    Seong-Taek Park
    Chang Liu
    Personal and Ubiquitous Computing, 2022, 26 : 429 - 445
  • [48] A study on topic models using LDA and Word2Vec in travel route recommendation: focus on convergence travel and tours reviews
    Park, Seong-Taek
    Liu, Chang
    PERSONAL AND UBIQUITOUS COMPUTING, 2022, 26 (02) : 429 - 445
  • [49] Privacy preserving using joint 2 K-means clustering and coati optimization algorithm for online social networks
    Gowda N.R.
    Venkatesh
    Venugopal K.R.
    International Journal of Information Technology, 2024, 16 (4) : 2715 - 2724
  • [50] Statistical Assessment on Student Engagement in Asynchronous Online Learning Using the k-Means Clustering Algorithm
    Kim, Sohee
    Cho, Sunghee
    Kim, Joo Yeun
    Kim, Dae-Jin
    SUSTAINABILITY, 2023, 15 (03)