Qualitative data analysis of disaster risk reduction suggestions assisted by topic modeling and word2vec

Cited by: 0
Authors
Gorro, Ken [1]
Ancheta, Jeffrey Rosario [2]
Capao, Kris [1]
Oco, Nathaniel [2]
Roxas, Rachel Edita [2]
Sabellano, Mary Jane [1]
Nonnecke, Brandie [3]
Mohanty, Shrestha [3]
Crittenden, Camille [3]
Goldberg, Ken [3]
Affiliations
[1] Univ San Carlos, Cebu, Philippines
[2] Natl Univ, Manila, Philippines
[3] Univ Calif Berkeley, Berkeley, CA 94720 USA
Keywords
word embedding; biterm topic modeling; gensim; scikit-learn; Malasakit toolkit; disaster risk reduction
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this study, we examine suggestions for disaster risk reduction strategies provided by residents of selected disaster-prone areas in the Philippines. The study uses 976 suggestions on how the residents' barangay (village) can help them better prepare for a disaster. These were collected through Malasakit, an e-participation platform designed by the University of California, Berkeley and National University (Philippines) to engage community participation in gathering qualitative and quantitative data. Analyses were conducted through biterm topic modeling (BTM) and word embedding using gensim. For better accuracy, data preprocessing was performed to remove irrelevant or noisy data. Based on the BTM result, we identified the following important codes: preparedness, disaster, awareness, community, help, seminars, kanal (canal), linisin (clean), drainage, garbage, basura (garbage). Analyses of the topic models show that disaster preparedness, through improved solid waste management, seminars for public awareness, and evacuation preparation, is an integral part of disaster risk reduction. A word intrusion test was conducted in which BTM scored 55.71%, which implies strong cohesion of the words with their topics. For word embedding, we drilled down on the following words: community, preparedness, emergency, barangay (village), help, kanal (drainage), basura (garbage), awareness, seminars, information. The word2vec results have a cosine similarity score of 0.902, which implies strong relatedness among the words. The results show that the participants give importance to community preparedness for emergencies, helping the barangay in clean-up drives, and awareness through seminars and information dissemination.
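The two scores quoted in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the function names, the three-dimensional toy vectors, and the example judgments are all hypothetical and chosen only to show how a cosine similarity between two word vectors and a word intrusion precision (the share of annotator picks that match the true intruder word) are usually computed.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


def intrusion_precision(judgments):
    """Fraction of (chosen_word, true_intruder) pairs where the pick was correct."""
    if not judgments:
        raise ValueError("no judgments given")
    hits = sum(1 for chosen, intruder in judgments if chosen == intruder)
    return hits / len(judgments)


# Hypothetical toy vectors; real word2vec vectors typically have 100+ dimensions
# and are learned from the corpus rather than written by hand.
toy_vectors = {
    "basura": [0.9, 0.1, 0.3],
    "kanal": [0.8, 0.2, 0.4],
    "seminars": [0.1, 0.9, 0.2],
}

if __name__ == "__main__":
    print(cosine_similarity(toy_vectors["basura"], toy_vectors["kanal"]))
    print(cosine_similarity(toy_vectors["basura"], toy_vectors["seminars"]))
    # Hypothetical annotator picks paired with the true intruder of each topic.
    print(intrusion_precision([("help", "help"), ("kanal", "seminars")]))
```

With a trained gensim Word2Vec model, `model.wv.similarity(word_a, word_b)` returns the same cosine quantity for the learned vectors, so the helper above is only needed to see the arithmetic.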
Pages: 293-297
Page count: 5