Qualitative data analysis of disaster risk reduction suggestions assisted by topic modeling and word2vec

Cited by: 0
Authors
Gorro, Ken [1]
Ancheta, Jeffrey Rosario [2]
Capao, Kris [1]
Oco, Nathaniel [2]
Roxas, Rachel Edita [2]
Sabellano, Mary Jane [1]
Nonnecke, Brandie [3]
Mohanty, Shrestha [3]
Crittenden, Camille [3]
Goldberg, Ken [3]
Affiliations
[1] Univ San Carlos, Cebu, Philippines
[2] Natl Univ, Manila, Philippines
[3] Univ Calif Berkeley, Berkeley, CA 94720 USA
Keywords
word embedding; biterm topic modeling; gensim; scikit-learn; Malasakit toolkit; disaster risk reduction
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
In this study, we examine suggestions for disaster risk reduction strategies provided by residents of selected disaster-prone areas in the Philippines. The study uses 976 suggestions from residents on how their barangay (village) can help them better prepare for a disaster. These were collected through Malasakit, an e-participation platform designed by the University of California, Berkeley and National University (Philippines) to engage community participation in gathering qualitative and quantitative data. Analyses were conducted through biterm topic modeling (BTM) and word embedding using gensim. For better accuracy, the data were preprocessed to remove irrelevant or noisy entries. Based on the BTM result, we identified the following important codes: preparedness, disaster, awareness, community, help, seminars, kanal (canal), linisin (clean), drainage, garbage, basura (garbage). Analyses of the topic models show that disaster preparedness, through improved solid waste management, seminars for public awareness, and evacuation preparation, is an integral part of disaster risk reduction. A word intrusion test was conducted in which BTM scored 55.71%, which implies strong cohesion of the words with their topics. For word embedding, we drilled down on the following words: community, preparedness, emergency, barangay (village), help, kanal (drainage), basura (garbage), awareness, seminars, information. The word2vec results have a cosine similarity score of 0.902, which implies strong relatedness of each word. The results show that the participants give importance to community preparedness for emergencies, helping the barangay in clean-up drives, and awareness through seminars and information dissemination.
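The abstract names two building blocks that are easy to illustrate in isolation: the biterms (unordered word pairs within one short text) that BTM models, and the cosine similarity used to compare word2vec vectors. The following is a minimal sketch only, not the authors' pipeline. The sample suggestions, the toy three-dimensional vectors, and the helper names `extract_biterms` and `cosine_similarity` are all hypothetical; the study itself used gensim and reports no code.

```python
from collections import Counter
from itertools import combinations
import math


def extract_biterms(tokens):
    """Return every unordered pair of distinct words in one short document."""
    return [
        tuple(sorted(pair))
        for pair in combinations(tokens, 2)
        if pair[0] != pair[1]
    ]


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


# Hypothetical, already-preprocessed suggestions (not taken from the study's data).
suggestions = [
    ["linisin", "kanal", "basura"],
    ["seminars", "awareness", "community"],
    ["linisin", "basura"],
]

# Corpus-wide biterm counts: the co-occurrence signal a biterm topic model is fit on.
biterm_counts = Counter(
    biterm for doc in suggestions for biterm in extract_biterms(doc)
)

# Toy vectors standing in for learned word2vec embeddings (assumed values).
toy_vectors = {
    "basura": [0.9, 0.1, 0.0],
    "kanal": [0.8, 0.2, 0.1],
    "seminars": [0.0, 0.1, 0.9],
}

sim_related = cosine_similarity(toy_vectors["basura"], toy_vectors["kanal"])
sim_unrelated = cosine_similarity(toy_vectors["basura"], toy_vectors["seminars"])
```

With these assumed vectors, `sim_related` comes out higher than `sim_unrelated`, which is the sense in which a cosine score close to 1 is read as strong relatedness between words.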
Pages: 293-297
Number of pages: 5