Utilization of social media in floods assessment using data mining techniques

Cited by: 18
Authors
Khan, Qasim [1 ]
Kalbus, Edda [2 ]
Zaki, Nazar [3 ]
Mohamed, Mohamed Mostafa [1 ,2 ]
Affiliations
[1] United Arab Emirates Univ, Civil & Environm Engn Dept, Al Ain, U Arab Emirates
[2] United Arab Emirates Univ, Natl Water Ctr, Al Ain, U Arab Emirates
[3] United Arab Emirates Univ, Dept Comp Sci & Software Engn, Al Ain, U Arab Emirates
Source
PLOS ONE | 2022 / Vol. 17 / Issue 04
Keywords
INFORMATION; TWITTER;
DOI
10.1371/journal.pone.0267079
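Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code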
07 ; 0710 ; 09 ;
Abstract
Floods are among the most devastating types of disasters in terms of loss of human life and social and financial damage. Authoritative data from flood gauges are scarce in arid regions because the dry climate causes these measuring devices to malfunction. Social media, where a wealth of information is available online, could therefore be a useful data source in such cases. This study investigates the quality and reliability of flood-related data collected from social media, particularly for an arid region where the use of flow gauges is limited. Social media data (text, images, and videos) related to a flood event were analyzed using a machine learning approach. To this end, the digital data (758 images and 1413 video frames) were converted into numeric values through the ResNet50 model using the VGG-16 architecture. The numeric data from images, videos, and text were then classified using different machine learning algorithms. Receiver operating characteristic (ROC) curves and the area under the curve (AUC) were used to evaluate and compare the performance of the machine learning algorithms. This novel approach to studying the quality of social media data could be a reliable alternative in the absence of real-time flow gauge data. A flash flood that occurred in the United Arab Emirates (UAE) from March 7-11, 2016 was selected as the focus of this study. Random forest showed the highest accuracy, 80.18%, compared with the five other classifiers for images and videos. Precipitation/rainfall data were used to validate the social media data and showed a significant relationship between rainfall and the number of posts. The validity and accuracy of the machine learning models were assessed using the area under the curve, the precision-recall curve, root mean square error, and kappa statistics. The data quality of YouTube videos was found to have the highest accuracy, followed by Facebook, Flickr, Twitter, and Instagram. These results show that social media data could be used when gauge data are unavailable.
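The abstract names AUC and kappa statistics among its evaluation measures. As a rough illustration only, the sketch below computes both for a binary "flood-relevant vs. not relevant" classifier in plain Python. It is not the authors' implementation: the function names, the 0.5 decision threshold, and the toy labels and scores are all assumptions made up for this example.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Equivalent to the area under the ROC curve; tied scores count as 0.5.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def cohen_kappa(truth: Sequence[int], predicted: Sequence[int]) -> float:
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    truth, predicted = list(truth), list(predicted)
    n = len(truth)
    if n == 0 or n != len(predicted):
        raise ValueError("inputs must be non-empty and of equal length")
    observed = sum(t == p for t, p in zip(truth, predicted)) / n
    classes = set(truth) | set(predicted)
    expected = sum(
        (truth.count(c) / n) * (predicted.count(c) / n) for c in classes
    )
    if expected == 1.0:
        # Only one class present on both sides: agreement is trivially perfect.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical toy data: 1 = flood-relevant post, 0 = not relevant.
labels = [1, 1, 1, 0, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.2, 0.7, 0.1]
# Assumed decision threshold of 0.5 turns scores into hard predictions.
predictions = [1 if s >= 0.5 else 0 for s in scores]

auc = roc_auc(labels, scores)
kappa = cohen_kappa(labels, predictions)
```

AUC is computed from the raw scores and so does not depend on a threshold, whereas kappa is computed from hard predictions and changes with the threshold chosen.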
Pages: 18