Iktishaf+: A Big Data Tool with Automatic Labeling for Road Traffic Social Sensing and Event Detection Using Distributed Machine Learning

Cited by: 46
Authors
Alomari, Ebtesam [1]
Katib, Iyad [1]
Albeshri, Aiiad [1]
Yigitcanlar, Tan [2,3]
Mehmood, Rashid [4]
Affiliations
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah 21589, Saudi Arabia
[2] Queensland Univ Technol, Sch Architecture & Built Environm, 2 George St, Brisbane, Qld 4000, Australia
[3] Univ Fed Santa Catarina, Sch Technol, Campus Univ, BR-88040900 Florianopolis, SC, Brazil
[4] King Abdulaziz Univ, High Performance Comp Ctr, Jeddah 21589, Saudi Arabia
Keywords
smart cities; big data; event detection; road traffic; distributed machine learning; automatic labeling; social media; data analytics; social media analytics; Arabic tweets; APACHE SPARK; SPMV COMPUTATIONS; ANOMALY DETECTION; SMART; SYSTEMS; IOT; CONGESTION; LOGISTICS; TRANSPORT; FRAMEWORK;
DOI
10.3390/s21092993
CLC classification number
O65 [Analytical Chemistry];
Subject classification codes
070302; 081704;
Abstract
Digital societies could be characterized by their increasing desire to express themselves and interact with others. This is being realized through digital platforms such as social media, which have increasingly become convenient and inexpensive sensors compared to physical sensors in many sectors of smart societies. One such major sector is road transportation, which is the backbone of modern economies and globally accounts for 1.25 million deaths and 50 million human injuries annually. State-of-the-art work on big data-enabled social media analytics for transportation-related studies is limited. This paper brings a range of technologies together to detect road traffic-related events using big data and distributed machine learning. The most specific contribution of this research is an automatic labeling method for machine learning-based traffic-related event detection from Twitter data in the Arabic language. The proposed method has been implemented in a software tool called Iktishaf+ (an Arabic word meaning discovery) that is able to detect traffic events automatically from tweets in the Arabic language using distributed machine learning over Apache Spark. The tool is built using nine components and a range of technologies including Apache Spark, Parquet, and MongoDB. Iktishaf+ uses a light stemmer for the Arabic language developed by us. We also use in this work a location extractor developed by us that allows us to extract and visualize spatio-temporal information about the detected events. The specific data used in this work comprises 33.5 million tweets collected from Saudi Arabia using the Twitter API. Using support vector machines, naive Bayes, and logistic regression-based classifiers, we are able to detect and validate several real events in Saudi Arabia without prior knowledge, including a fire in Jeddah, rains in Makkah, and an accident in Riyadh. The findings show the effectiveness of Twitter media in detecting important events with no prior knowledge about them.
Pages: 33