Investigation on the use of ensemble learning and big data in crop identification

被引:6
|
作者
Ahmed, Sayed [1 ]
Mahmoud, Amira S. [1 ]
Farg, Eslam [1 ]
Mohamed, Amany M. [1 ]
Moustafa, Marwa S. [1 ]
Abutaleb, Khaled [1 ]
Saleh, Ahmed M. [1 ]
AbdelRahman, Mohamed A. E. [1 ]
AbdelSalam, Hisham M. [2 ]
Arafat, Sayed M. [1 ]
机构
[1] Natl Author Remote Sensing & Space Sci NARSS, Cairo, Egypt
[2] Cairo Univ, Fac Comp & Artificial Intelligence, Giza, Egypt
关键词
Big data; Crop identification; Ensemble learning; DB Framework; Apache spark;
D O I
10.1016/j.heliyon.2023.e13339
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The agriculture sector in Egypt faces several problems, such as climate change, water storage, and yield variability. The comprehensive capabilities of Big Data (BD) can help in tackling the uncertainty of food supply occurs due to several factors such as soil erosion, water pollution, climate change, socio-cultural growth, governmental regulations, and market fluctuations. Crop identification and monitoring plays a vital role in modern agriculture. Although several machine learning models have been utilized in identifying crops, the performance of ensemble learning has not been investigated extensively. The massive volume of satellite imageries has been established as a big data problem forcing to deploy the proposed solution using big data technologies to manage, store, analyze, and visualize satellite data. In this paper, we have developed a weighted voting mechanism for improving crop classification performance in a large scale, based on ensemble learning and big data schema. Built upon Apache Spark, the popular DB Framework, the proposed approach was tested on El Salheya, Ismaili governate. The proposed ensemble approach boosted accuracy by 6.5%, 1.9%, 4.4%, 4.9%, 4.7% in precision, recall, F-score, Overall Accuracy (OA), and Matthews correlation coefficient (MCC) metrics respectively. Our findings confirm the generalization of the proposed crop identification approach at a large-scale setting.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Big data and machine learning for crop protection
    Ip, Ryan H. L.
    Ang, Li-Minn
    Seng, Kah Phooi
    Broster, J. C.
    Pratley, J. E.
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2018, 151 : 376 - 383
  • [2] Big data classification of learning behaviour based on data reduction and ensemble learning
    Wang, Taotao
    Wu, Xiaoxuan
    INTERNATIONAL JOURNAL OF CONTINUING ENGINEERING EDUCATION AND LIFE-LONG LEARNING, 2023, 33 (4-5) : 496 - 510
  • [3] An Asymptotic Ensemble Learning Framework for Big Data Analysis
    Salloum, Salman
    Huang, Joshua Zhexue
    He, Yulin
    Chen, Xiaojun
    IEEE ACCESS, 2019, 7 : 3675 - 3693
  • [4] Empirical Analysis of Asymptotic Ensemble Learning for Big Data
    Salloum, Salman
    Huang, Joshua Zhexue
    He, Yulin
    2016 3RD IEEE/ACM INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING, APPLICATIONS AND TECHNOLOGIES (BDCAT), 2016, : 8 - 17
  • [5] Data analytics in ensemble learning for effective crop yield prediction
    Tripathi, Deeksha
    Biswas, Saroj K.
    Baruah, Barnana
    ENGINEERING RESEARCH EXPRESS, 2024, 6 (03):
  • [6] Improving malware detection using big data and ensemble learning
    Gupta, Deepak
    Rani, Rinkle
    COMPUTERS & ELECTRICAL ENGINEERING, 2020, 86
  • [7] Intrusion detection based on ensemble learning for big data classification
    Jemili, Farah
    Meddeb, Rahma
    Korbaa, Ouajdi
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (03): : 3771 - 3798
  • [8] Towards Big Data Bayesian Network Learning - an Ensemble Learning Based Approach
    Tang, Yan
    Wang, Yu
    Li, Ling
    Cooper, Kendra M. L.
    2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, : 355 - 357
  • [9] Mineral identification based on data augmentation and ensemble learning
    Wang, Lin
    Ji, Xiaohui
    Yang, Mei
    He, Mingyue
    Zhang, Zhaochong
    Zeng, Shan
    Wang, Yuzhu
    Earth Science Frontiers, 2024, 31 (04) : 87 - 94
  • [10] An Algorithm Design of Big Data Anomaly Detection Based on Ensemble Learning
    Chen, Xiao
    PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON COMPUTER AND MULTIMEDIA TECHNOLOGY, ICCMT 2024, 2024, : 319 - 323