Probabilistic modeling of renewable energy source based on Spark platform with large-scale sample data

被引:4
|
作者
Yang, Yan [1 ]
Yu, Juan [1 ]
Yang, Mengfan [1 ,2 ]
Ren, Pengling [1 ]
Yang, Zhifang [1 ]
Wang, Guisheng [3 ]
机构
[1] Chongqing Univ, Key Lab Power Transmiss Equipment & Syst Secur &, Chongqing 400030, Peoples R China
[2] Johnson Controls Inc, Core Platforms, Milwaukee, WI 53201 USA
[3] Chongqing ZENO Big Data Anal Co Ltd, Chongqing, Peoples R China
基金
中国国家自然科学基金;
关键词
renewable energy source; large-scale data; probabilistic modeling; Spark platform; resilient distributed dataset; KERNEL DENSITY-ESTIMATION;
D O I
10.1002/etep.2759
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, the probabilistic modeling based on resilient distribution dataset (RDD) of Spark platform is proposed to efficiently process the large-scale sample data of renewable energy source (RES). Based on Spark and Hadoop distributed file system, a parallel and distributed framework compatible with on-hand RES data storage systems is firstly designed for the fast probabilistic modeling of RES. On the basis of the designed framework, a novel parallel estimation algorithm of Wakeby distribution as well as kernel density estimation is developed based on RDD. With the in-memory parallel computing and fault-tolerant characteristics of RDD, the proposed algorithms significantly enhance the parallel execution performance of probabilistic. Besides, the approximate analytical relationship among time consumptions of the proposed algorithms, two important adjustable parameters (degree of parallelism and the number of partitions) of Spark platform, and large sample size of RES is derived, which is helpful for prediction of computational time, hardware configuration setting, and program tuning in the Spark platform. Simulation results with sample size ranging from 7.3 x 10(6) to 3.6 x 10(9) demonstrate the correctness and effectiveness of the proposed techniques.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] MetaReg: a platform for modeling, analysis and visualization of biological systems using large-scale experimental data
    Ulitsky, Igor
    Gat-Viks, Irit
    Shamir, Ron
    GENOME BIOLOGY, 2008, 9 (01)
  • [42] MetaReg: a platform for modeling, analysis and visualization of biological systems using large-scale experimental data
    Igor Ulitsky
    Irit Gat-Viks
    Ron Shamir
    Genome Biology, 9
  • [43] Large Scale Video Data Analysis Based on Spark
    Yang, Shuai
    Wu, Bin
    2015 INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA (CCBD), 2015, : 209 - 212
  • [44] A Large-Scale Filter Method for Feature Selection Based on Spark
    Marone, Reine Marie
    Camara, Fode
    Ndiaye, Samba
    2017 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI), 2017, : 16 - 20
  • [45] Accelerate Large-Scale Seismic Data Kirchhoff Time Migration in Spark
    Tian, Yang
    Liu, Chao
    Yan, Haihua
    2018 4TH INTERNATIONAL CONFERENCE ON INFORMATION MANAGEMENT (ICIM2018), 2018, : 41 - 45
  • [46] Similarity Estimation for Large-Scale Human Action Video Data on Spark
    Xu, Weihua
    Uddin, Md Azher
    Dolgorsuren, Batjargal
    Akhond, Mostafijur Rahman
    Khan, Kifayat Ullah
    Hossain, Md Ibrahim
    Lee, Young-Koo
    APPLIED SCIENCES-BASEL, 2018, 8 (05):
  • [47] Toward efficient numerical modeling and analysis of large-scale thermal energy storage for renewable district heating
    Dahash, Abdulrahman
    Ochs, Fabian
    Tosatto, Alice
    Streicher, Wolfgang
    APPLIED ENERGY, 2020, 279
  • [48] Modeling and Probabilistic Reasoning of Population Evacuation During Large-scale Disaster
    Song, Xuan
    Zhang, Quanshi
    Sekimoto, Yoshihide
    Horanont, Teerayut
    Ueyama, Satoshi
    Shibasaki, Ryosuke
    19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), 2013, : 1231 - 1239
  • [49] A Large-Scale SUMO-Based Emulation Platform
    Griggs, Wynita M.
    Ordonez-Hurtado, Rodrigo H.
    Crisostomi, Emanuele
    Haeusler, Florian
    Massow, Kay
    Shorten, Robert N.
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, 16 (06) : 3050 - 3059
  • [50] Large-scale data processing platform for laser absorption tomography
    Zhou, Minqiu
    Zhang, Rui
    Chen, Yuan
    Fu, Yalei
    Xia, Jiangnan
    Upadhyay, Abhishek
    Liu, Chang
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (12)