The Design and Implementation of Vector Autoregressive Model and Structural Vector Autoregressive Model Based on Spark

被引:3
|
作者
Li, Tao [1 ]
Li, Xueyu [1 ]
Zhang, Xu [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Network Educ, Beijing 100876, Peoples R China
关键词
computer software and theory; Big Data; Time Series Analysis; Distributed Computing; Spark; VAR (Vector Auto-regressive); SVAR(Structural Vector Auto-regression); SGD (Stochastic Gradient Descent);
D O I
10.1109/BIGCOM.2017.46
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
VAR (Vector Auto-regressive) model is a kind of commonly used econometric-model. It is used to estimate the dynamic relationship of the endogenous variables without any prior constraints. Since VAR is one of the most easily operated models to deal with the analysis and prediction of multiple related economic indicators, more and more attention has been paid by economists in two decades. However, with the increasing of data size, the individual computer has encountered its processing bottleneck. Meanwhile, the advantages of the distributed computing cluster have begun to show obvious strength, such as Hadoop, Spark, and so on. Due to the lack of VAR related model on Spark, MLlib, we developed approaches of VAR and SVAR (Structural Vector Auto-regression) model in Spark and Hadoop cluster. Meanwhile, SGD (Stochastic Gradient Descent) algorithm has been applied after the data processing. To verify the approaches, different sizes of data are used for model testing in different platform, including R and Spark cluster. According to the comparison of the response time of different data size in both platform, the experiment results have shown that the developed methods are simple and efficient in big data environment.
引用
收藏
页码:386 / 394
页数:9
相关论文
共 50 条