Efficient spatiotemporal interpolation with spark machine learning

被引:0
|
作者
Weitian Tong
Lixin Li
Xiaolu Zhou
Jason Franklin
机构
[1] Georgia Southern University,Department of Computer Science
[2] Georgia Southern University,Department of Geology and Geography
来源
Earth Science Informatics | 2019年 / 12卷
关键词
Spatiotemporal interpolation; Spark; Machine learning; Inverse distance weighting (IDW); k-d tree; Bootstrap aggregating;
D O I
暂无
中图分类号
学科分类号
摘要
To better assess the relationships between environmental exposures and health outcomes, an appropriate spatiotemporal interpolation is critical. Traditional spatiotemporal interpolation methods either consider the spatial and temporal dimensions separately or incorporate both dimensions simultaneously by simply treating time as another dimension in space. Such interpolation results suffer from relatively low accuracy as the true space-time domain is skewed inappropriately and the distance calculation in such domain is not accurate. We employ the efficient k-d tree structure to store spatiotemporal data and adopt several machine learning methods to learn optimal parameters. To overcome the computational difficulty with large data sets, we implement our method on an efficient cluster computing framework – Apache Spark. Real world PM2.5 data sets are utilized to test our implementation and the experimental results demonstrate the computational power of our method, which significantly outperforms the previous work in terms of both speed and accuracy.
引用
收藏
页码:87 / 96
页数:9
相关论文
共 50 条
  • [1] Efficient spatiotemporal interpolation with spark machine learning
    Tong, Weitian
    Li, Lixin
    Zhou, Xiaolu
    Franklin, Jason
    EARTH SCIENCE INFORMATICS, 2019, 12 (01) : 87 - 96
  • [2] Sparker: Efficient Reduction for More Scalable Machine Learning with Spark
    Yu, Bowen
    Cao, Huanqi
    Shan, Tianyi
    Wang, Haojie
    Tang, Xiongchao
    Chen, Wenguang
    50TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2021,
  • [3] MLlib: Machine learning in Apache Spark
    Meng, Xiangrui
    Bradley, Joseph
    Yavuz, Burak
    Sparks, Evan
    Venkataraman, Shivaram
    Liu, Davies
    Freeman, Jeremy
    Tsai, D.B.
    Amde, Manish
    Owen, Sean
    Xin, Doris
    Xin, Reynold
    Franklin, Michael J.
    Zadeh, Reza
    Zaharia, Matei
    Talwalkar, Ameet
    Journal of Machine Learning Research, 2016, 17
  • [4] MLlib: Machine Learning in Apache Spark
    Meng, Xiangrui
    Bradley, Joseph
    Yavuz, Burak
    Sparks, Evan
    Venkataraman, Shivaram
    Liu, Davies
    Freeman, Jeremy
    Tsai, D. B.
    Amde, Manish
    Owen, Sean
    Xin, Doris
    Xin, Reynold
    Franklin, Michael J.
    Zadeh, Reza
    Zaharia, Matei
    Talwalkar, Ameet
    JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [5] SystemML: Declarative Machine Learning on Spark
    Boehm, Matthias
    Dusenberry, Michael W.
    Eriksson, Deron
    Evfimievski, Alexandre V.
    Manshadi, Faraz Makari
    Pansare, Niketan
    Reinwald, Berthold
    Reiss, Frederick R.
    Sen, Prithviraj
    Surve, Arvind C.
    Tatikonda, Shirish
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (13): : 1425 - 1436
  • [6] Efficient and accurate machine-learning interpolation of atomic energies in compositions with many species
    Artrith, Nongnuch
    Urban, Alexander
    Ceder, Gerbrand
    PHYSICAL REVIEW B, 2017, 96 (01)
  • [7] ADMM based Scalable Machine Learning on Spark
    Dhar, Sauptik
    Yi, Congrui
    Ramakrishnan, Naveen
    Shah, Mohak
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1174 - 1182
  • [8] Intelligent interpolation by Monte Carlo machine learning
    Jia, Yongna
    Yu, Siwei
    Ma, Jianwei
    GEOPHYSICS, 2018, 83 (02) : V83 - V97
  • [9] IMAT: matrix learning machine with interpolation mapping
    Wang, Zhe
    Lu, Mingzhe
    Zhu, Yujin
    Gao, Daqi
    ELECTRONICS LETTERS, 2014, 50 (24) : 1836 - U201
  • [10] GEOMAGNETIC SURVEY INTERPOLATION WITH THE MACHINE LEARNING APPROACH
    Aleshin, Igor
    Kholodkov, Kirill
    Malygin, Ivan
    Shevchuk, Roman
    Sidorov, Roman
    RUSSIAN JOURNAL OF EARTH SCIENCES, 2022, 22 (06):