FTSGD: An Adaptive Stochastic Gradient Descent Algorithm for Spark MLlib

Cited by: 2
Authors
Zhang, Hong [1 ]
Liu, Zixia [1 ]
Huang, Hai [2 ]
Wang, Liqiang [1 ]
Affiliations
[1] Univ Cent Florida, Dept Comp Sci, Orlando, FL 32816 USA
[2] IBM TJ Watson Res Ctr, Yorktown Hts, NY USA
Keywords
Spark; MLlib; Asynchronous Stochastic Gradient Descent; Adaptive Iterative Learning
DOI
10.1109/DASC/PiCom/DataCom/CyberSciTec.2018.00-22
CLC Classification Number
TP18 [Artificial Intelligence Theory]
Subject Classification Numbers
081104; 0812; 0835; 1405
Abstract
The proliferation of massive datasets and the surge of interest in big data analytics have popularized a number of distributed data processing platforms such as Hadoop and Spark. Their large and growing ecosystems of libraries enable even novices to take advantage of the latest data analytics and machine learning algorithms. However, time-consuming data synchronization and communication in iterative algorithms on large-scale distributed platforms can cause significant performance inefficiency. MLlib, Spark's scalable machine learning library, implements many common algorithms using Stochastic Gradient Descent (SGD), which finds minima or maxima iteratively; convergence can be very slow, however, if gradient data are synchronized on every iteration. In this work, we optimize the SGD implementation in Spark's MLlib by reusing data partitions multiple times within a single iteration to find better candidate weights more efficiently. Whether to run multiple local iterations within each partition is decided dynamically by the 68-95-99.7 rule. We also design a variant of the momentum algorithm that optimizes the step size in every iteration, using a new adaptive rule that decreases the step size whenever neighboring gradients point in significantly different directions. Experiments show that our adaptive algorithm is more efficient and can be up to 7 times faster than the original MLlib SGD.
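The adaptive step-size rule described in the abstract can be illustrated with a minimal sketch. This is not the paper's Spark/Scala implementation; it is a hypothetical single-machine analogue, assuming that "neighboring gradients showing differing directions of significance" is tested via the sign of the dot product between consecutive gradients (the function name and all parameter defaults here are illustrative):

```python
import numpy as np

def adaptive_momentum_sgd(grad_fn, w0, step=0.1, momentum=0.5,
                          decay=0.5, iters=200):
    """Illustrative sketch (not the paper's code): momentum SGD that
    shrinks the step size whenever two neighboring gradients point in
    significantly different directions (negative dot product)."""
    w = np.asarray(w0, dtype=float)
    v = np.zeros_like(w)           # momentum (velocity) term
    prev_g = None
    for _ in range(iters):
        g = grad_fn(w)
        # Adaptive rule: if the current gradient disagrees with the
        # previous one in direction, reduce the step size.
        if prev_g is not None and np.dot(g, prev_g) < 0:
            step *= decay
        v = momentum * v + step * g  # momentum update
        w = w - v
        prev_g = g
    return w
```

On a simple quadratic such as f(w) = ||w||², whose gradient is 2w, the iterate oscillates around the minimum, each sign flip halves the step, and the sequence converges; this mirrors the intuition that disagreement between neighboring gradients signals overshooting.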
Pages: 828-835 (8 pages)
Related Papers (50 total)
  • [1] A Stochastic Gradient Descent Algorithm Based on Adaptive Differential Privacy
    Deng, Yupeng
    Li, Xiong
    He, Jiabei
    Liu, Yuzhen
    Liang, Wei
    COLLABORATIVE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING, COLLABORATECOM 2022, PT II, 2022, 461: 133-152
  • [2] Stochastic parallel gradient descent algorithm for adaptive optics system
    Ma H.
    Zhang P.
    Zhang J.
    Fan C.
    Wang Y.
    Qiangjiguang Yu Lizishu/High Power Laser and Particle Beams, 2010, 22(06): 1206-1210
  • [3] Adaptive Gradient Estimation Stochastic Parallel Gradient Descent Algorithm for Laser Beam Cleanup
    Ma, Shiqing
    Yang, Ping
    Lai, Boheng
    Su, Chunxuan
    Zhao, Wang
    Yang, Kangjian
    Jin, Ruiyan
    Cheng, Tao
    Xu, Bing
    PHOTONICS, 2021, 8 (05)
  • [4] A stochastic multiple gradient descent algorithm
    Mercier, Quentin
    Poirion, Fabrice
    Desideri, Jean-Antoine
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2018, 271(03): 808-817
  • [5] Optimization of stochastic parallel gradient descent algorithm for adaptive optics in atmospheric turbulence
    Chen B.
    Li X.
    Jiang W.
    Zhongguo Jiguang/Chinese Journal of Lasers, 2010, 37(04): 959-964
  • [6] Theoretical Analysis of Stochastic Parallel Gradient Descent Control Algorithm in Adaptive Optics
    Yang, Huizhen
    Li, Xinyang
    PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL II, 2009: 338+
  • [7] Adaptive optical confocal fluorescence microscope with stochastic parallel gradient descent algorithm
    He, Yi
    Wang, Zhibin
    Wei, Ling
    Li, Xiqi
    Yang, Jinsheng
    Zhang, Yudong
    2016 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE (ACP), 2016,
  • [8] Linear Convergence of Adaptive Stochastic Gradient Descent
    Xie, Yuege
    Wu, Xiaoxia
    Ward, Rachel
    arXiv, 2019,
  • [9] On the Convergence of Stochastic Gradient Descent with Adaptive Stepsizes
    Li, Xiaoyu
    Orabona, Francesco
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [10] Linear Convergence of Adaptive Stochastic Gradient Descent
    Xie, Yuege
    Wu, Xiaoxia
    Ward, Rachel
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108