Constrained Stochastic Gradient Descent for Large-scale Least Squares Problem

被引:0
|
作者
Mu, Yang [1 ]
Ding, Wei [1 ]
Zhou, Tianyi [2 ]
Tao, Dacheng [2 ]
机构
[1] Univ Massachusetts, 100 Morrissey Blvd, Boston, MA 02125 USA
[2] Univ Technol Sydney, Ultimo, NSW 2007, Australia
关键词
Stochastic optimization; Large-scale least squares; online learning; APPROXIMATION; ALGORITHMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The least squares problem is one of the most important regression problems in statistics, machine learning and data mining. In this paper, we present the Constrained Stochastic Gradient Descent (CSGD) algorithm to solve the large-scale least squares problem. CSGD improves the Stochastic Gradient Descent (SGD) by imposing a provable constraint that the linear regression line passes through the mean point of all the data points. It results in the best regret bound o(logT), and fastest convergence speed among all first order approaches. Empirical studies justify the effectiveness of CSGD by comparing it with SGD and other state-of-the-art approaches. An example is also given to show how to use CSGD to optimize SGD based least squares problems to achieve a better performance.
引用
收藏
页码:883 / 891
页数:9
相关论文
共 50 条
  • [41] Scaled Least Squares Estimator for GLMs in Large-Scale Problems
    Erdogdu, Murat A.
    Bayati, Mohsen
    Dicker, Lee H.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [42] M-Decomposed Least Squares and Recursive Least Squares Identification Algorithms for Large-Scale Systems
    Ji, Yuejiang
    Lv, Lixin
    IEEE ACCESS, 2021, 9 : 139466 - 139472
  • [43] Constrained Stochastic Gradient Descent: The Good Practice
    Roy, Soumava Kumar
    Harandi, Mehrtash
    2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 596 - 603
  • [44] DeepLM: Large-scale Nonlinear Least Squares on Deep Learning Frameworks using Stochastic Domain Decomposition
    Huang, Jingwei
    Huang, Shan
    Sun, Mingwei
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10303 - 10312
  • [45] THE CONSTRAINED LEAST GRADIENT PROBLEM IN RN
    STERNBERG, P
    WILLIAMS, G
    ZIEMER, WP
    TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY, 1993, 339 (01) : 403 - 432
  • [46] Parallelizing Stochastic Gradient Descent for Least Squares Regression: Mini-batching, Averaging, and Model Misspecification
    Jain, Prateek
    Netrapalli, Praneeth
    Kakade, Sham M.
    Kidambi, Rahul
    Sidford, Aaron
    JOURNAL OF MACHINE LEARNING RESEARCH, 2018, 18
  • [47] ON THE COMPLEX LEAST SQUARES PROBLEM WITH CONSTRAINED PHASE
    Markovsky, Ivan
    SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 2011, 32 (03) : 987 - 992
  • [48] Distributing the Stochastic Gradient Sampler for Large-Scale LDA
    Yang, Yuan
    Chen, Jianfei
    Zhu, Jun
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1975 - 1984
  • [49] Accelerated orthogonal least-squares for large-scale sparse reconstruction
    Hashemi, Abolfazl
    Vikalo, Haris
    DIGITAL SIGNAL PROCESSING, 2018, 82 : 91 - 105
  • [50] Partitioned least-squares operator for large-scale geophysical inversion
    Porsani, Milton J.
    Stoffa, Paul L.
    Sen, Mrinal K.
    Seif, Roustam K.
    GEOPHYSICS, 2010, 75 (06) : R121 - R128