Improved Pathwise Coordinate Descent for Power Penalties

被引:0
|
作者
Griffin, Maryclare [1 ,2 ]
机构
[1] Univ Massachusetts Amherst, Dept Math & Stat, Amherst, MA USA
[2] Univ Massachusetts Amherst, Dept Math & Stat, Amherst, MA 01003 USA
关键词
Coordinate descent; LASSO; Nonconvex optimization; Regularization surface; Sparse regression; REGRESSION; SELECTION; BRIDGE;
D O I
10.1080/10618600.2023.2256807
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Pathwise coordinate descent algorithms have been used to compute entire solution paths for lasso and other penalized regression problems quickly with great success. They improve upon cold start algorithms by solving the problems that make up the solution path sequentially for an ordered set of tuning parameter values, instead of solving each problem separastely. However, extending pathwise coordinate descent algorithms to more the general bridge or power family of l(q) penalties is challenging. Faster algorithms for computing solution paths for these penalties are needed because l(q) penalized regression problems can be nonconvex and especially burdensome to solve. In this paper, we show that a reparameterization of l(q) penalized regression problems is more amenable to pathwise coordinate descent algorithms. This allows us to improve computation of the mode-thresholding function for l(q) penalized regression problems in practice and introduce two separate pathwise algorithms. We show that either pathwise algorithm is faster than the corresponding cold start alternative, and demonstrate that different pathwise algorithms may be more likely to reach better solutions. Supplemental materials for this article are available online.
引用
收藏
页码:310 / 315
页数:6
相关论文
共 50 条
  • [21] A Decomposition And Coordination Algorithm For Reactive Power Optimization Based On Block Coordinate Descent
    Liu, Jun
    He, Weiguo
    Zhan, Xiaoyin
    Li, Zhi
    2012 CHINA INTERNATIONAL CONFERENCE ON ELECTRICITY DISTRIBUTION (CICED), 2012,
  • [22] An improved parallel block coordinate descent method for the distributed computing of traffic assignment problem
    Wang, Zewen
    Zhang, Kai
    Chen, Xinyuan
    Wang, Meng
    Liu, Renwei
    Liu, Zhiyuan
    TRANSPORTMETRICA A-TRANSPORT SCIENCE, 2022, 18 (03) : 1376 - 1400
  • [23] Asynchronous Parallel Greedy Coordinate Descent
    You, Yang
    Lian, XiangRu
    Liu, Ji
    Yu, Hsiang-Fu
    Dhillon, Inderjit S.
    Demmel, James
    Hsieh, Cho-Jui
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [24] Coordinate Descent Full Configuration Interaction
    Wang, Zhe
    Li, Yingzhou
    Lu, Jianfeng
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2019, 15 (06) : 3558 - 3569
  • [25] Markov chain block coordinate descent
    Sun, Tao
    Sun, Yuejiao
    Xu, Yangyang
    Yin, Wotao
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2020, 75 (01) : 35 - 61
  • [26] Accelerating Coordinate Descent in Iterative Reconstruction
    Hsieh, Scott S.
    Hoffman, John M.
    Noo, Frederic
    MEDICAL IMAGING 2019: PHYSICS OF MEDICAL IMAGING, 2019, 10948
  • [27] Stability and Generalization for Randomized Coordinate Descent
    Wang, Puyu
    Wu, Liang
    Lei, Yunwen
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3104 - 3110
  • [28] Differentially private block coordinate descent
    Riaz, Shazia
    Ali, Saqib
    Wang, Guojun
    Anees, Asad
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (01) : 283 - 295
  • [29] Differentially Private Stochastic Coordinate Descent
    Damaskinos, Georgios
    Mendler-Duenner, Celestine
    Guerraoui, Rachid
    Papandreou, Nikolaos
    Parnell, Thomas
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7176 - 7184
  • [30] Parallel coordinate descent for the Adaboost problem
    Fercoq, Olivier
    2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 1, 2013, : 354 - 358