LASSO for streaming data with adaptative filtering

被引:0
|
作者
Marco Capó
Aritz Pérez
José A. Lozano
机构
[1] Basque Center for Applied Mathematics,Intelligent Systems Group, Department of Computer Science and Artifitial Intelligence
[2] University of the Basque Country UPV/EHU,undefined
来源
Statistics and Computing | 2023年 / 33卷
关键词
LASSO; Adaptative filtering; Streaming data; Homotopy;
D O I
暂无
中图分类号
学科分类号
摘要
Streaming data is ubiquitous in modern machine learning, and so the development of scalable algorithms to analyze this sort of information is a topic of current interest. On the other hand, the problem of l1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$l_1$$\end{document}-penalized least-square regression, commonly referred to as LASSO, is a quite popular data mining technique, which is commonly used for feature selection. In this work, we develop a homotopy-based solver for LASSO, on a streaming data context, that massively speeds up its convergence by extracting the most information out of the solution prior receiving the latest batch of data. Since these batches may show a non-stationary behavior, our solver also includes an adaptive filter that improves the predictability of our method in this scenario. Besides different theoretical properties, we additionally compare empirically our solver to the state-of-the-art: LARS, coordinate descent and Garrigues and Ghaoui’s data streaming homotopy. The obtained results show our approach to massively reduce the computational time require to convergence for the previous approaches, reducing up to 3, 4 and 5 orders of magnitude of running time with respect to LARS, coordinate descent and Garrigues and Ghaoui’s homotopy, respectively.
引用
收藏
相关论文
共 50 条
  • [31] Data-based design of robust fault detection and isolation residuals via LASSO optimization and Bayesian filtering
    Cascianelli, Silvia
    Costante, Gabriele
    Crocetti, Francesco
    Ricci, Elisa
    Valigi, Paolo
    Luca Fravolini, Mario
    ASIAN JOURNAL OF CONTROL, 2021, 23 (01) : 57 - 71
  • [32] Information Filtering Method for Twitter Streaming Data Using Human-in-the-Loop Machine Learning
    Suzuki, Yu
    Nakamura, Satoshi
    DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA 2018), PT II, 2018, 11030 : 167 - 175
  • [33] Weighted Lasso with Data Integration
    Bergersen, Linn Cecilie
    Glad, Ingrid K.
    Lyng, Heidi
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2011, 10 (01)
  • [34] Streaming data
    Szewczyk, William
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2011, 3 (01): : 22 - 29
  • [35] ADAPTATIVE LINEAR-FILTERING IN THE PRESENCE OF AN EVOLUTION NOISE OF POORLY KNOWN VARIANCE
    THANH, HH
    RECHERCHE AEROSPATIALE, 1979, (01): : 11 - 22
  • [36] Efficient Deadlock Avoidance for Streaming Computation with Filtering
    Buhler, Jeremy D.
    Agrawal, Kunal
    Li, Peng
    Chamberlain, Roger D.
    ACM SIGPLAN NOTICES, 2012, 47 (08) : 235 - 246
  • [37] QoS streaming based on a media filtering system
    Huang, CM
    Liu, PC
    Chang, RL
    PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, 2001, : 661 - 666
  • [38] Mapping Filtering Streaming Applications With Communication Costs
    Agrawal, Kunal
    Benoit, Anne
    Dufosse, Fanny
    Robert, Yves
    SPAA'09: PROCEEDINGS OF THE TWENTY-FIRST ANNUAL SYMPOSIUM ON PARALLELISM IN ALGORITHMS AND ARCHITECTURES, 2009, : 19 - 28
  • [39] An Incremental Approach for Collaborative Filtering in Streaming Scenarios
    Sreepada, Rama Syamala
    Patra, Bidyut Kr
    ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018), 2018, 10772 : 632 - 637
  • [40] LOW-RANK EXTENDED KALMAN FILTERING FOR ONLINE LEARNING OF NEURAL NETWORKS FROM STREAMING DATA
    Chang, Peter G.
    Duran-Martin, Gerardo
    Shestopaloff, Alex
    Jones, Matt
    Murphy, Kevin
    CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 232, 2023, 232 : 1025 - 1071