Learning Graphical Models from a Distributed Stream

被引:0
|
作者
Zhang, Yu [1 ]
Tirthapura, Srikanta [1 ]
Cormode, Graham [2 ]
机构
[1] Iowa State Univ, Elect & Comp Engn Dept, Ames, IA 50011 USA
[2] Univ Warwick, Coventry, W Midlands, England
来源
2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE) | 2018年
基金
美国国家科学基金会; 欧洲研究理事会;
关键词
D O I
10.1109/ICDE.2018.00071
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A current challenge for data management systems is to support the construction and maintenance of machine learning models over data that is large, multi-dimensional, and evolving. While systems that could support these tasks are emerging, the need to scale to distributed, streaming data requires new models and algorithms. In this setting, as well as computational scalability and model accuracy, we also need to minimize the amount of communication between distributed processors, which is the chief component of latency. We study Bayesian Networks, the workhorse of graphical models, and present a communication-efficient method for continuously learning and maintaining a Bayesian network model over data that is arriving as a distributed stream partitioned across multiple processors. We show a strategy for maintaining model parameters that leads to an exponential reduction in communication when compared with baseline approaches to maintain the exact MLE (maximum likelihood estimation). Meanwhile, our strategy provides similar prediction errors for the target distribution and for classification tasks.
引用
收藏
页码:725 / 736
页数:12
相关论文
共 50 条
  • [11] Learning graphical models with hubs
    Tan, Kean Ming
    London, Palma
    Mohan, Karthik
    Lee, Su-In
    Fazel, Maryam
    Witten, Daniela
    Journal of Machine Learning Research, 2015, 15 : 3297 - 3331
  • [12] Operations for Learning with Graphical Models
    Buntine, Wray L.
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1994, 2 : 159 - 225
  • [13] Distributed Covariance Estimation in Gaussian Graphical Models
    Wiesel, Ami
    Hero, Alfred O., III
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (01) : 211 - 220
  • [14] Distributed Parameter Estimation in Probabilistic Graphical Models
    Mizrahi, Yariv D.
    Denil, Misha
    de Freitas, Nando
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [15] Learning genetic and environmental graphical models from family data
    Ribeiro, Adele H.
    Maria Pavan Soler, Julia
    STATISTICS IN MEDICINE, 2020, 39 (18) : 2403 - 2422
  • [16] Learning Sparse Gaussian Graphical Models from Correlated Data
    Song, Zeyuan
    Gunn, Sophia
    Monti, Stefano
    Peloso, Gina Marie
    Liu, Ching-Ti
    Lunetta, Kathryn
    Sebastiani, Paola
    GENETIC EPIDEMIOLOGY, 2024, 48 (07) : 395 - 395
  • [17] Differentially Private Learning of Undirected Graphical Models Using Collective Graphical Models
    Bernstein, Garrett
    McKenna, Ryan
    Sun, Tao
    Sheldon, Daniel
    Hay, Michael
    Miklau, Gerome
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [18] Learning graphical models for hypothesis testing
    Sanghavi, Sujay
    Tan, Vincent
    Willsky, Alan
    2007 IEEE/SP 14TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2007, : 69 - 73
  • [19] Learning of Tree-Structured Gaussian Graphical Models on Distributed Data Under Communication Constraints
    Tavassolipour, Mostafa
    Motahari, Seyed Abolfazl
    Shalmani, Mohammad-Taghi Manzuri
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2019, 67 (01) : 17 - 28
  • [20] Marrying Graphical Models with Deep Learning
    Welling, Max
    ERCIM NEWS, 2016, (107): : 20 - 21