Learning Graphical Models from a Distributed Stream

被引:0
|
作者
Zhang, Yu [1 ]
Tirthapura, Srikanta [1 ]
Cormode, Graham [2 ]
机构
[1] Iowa State Univ, Elect & Comp Engn Dept, Ames, IA 50011 USA
[2] Univ Warwick, Coventry, W Midlands, England
基金
美国国家科学基金会; 欧洲研究理事会;
关键词
D O I
10.1109/ICDE.2018.00071
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A current challenge for data management systems is to support the construction and maintenance of machine learning models over data that is large, multi-dimensional, and evolving. While systems that could support these tasks are emerging, the need to scale to distributed, streaming data requires new models and algorithms. In this setting, as well as computational scalability and model accuracy, we also need to minimize the amount of communication between distributed processors, which is the chief component of latency. We study Bayesian Networks, the workhorse of graphical models, and present a communication-efficient method for continuously learning and maintaining a Bayesian network model over data that is arriving as a distributed stream partitioned across multiple processors. We show a strategy for maintaining model parameters that leads to an exponential reduction in communication when compared with baseline approaches to maintain the exact MLE (maximum likelihood estimation). Meanwhile, our strategy provides similar prediction errors for the target distribution and for classification tasks.
引用
收藏
页码:725 / 736
页数:12
相关论文
共 50 条
  • [1] Privacy Preserving Distributed Structure Learning of Probabilistic Graphical Models
    Li, Husheng
    2013 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2013, : 188 - 193
  • [2] Learning graphical models from the Glauber dynamics
    Bresler, Guy
    Gamarnik, David
    Shah, Devavrat
    2014 52ND ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2014, : 1148 - 1155
  • [3] Learning Graphical Models From the Glauber Dynamics
    Bresler, Guy
    Gamarnik, David
    Shah, Devavrat
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2018, 64 (06) : 4072 - 4080
  • [4] Learning posisibilistic graphical models from data
    Borgelt, C
    Kruse, R
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2003, 11 (02) : 159 - 172
  • [5] GRAPHICAL SPECIFICATION OF DISTRIBUTED SIMULATION MODELS
    Penzes, Jiri
    Kavicka, Antonin
    EUROPEAN SIMULATION AND MODELLING CONFERENCE 2013, 2013, : 128 - 133
  • [6] Learning Spatiotemporal Graphical Models From Incomplete Observations
    Javaheri, Amirhossein
    Amini, Arash
    Marvasti, Farokh
    Palomar, Daniel P.
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 1361 - 1374
  • [7] Learning from imprecise data: possibilistic graphical models
    Borgelt, C
    Kruse, R
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2002, 38 (04) : 449 - 463
  • [8] Learning Graphical Models With Hubs
    Tan, Kean Ming
    London, Palma
    Mohan, Karthik
    Lee, Su-In
    Fazel, Maryam
    Witten, Daniela
    JOURNAL OF MACHINE LEARNING RESEARCH, 2014, 15 : 3297 - 3331
  • [9] Learning Graphical Game Models
    Duong, Quang
    Vorobeychik, Yevgeniy
    Singh, Satinder
    Wellman, Michael P.
    21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, 2009, : 116 - 121
  • [10] Scalable Learning of Graphical Models
    Petitjean, Francois
    Webb, Geoffrey I.
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 2131 - 2132