Parallel ILP for distributed-memory architectures

被引:18
|
作者
Fonseca, Nuno A. [1 ,2 ]
Srinivasan, Ashwin [3 ,4 ,5 ]
Silva, Fernando [2 ,6 ]
Camacho, Rui [7 ,8 ]
机构
[1] Univ Porto, IBMC, P-4169007 Oporto, Portugal
[2] Univ Porto, CRACS, P-4169007 Oporto, Portugal
[3] Indian Inst Technol, IBM India Res Lab, New Delhi 110016, India
[4] Univ New S Wales, Dept CSE, Sydney, NSW 2052, Australia
[5] Univ New S Wales, Ctr Hlth Informat, Sydney, NSW 2052, Australia
[6] Univ Porto, Fac Ciencias, P-4169007 Oporto, Portugal
[7] Univ Porto, LIAAD, P-4200465 Oporto, Portugal
[8] Univ Porto, Fac Engn, P-4200465 Oporto, Portugal
关键词
ILP; Parallelism; Efficiency;
D O I
10.1007/s10994-008-5094-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The growth of machine-generated relational databases, both in the sciences and in industry, is rapidly outpacing our ability to extract useful information from them by manual means. This has brought into focus machine learning techniques like Inductive Logic Programming (ILP) that are able to extract human-comprehensible models for complex relational data. The price to pay is that ILP techniques are not efficient: they can be seen as performing a form of discrete optimisation, which is known to be computationally hard; and the complexity is usually some super-linear function of the number of examples. While little can be done to alter the theoretical bounds on the worst-case complexity of ILP systems, some practical gains may follow from the use of multiple processors. In this paper we survey the state-of-the-art on parallel ILP. We implement several parallel algorithms and study their performance using some standard benchmarks. The principal findings of interest are these: (1) of the techniques investigated, one that simply constructs models in parallel on each processor using a subset of data and then combines the models into a single one, yields the best results; and (2) sequential (approximate) ILP algorithms based on randomized searches have lower execution times than (exact) parallel algorithms, without sacrificing the quality of the solutions found.
引用
收藏
页码:257 / 279
页数:23
相关论文
共 50 条
  • [41] Parallel sparse orthogonal factorization on distributed-memory multiprocessors
    Sun, CG
    SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1996, 17 (03): : 666 - 685
  • [42] Algorithm 1033: Parallel Implementations for Computing the Minimum Distance of a Random Linear Code on Distributed-memory Architectures
    Quintana-Orti, Gregorio
    Hernando, Fernando
    Igual, Francisco D.
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2023, 49 (01):
  • [43] Efficient Lagrangian particle tracking algorithms for distributed-memory architectures
    Baldan, Giacomo
    Bellosta, Tommaso
    Guardone, Alberto
    COMPUTERS & FLUIDS, 2023, 256
  • [44] A NEW PROGRAM TRANSFORMATION TO MINIMIZE COMMUNICATION IN DISTRIBUTED-MEMORY ARCHITECTURES
    OBOYLE, M
    HEDAYAT, GA
    LECTURE NOTES IN COMPUTER SCIENCE, 1992, 605 : 347 - 362
  • [45] A Parallel, Distributed-Memory Framework for Comparative Motif Discovery
    De Witte, Dieter
    Van Bel, Michiel
    Audenaert, Pieter
    Demeester, Piet
    Dhoedt, Bart
    Vandepoele, Klaas
    Fostier, Jan
    PARALLEL PROCESSING AND APPLIED MATHEMATICS (PPAM 2013), PT II, 2014, 8385 : 268 - 277
  • [46] PARALLEL COMPUTATION OF GROBNER BASES ON DISTRIBUTED-MEMORY MACHINES
    SAWADA, H
    TERASAKI, S
    AIBA, A
    JOURNAL OF SYMBOLIC COMPUTATION, 1994, 18 (03) : 207 - 222
  • [47] A PARALLEL VECTOR EQUATION SOLVER FOR DISTRIBUTED-MEMORY COMPUTERS
    QIN, JN
    NGUYEN, DT
    COMPUTING SYSTEMS IN ENGINEERING, 1994, 5 (01): : 19 - 25
  • [48] Parallel volume rendering on distributed-memory multiprocessor system
    Chen, WP
    Deng, JH
    Tang, ZS
    SECOND INTERNATION CONFERENCE ON IMAGE AND GRAPHICS, PTS 1 AND 2, 2002, 4875 : 969 - 974
  • [49] MAPPING UNIFORM LOOP NESTS ONTO DISTRIBUTED-MEMORY ARCHITECTURES
    DARTE, A
    ROBERT, Y
    PARALLEL COMPUTING, 1994, 20 (05) : 679 - 710
  • [50] Parallel FP-LAPW for distributed-memory machines
    Dohmen, R
    Pichlmeier, J
    Petersen, M
    Wagner, F
    Scheffler, M
    COMPUTING IN SCIENCE & ENGINEERING, 2001, 3 (04) : 18 - 29