Parallel ILP for distributed-memory architectures

Cited by: 18

Authors:

Fonseca, Nuno A. ^{[1,2]}

Srinivasan, Ashwin ^{[3,4,5]}

Silva, Fernando ^{[2,6]}

Camacho, Rui ^{[7,8]}

Affiliations:

[1] Univ Porto, IBMC, P-4169007 Oporto, Portugal

[2] Univ Porto, CRACS, P-4169007 Oporto, Portugal

[3] Indian Inst Technol, IBM India Res Lab, New Delhi 110016, India

[4] Univ New S Wales, Dept CSE, Sydney, NSW 2052, Australia

[5] Univ New S Wales, Ctr Hlth Informat, Sydney, NSW 2052, Australia

[6] Univ Porto, Fac Ciencias, P-4169007 Oporto, Portugal

[7] Univ Porto, LIAAD, P-4200465 Oporto, Portugal

[8] Univ Porto, Fac Engn, P-4200465 Oporto, Portugal

Source:

MACHINE LEARNING | 2009 / Vol. 74 / Issue 03

Keywords:

ILP; Parallelism; Efficiency;

DOI:

10.1007/s10994-008-5094-2

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The growth of machine-generated relational databases, both in the sciences and in industry, is rapidly outpacing our ability to extract useful information from them by manual means. This has brought into focus machine learning techniques like Inductive Logic Programming (ILP) that are able to extract human-comprehensible models for complex relational data. The price to pay is that ILP techniques are not efficient: they can be seen as performing a form of discrete optimisation, which is known to be computationally hard; and the complexity is usually some super-linear function of the number of examples. While little can be done to alter the theoretical bounds on the worst-case complexity of ILP systems, some practical gains may follow from the use of multiple processors. In this paper we survey the state-of-the-art on parallel ILP. We implement several parallel algorithms and study their performance using some standard benchmarks. The principal findings of interest are these: (1) of the techniques investigated, one that simply constructs models in parallel on each processor using a subset of data and then combines the models into a single one, yields the best results; and (2) sequential (approximate) ILP algorithms based on randomized searches have lower execution times than (exact) parallel algorithms, without sacrificing the quality of the solutions found.

Pages: 257-279

Page count: 23
