Exploring parallel multi-GPU local search strategies in a metaheuristic framework

被引：23

作者：

Rios, Eyder ^{[1
,2
]}

Ochi, Luiz Satoru ^{[2
]}

Boeres, Cristina ^{[2
]}

Coelho, Vitor N. ^{[2
]}

Coelho, Igor M. ^{[3
]}

Farias, Ricardo ^{[4
]}

机构：

[1] Univ Estadual Piaui UESPI, Parnaiba, PI, Brazil

[2] Univ Fed Fluminense, Inst Comp, Niteroi, RJ, Brazil

[3] Univ Estado Rio De Janeiro, Rio De Janeiro, RJ, Brazil

[4] Univ Fed Rio de Janeiro, COPPE Sistemas, Rio de Janeiro, RJ, Brazil

来源：

JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING | 2018年 / 111卷

关键词：

Multi-GPU; Parallel metaheuristic; Local search; Minimum latency problem; VND; GRASP; ILS; COMBINATORIAL OPTIMIZATION; TRAVELING SALESMAN; IMPLEMENTATION; ALGORITHM;

D O I：

10.1016/j.jpdc.2017.06.011

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Optimization tasks are often complex, CPU-time consuming and usually deal with finding the best (or good enough) solution among alternatives for a given problem. Parallel metaheuristics have been used in many real-world and scientific applications to efficiently solve these kind of problems. Local Search (LS) is an essential component for some metaheuristics and, very often, represents the dominant computational effort accomplished by an algorithm. Several metaheuristic approaches try to adapt traditional LS models to parallel platforms without considering the intrinsic features of the available architectures. In this work, we present a novel local search strategy, so-called Distributed Variable Neighborhood Descent (DVND), specially designed for CPU and multi-GPU environment. Furthermore, a new neighborhood search strategy, so-called Multi Improvement, is introduced, taking advantage of GPU massive parallelism in order to boost up LS procedures. A hard combinatorial problem is considered as case of study, the Minimum Latency Problem (MLP). For tackling this problem, a hybrid metaheuristic algorithm is considered, which combines good quality initial solutions, generated by a Greedy Randomized Adaptive Search Procedures, with a flexible and powerful refinement procedure, inside the scope of an Iterated Local Search. The DVND was compared to the classic local search procedures, producing results that outperformed the best known sequential algorithm found in the literature. The speedups ranged from 7.3 to 13.7, for the larger MLP instances with 500 to 1000 clients. Results demonstrate the effectiveness of the proposed techniques in terms of solution quality, performance and scalability. (C) 2017 Elsevier Inc. All rights reserved.

引用

页码：39 / 55

页数：17

共 50 条

[41] Parallel Computing Model and Performance Prediction based on Multi-GPU Environments
Wang, Zhuowei
Xu, Xianbin
Zhao, Wuqing
2011 INTERNATIONAL CONFERENCE ON FUTURE COMPUTERS IN EDUCATION (ICFCE 2011), VOL I, 2011, : 309 - 312
[42] Exploring Complex Brain-Simulation Workloads on Multi-GPU Deployments
van der Vlag, Michiel A.
Smaragdos, Georgios
Al-Ars, Zaid
Strydis, Christos
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2019, 16 (04)
[43] Parallel Generation of Digitally Reconstructed Radiographs on Heterogeneous Multi-GPU Workstations
Abdellah, Marwan
Abdelaziz, Asem
Ali, Eslam
Abdelaziz, Sherief
Sayed, Abdelrahman
Owis, Mohamed I.
Eldeib, Ayman
2016 38TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2016, : 3953 - 3956
[44] Parallel Algorithm for Landform Attributes Representation on Multicore and Multi-GPU Systems
Boratto, Murilo
Alonso, Pedro
Ramiro, Carla
Barreto, Marcos
Coelho, Leandro
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2012, PT I, 2012, 7333 : 29 - 43
[45] A multi-GPU protein database search model with hybrid alignment manner on distributed GPU clusters
Zhou, Wei
Cai, Zhanxiu
Lian, Bo
Wang, Jincai
Ma, Jianping
Sun, Bin
Yu, Qian
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (18):
[46] Efficient Breadth First Search on Multi-GPU Systems Using GPU-Centric OpenSHMEM
Potluri, Sreeram
Goswami, Anshuman
Venkata, Manjunath Gorentla
Imam, Neena
OPENSHMEM AND RELATED TECHNOLOGIES: BIG COMPUTE AND BIG DATA CONVERGENCE, OPENSHMEM 2017, 2018, 10679 : 82 - 96
[47] Parallel Singular Value Decomposition on Heterogeneous Multi-core and Multi-GPU Platforms
Feng, Xiaowen
Jin, Hai
Zheng, Ran
Zhu, Lei
2014 NINTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM), 2014, : 45 - 50
[48] CASE: A Compiler-Assisted SchEduling Framework for Multi-GPU Systems
Chen, Chao
Porter, Chris
Pande, Santosh
PPOPP'22: PROCEEDINGS OF THE 27TH ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, 2022, : 17 - 31
[49] MG-GCN: A Scalable multi-GPU GCN Training Framework
Balin, Muhammed Fatih
Sancak, Kaan
Catalyurekt, Umit V.
51ST INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2022, 2022,
[50] A multi-GPU accelerated virtual-reality interaction simulation framework
Shao, Xuqiang
Xu, Weifeng
Lin, Lina
Zhang, Fengquan
PLOS ONE, 2019, 14 (04):

← 1 2 3 4 5 →