Exploring parallel multi-GPU local search strategies in a metaheuristic framework

被引:23
|
作者
Rios, Eyder [1 ,2 ]
Ochi, Luiz Satoru [2 ]
Boeres, Cristina [2 ]
Coelho, Vitor N. [2 ]
Coelho, Igor M. [3 ]
Farias, Ricardo [4 ]
机构
[1] Univ Estadual Piaui UESPI, Parnaiba, PI, Brazil
[2] Univ Fed Fluminense, Inst Comp, Niteroi, RJ, Brazil
[3] Univ Estado Rio De Janeiro, Rio De Janeiro, RJ, Brazil
[4] Univ Fed Rio de Janeiro, COPPE Sistemas, Rio de Janeiro, RJ, Brazil
关键词
Multi-GPU; Parallel metaheuristic; Local search; Minimum latency problem; VND; GRASP; ILS; COMBINATORIAL OPTIMIZATION; TRAVELING SALESMAN; IMPLEMENTATION; ALGORITHM;
D O I
10.1016/j.jpdc.2017.06.011
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Optimization tasks are often complex, CPU-time consuming and usually deal with finding the best (or good enough) solution among alternatives for a given problem. Parallel metaheuristics have been used in many real-world and scientific applications to efficiently solve these kind of problems. Local Search (LS) is an essential component for some metaheuristics and, very often, represents the dominant computational effort accomplished by an algorithm. Several metaheuristic approaches try to adapt traditional LS models to parallel platforms without considering the intrinsic features of the available architectures. In this work, we present a novel local search strategy, so-called Distributed Variable Neighborhood Descent (DVND), specially designed for CPU and multi-GPU environment. Furthermore, a new neighborhood search strategy, so-called Multi Improvement, is introduced, taking advantage of GPU massive parallelism in order to boost up LS procedures. A hard combinatorial problem is considered as case of study, the Minimum Latency Problem (MLP). For tackling this problem, a hybrid metaheuristic algorithm is considered, which combines good quality initial solutions, generated by a Greedy Randomized Adaptive Search Procedures, with a flexible and powerful refinement procedure, inside the scope of an Iterated Local Search. The DVND was compared to the classic local search procedures, producing results that outperformed the best known sequential algorithm found in the literature. The speedups ranged from 7.3 to 13.7, for the larger MLP instances with 500 to 1000 clients. Results demonstrate the effectiveness of the proposed techniques in terms of solution quality, performance and scalability. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:39 / 55
页数:17
相关论文
共 50 条
  • [41] Parallel Computing Model and Performance Prediction based on Multi-GPU Environments
    Wang, Zhuowei
    Xu, Xianbin
    Zhao, Wuqing
    2011 INTERNATIONAL CONFERENCE ON FUTURE COMPUTERS IN EDUCATION (ICFCE 2011), VOL I, 2011, : 309 - 312
  • [42] Exploring Complex Brain-Simulation Workloads on Multi-GPU Deployments
    van der Vlag, Michiel A.
    Smaragdos, Georgios
    Al-Ars, Zaid
    Strydis, Christos
    ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2019, 16 (04)
  • [43] Parallel Generation of Digitally Reconstructed Radiographs on Heterogeneous Multi-GPU Workstations
    Abdellah, Marwan
    Abdelaziz, Asem
    Ali, Eslam
    Abdelaziz, Sherief
    Sayed, Abdelrahman
    Owis, Mohamed I.
    Eldeib, Ayman
    2016 38TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2016, : 3953 - 3956
  • [44] Parallel Algorithm for Landform Attributes Representation on Multicore and Multi-GPU Systems
    Boratto, Murilo
    Alonso, Pedro
    Ramiro, Carla
    Barreto, Marcos
    Coelho, Leandro
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2012, PT I, 2012, 7333 : 29 - 43
  • [45] A multi-GPU protein database search model with hybrid alignment manner on distributed GPU clusters
    Zhou, Wei
    Cai, Zhanxiu
    Lian, Bo
    Wang, Jincai
    Ma, Jianping
    Sun, Bin
    Yu, Qian
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (18):
  • [46] Efficient Breadth First Search on Multi-GPU Systems Using GPU-Centric OpenSHMEM
    Potluri, Sreeram
    Goswami, Anshuman
    Venkata, Manjunath Gorentla
    Imam, Neena
    OPENSHMEM AND RELATED TECHNOLOGIES: BIG COMPUTE AND BIG DATA CONVERGENCE, OPENSHMEM 2017, 2018, 10679 : 82 - 96
  • [47] Parallel Singular Value Decomposition on Heterogeneous Multi-core and Multi-GPU Platforms
    Feng, Xiaowen
    Jin, Hai
    Zheng, Ran
    Zhu, Lei
    2014 NINTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM), 2014, : 45 - 50
  • [48] CASE: A Compiler-Assisted SchEduling Framework for Multi-GPU Systems
    Chen, Chao
    Porter, Chris
    Pande, Santosh
    PPOPP'22: PROCEEDINGS OF THE 27TH ACM SIGPLAN SYMPOSIUM ON PRINCIPLES AND PRACTICE OF PARALLEL PROGRAMMING, 2022, : 17 - 31
  • [49] MG-GCN: A Scalable multi-GPU GCN Training Framework
    Balin, Muhammed Fatih
    Sancak, Kaan
    Catalyurekt, Umit V.
    51ST INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2022, 2022,
  • [50] A multi-GPU accelerated virtual-reality interaction simulation framework
    Shao, Xuqiang
    Xu, Weifeng
    Lin, Lina
    Zhang, Fengquan
    PLOS ONE, 2019, 14 (04):