Parallelizing Astronomical Source Extraction on the GPU

被引:1
|
作者
Zhao, Baoxue [1 ]
Luo, Qiong [1 ]
Wu, Chao [2 ]
机构
[1] HKUST, Dept Comp Sci & Engn, Kowloon, Hong Kong, Peoples R China
[2] Chinese Acad Sci, Natl Astron Observ, Beijing 100864, Peoples R China
基金
中国国家自然科学基金;
关键词
GPU; Source Extraction; SExtractor; Detection; SOFTWARE;
D O I
10.1109/eScience.2013.10
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In astronomical observatory projects, raw images are processed so that information about the celestial objects in the images is extracted into catalogs. As such, this source extraction is the basis for the various analysis tasks that are subsequently performed on the catalog products. With the rapid progress of new, large astronomical projects, observational images will be produced every few seconds. This high speed of image production requires fast source extraction. Unfortunately, current source extraction tools cannot meet the speed requirement. To address this problem, we propose to use the GPU (Graphics Processing Unit) to accelerate source extraction. Specifically, we start from SExtractor, an astronomical source extraction tool widely used in astronomy projects, and study its parallelization on the GPU. We identify the object detection and deblending components as the most complex and time-consuming, and design a parallel connected component labelling algorithm for detection and a parallel object tree pruning method for deblending respectively on the GPU. We further parallelize other components, including cleaning, background subtraction, and measurement, effectively on the GPU, such that the entire source extraction is done on the GPU. We have evaluated our GPU-SExtractor in comparison with the original SExtractor on a desktop with an Intel i7 CPU and an NVIDIA GTX670 GPU on a set of real-world and synthetic astronomical images of different sizes. Our results show that the GPU-SExtractor outperforms the original SExtractor by a factor of 6, taking a merely 1.9 second to process a typical 4KX4K image containing 167 thousands objects.
引用
收藏
页码:88 / 97
页数:10
相关论文
共 50 条
  • [41] Parallelizing image feature extraction on coarse-grain machines
    Chung, YW
    Prasanna, VK
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1998, 20 (12) : 1389 - 1394
  • [42] GPU accelerated processing of astronomical high frame-rate videosequences
    Vitek, Stanislav
    Svihlik, Jan
    Krasula, Lukas
    Fliegel, Karel
    Pata, Petr
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XXXVIII, 2015, 9599
  • [43] Parallelizing Fast Multipole Method for Large-Scale Electromagnetic Problems Using GPU Clusters
    Nguyen, Quang M.
    Vinh Dang
    Kilic, Ozlem
    El-Araby, Esam
    IEEE ANTENNAS AND WIRELESS PROPAGATION LETTERS, 2013, 12 : 868 - 871
  • [44] Parallelizing the Hamiltonian Computation in DQMC Simulations: Checkerboard Method for Sparse Matrix Exponentials on Multicore and GPU
    Lee, Che-Rung
    Chen, Zhi-Hung
    Kao, Quey-Liang
    2012 IEEE 26TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS & PHD FORUM (IPDPSW), 2012, : 1889 - 1897
  • [45] An Empirical Study of Parallelizing Test Execution Using CUDA Unified Memory and OpenMP GPU Offloading
    Bagies, Taghreed
    Jannesari, Ali
    2021 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION WORKSHOPS (ICSTW 2021), 2021, : 271 - 278
  • [46] Parallelizing Broad Phase Collision Detection for Animation in Games: A Performance Comparison of CPU and GPU Algorithms
    Serpa, Ygor R.
    Rodrigues, Maria Andreia F.
    2014 BRAZILIAN SYMPOSIUM ON COMPUTER GAMES AND DIGITAL ENTERTAINMENT (SBGAMES 2014), 2014, : 80 - 88
  • [47] Parallelizing Multilevel Fast Multipole Algorithm for Large-Scale Electromagnetic Problem on GPU clusters
    Nghia Tran
    Tuan Phan
    Kilic, Ozlem
    2016 IEEE/ACES INTERNATIONAL CONFERENCE ON WIRELESS INFORMATION TECHNOLOGY AND SYSTEMS (ICWITS) AND APPLIED COMPUTATIONAL ELECTROMAGNETICS (ACES), 2016,
  • [48] Enhancing QR Decomposition: A GPU-Based Approach to Parallelizing the Householder Algorithm with CUDA Streams
    Eshwar, Uppu
    Chatterjee, Soumyajit
    Peri, Sathya
    DISTRIBUTED COMPUTING AND INTELLIGENT TECHNOLOGY, ICDCIT 2025, 2025, 15507 : 146 - 161
  • [49] CNN Architecture Extraction on Edge GPU
    Horvath, Peter
    Chmielewski, Lukasz
    Weissbart, Leo
    Batina, Lejla
    Yarom, Yuval
    APPLIED CRYPTOGRAPHY AND NETWORK SECURITY WORKSHOPS, PT I, ACNS 2024-AIBLOCK 2024, AIHWS 2024, AIOTS 2024, SCI 2024, AAC 2024, SIMLA 2024, LLE 2024, AND CIMSS 2024, 2024, 14586 : 158 - 175
  • [50] OPTIMIZED MFCC FEATURE EXTRACTION ON GPU
    Kou, Haofeng
    Shang, Weijia
    Lane, Ian
    Chong, Jike
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7130 - 7134