Parallelizing Astronomical Source Extraction on the GPU

被引:1
|
作者
Zhao, Baoxue [1 ]
Luo, Qiong [1 ]
Wu, Chao [2 ]
机构
[1] HKUST, Dept Comp Sci & Engn, Kowloon, Hong Kong, Peoples R China
[2] Chinese Acad Sci, Natl Astron Observ, Beijing 100864, Peoples R China
基金
中国国家自然科学基金;
关键词
GPU; Source Extraction; SExtractor; Detection; SOFTWARE;
D O I
10.1109/eScience.2013.10
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In astronomical observatory projects, raw images are processed so that information about the celestial objects in the images is extracted into catalogs. As such, this source extraction is the basis for the various analysis tasks that are subsequently performed on the catalog products. With the rapid progress of new, large astronomical projects, observational images will be produced every few seconds. This high speed of image production requires fast source extraction. Unfortunately, current source extraction tools cannot meet the speed requirement. To address this problem, we propose to use the GPU (Graphics Processing Unit) to accelerate source extraction. Specifically, we start from SExtractor, an astronomical source extraction tool widely used in astronomy projects, and study its parallelization on the GPU. We identify the object detection and deblending components as the most complex and time-consuming, and design a parallel connected component labelling algorithm for detection and a parallel object tree pruning method for deblending respectively on the GPU. We further parallelize other components, including cleaning, background subtraction, and measurement, effectively on the GPU, such that the entire source extraction is done on the GPU. We have evaluated our GPU-SExtractor in comparison with the original SExtractor on a desktop with an Intel i7 CPU and an NVIDIA GTX670 GPU on a set of real-world and synthetic astronomical images of different sizes. Our results show that the GPU-SExtractor outperforms the original SExtractor by a factor of 6, taking a merely 1.9 second to process a typical 4KX4K image containing 167 thousands objects.
引用
收藏
页码:88 / 97
页数:10
相关论文
共 50 条
  • [21] Parallelizing Dynamic Time Warping Algorithm Using Prefix Computations on GPU
    Xiao, Limin
    Zheng, Yao
    Tang, Wenqi
    Yao, Guangchao
    Ruan, Li
    2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, : 294 - 299
  • [22] Parallelizing Multimodal Background Modeling on a Low-Power Integrated GPU
    Azmat, Shoaib
    Wills, Linda
    Wills, Scott
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2017, 88 (01): : 43 - 53
  • [23] Parallelizing Network Coding on Manycore GPU-Accelerated System with Optimization
    Gan, Xinbiao
    Shen, Li
    Zhu, Qi
    Wang, Zhiying
    CEIS 2011, 2011, 15
  • [24] A lock-free approach to parallelizing personalized PageRank computations on GPU
    WANG Zhigang
    WANG Ning
    NIE Jie
    WEI Zhiqiang
    GU Yu
    YU Ge
    Frontiers of Computer Science, 2023, 17 (01)
  • [25] Parallelizing Single Source Shortest Path with OpenSHMEM
    Aderholdt, Ferrol
    Graves, Jeffrey A.
    Venkata, Manjunath Gorentla
    OPENSHMEM AND RELATED TECHNOLOGIES: BIG COMPUTE AND BIG DATA CONVERGENCE, OPENSHMEM 2017, 2018, 10679 : 65 - 81
  • [26] GPU-S2S: A source to source compiler for GPU
    Dong, X. (xsdong@mail.xjtu.edu.cn), 2012, Inst. of Scientific and Technical Information of China (22):
  • [27] GPU parallelizing atomic energy calculations using explicitly correlated Gaussian functions
    Wall, Zachary
    Cafiero, Mauricio
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 257
  • [28] Parallelizing Epistasis Detection in GWAS on FPGA and GPU-Accelerated Computing Systems
    Gonzalez-Dominguez, Jorge
    Wienbrandt, Lars
    Kaessens, Jan Christian
    Ellinghaus, David
    Schimmler, Manfred
    Schmidt, Bertil
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (05) : 982 - 994
  • [29] Parallelizing the cryo-EM structure determination in THUNDER using GPU cluster
    Wang, Zhao
    Ruan, Huabin
    Yang, Guangwen
    Li, Xueming
    ENGINEERING REPORTS, 2025, 7 (01)
  • [30] Lightweight Dependency Checking for Parallelizing Loops with Non-deterministic Dependency on GPU
    Liu, Hongyuan
    Lam, King Tin
    Lin, Huanxin
    Wang, Cho-Li
    Ma, Junchao
    2016 IEEE 22ND INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS), 2016, : 884 - 893