Efficient GPU Implementations of Post-Quantum Signature XMSS

被引:6
|
作者
Wang, Ziheng [1 ]
Dong, Xiaoshe [1 ]
Chen, Heng [1 ]
Kang, Yan [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Comp Sci & Technol, Xian 710049, Peoples R China
关键词
Post-quantum cryptography; stateful hash-based signatures; XMSS; XMSSMT; parallel computing; GPU; FRODOKEM;
D O I
10.1109/TPDS.2022.3233348
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The National Institute of Standards and Technology (NIST) approved XMSS as part of the post-quantum cryptography (PQC) development effort in 2018. XMSS is currently one of only two standardized PQC algorithms, but its performance limits its use. For example, the fastest record for some standardized parameters still takes more than a minute to generate a keypair. In this article, we present the first GPU implementation for XMSS and its variant XMSSMT. The high parallelism of GPUs is especially effective for reducing latency in key generation and improving throughput for signing and verifying. In order to meet various application scenarios, we provide three parallel XMSS schemes: algorithmic parallelism, multi-keypair data parallelism, and single-keypair data parallelism. For these schemes, we design custom parallel strategies that use more than 10,000 cores for all parameters provided by NIST. In addition, we analyze the availability of most previous serial optimizations and explore numerous techniques to fully exploit GPU performance. Our evaluations are made with the XMSSMT-SHA2_20/2_256 parameter set on a GeForce RTX 3090. The result shows the key generation latency is 3.20 ms, a speedup of 21,899x compared to the GPU ported version, which is also 54x speedup faster than the fastest work (174 ms). When 16384 tasks are executed, the throughput (task/s) for signing/verifying in the single-key and multi-key cases is 311,424/415,100 and 145,100/419,887, respectively. Compared to the throughput for signing/verifying (1695/ 4000) of the fastest work, we obtain a speedup of 184x/104x and 86x/105x in single-key and multi-key cases, respectively.
引用
收藏
页码:938 / 954
页数:17
相关论文
共 50 条
  • [1] CUSPX: Efficient GPU Implementations of Post-Quantum Signature SPHINCS+
    Wang, Ziheng
    Dong, Xiaoshe
    Chen, Heng
    Kang, Yan
    Wang, Qiang
    IEEE TRANSACTIONS ON COMPUTERS, 2025, 74 (01) : 15 - 28
  • [2] cuNH: Efficient GPU Implementations of Post-Quantum KEM NewHope
    Gao, Yiwen
    Xu, Jia
    Wang, Hongbing
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (03) : 551 - 568
  • [3] ASIC Accelerator in 28 nm for the Post-Quantum Digital Signature Scheme XMSS
    Mohan, Prashanth
    Wang, Wen
    Jungk, Bernhard
    Niederhagen, Ruben
    Szefer, Jakub
    Mai, Ken
    2020 IEEE 38TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD 2020), 2020, : 656 - 662
  • [4] An Efficient Post-quantum Identity-Based Signature
    YANG Zhichao
    DUNG H.Duong
    WILLY Susilo
    YANG Guomin
    LI Chao
    CHEN Rongmao
    Chinese Journal of Electronics, 2021, 30 (02) : 238 - 248
  • [5] Post-quantum signature schemes for efficient hardware implementation
    Moldovyan, Dmitriy N.
    Moldovyan, Alexandr A.
    Moldovyan, Nikolay A.
    MICROPROCESSORS AND MICROSYSTEMS, 2021, 80
  • [6] An Efficient Post-quantum Identity-Based Signature
    Zhichao, Yang
    Duong, Dung H.
    Susilo, Willy
    Guomin, Yang
    Chao, Li
    Rongmao, Chen
    CHINESE JOURNAL OF ELECTRONICS, 2021, 30 (02) : 238 - 248
  • [7] Efficient Parallelism of Post-Quantum Signature Scheme SPHINCS
    Sun, Shuzhou
    Zhang, Rui
    Ma, Hui
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (11) : 2542 - 2555
  • [8] Parallel implementations of post-quantum leighton-Micali signature on multiple nodes
    Kang, Yan
    Dong, Xiaoshe
    Wang, Ziheng
    Chen, Heng
    Wang, Qiang
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (04): : 5042 - 5072
  • [9] Parallel implementations of post-quantum leighton-Micali signature on multiple nodes
    Yan Kang
    Xiaoshe Dong
    Ziheng Wang
    Heng Chen
    Qiang Wang
    The Journal of Supercomputing, 2024, 80 : 5042 - 5072
  • [10] High-Throughput GPU Implementation of Dilithium Post-Quantum Digital Signature
    Shen, Shiyu
    Yang, Hao
    Dai, Wangchen
    Zhang, Hong
    Liu, Zhe
    Zhao, Yunlei
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 35 (11) : 1964 - 1976