Efficient Low-Latency Hardware Architecture for Module-Lattice-Based Digital Signature Standard

被引:1
|
作者
Truong, Quang Dang [1 ]
Duong, Phap Ngoc [1 ,2 ]
Lee, Hanho [1 ]
机构
[1] Inha Univ, Dept Elect & Comp Engn, Incheon 22212, South Korea
[2] Univ Danang, Vietnam Korea Univ Informat & Commun Technol, Fac Comp Engn & Elect, Da Nang 50000, Vietnam
关键词
Computer architecture; Digital signatures; Standards; NIST; Arithmetic; Low latency communication; Quantum computing; Cryptography; Lattices; Public key cryptography; Field programmable gate arrays; Security management; Hardware security; Post-quantum cryptography (PQC); module-lattice-based digital signature standard (ML-DSA); crystals-Dilithium; lattice-based cryptography (LBC); number theoretic transform (NTT);
D O I
10.1109/ACCESS.2024.3370470
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid advancement of powerful quantum computers poses a significant security risk to current public-key cryptosystems, which heavily rely on the computational complexity of problems such as discrete logarithms and integer factorization. As a result, CRYSTALS-Dilithium, a lattice-based digital signature scheme with the potential to be an alternative algorithm that can withstand both quantum and classical attacks, has been standardized as ML-DSA after NIST Post-Quantum Cryptography competition. While prior studies have proposed hardware designs to accelerate this cryptosystem, there is room for further optimization in the tradeoff between performance and hardware consumption. This paper addresses these limitations by presenting an efficient low-latency hardware architecture for ML-DSA, leveraging optimized timing schedules for its three main algorithms. The hardware implementation enables runtime switching main operations in ML-DSA with various security levels. We design flexible arithmetic and hash modules tailored for ML-DSA, the most time-consuming submodules and key determinants of the scheme implementation. Combined with efficient operation scheduling to maximize the utilized time of submodules, our design achieves the best latency among FPGA-based implementations, outperforming stateof-the-art works by 1.27 similar to 2.58x in terms of the area-time tradeoff metric. Therefore, the proposed hardware architecture demonstrates its practical applicability for digital signature cryptosystems in post-quantum era.
引用
收藏
页码:32395 / 32407
页数:13
相关论文
共 50 条
  • [31] A Hardware Architecture of NIST Lightweight Cryptography Applied in IPSec to Secure High-Throughput Low-Latency IoT Networks
    Tran, Sy-Nam
    Hoang, Van-Thuc
    Bui, Duy-Hieu
    IEEE ACCESS, 2023, 11 : 89240 - 89248
  • [32] PPS: A Low-Latency and Low-Complexity Switching Architecture Based on Packet Prefetch and Arbitration Prediction
    Dai, Yi
    Wu, Ke
    Lai, Mingche
    Li, Qiong
    Dong, Dezun
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING (ICA3PP 2019), PT I, 2020, 11944 : 3 - 16
  • [33] Efficient and Low-Latency Systolic Array Architecture for Full Searches in Block-Matching Motion Estimation
    张武健
    邱晓海
    周润德
    陈弘毅
    TsinghuaScienceandTechnology, 2001, (04) : 361 - 368
  • [34] Efficient and low-latency pixel data transmission module for adaptive optics wavefront processor based on field-programmable gate array
    Yang, Haifeng
    Xia, Yunxia
    Zhang, Haotian
    Li, Mei
    Rao, Changhui
    OPTICAL ENGINEERING, 2015, 54 (06)
  • [35] Vector-Vector-Matrix Architecture: A Novel Hardware-Aware Framework for Low-Latency Inference in NLP Applications
    Khoury, Matthew
    Dangovski, Rumen
    Ou, Longwu
    Nakov, Preslav
    Shen, Yichen
    Jing, Li
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 7975 - 7984
  • [36] NOMA-Based Highly-Efficient Low-Latency HARQ Method for URLLC
    Kobayashi, Ryota
    Yuda, Yasuaki
    Higuchi, Kenichi
    2021 IEEE 94TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-FALL), 2021,
  • [37] Hardware TCP Offload Engine based on 10-Gbps Ethernet for Low-Latency Network Communication
    Ding, Li
    Kang, Ping
    Yin, Wenbo
    Wang, Linli
    2016 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), 2016, : 269 - 272
  • [38] Low-latency and energy-efficient scheduling in fog-based IoT applications
    Rahbari, Dadmehr
    Nickray, Mohsen
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (02) : 1406 - 1427
  • [39] Cluster-based Architecture Relying on Optical Integrated Networks with the Provision Of a Low-latency Arbiter
    de Magalhaes, Felipe Gohring
    Hessel, Fabiano
    Liboiron-Ladouceur, Odile
    Nicolescu, Gabriela
    2016 29TH SYMPOSIUM ON INTEGRATED CIRCUITS AND SYSTEMS DESIGN (SBCCI), 2016,
  • [40] A Low-Latency Syndrome-based Deep Learning Decoder Architecture and its FPGA Implementation
    Kavvousanos, E.
    Paliouras, V
    2022 11TH INTERNATIONAL CONFERENCE ON MODERN CIRCUITS AND SYSTEMS TECHNOLOGIES (MOCAST), 2022,