Efficient Low-Latency Hardware Architecture for Module-Lattice-Based Digital Signature Standard

被引:1
|
作者
Truong, Quang Dang [1 ]
Duong, Phap Ngoc [1 ,2 ]
Lee, Hanho [1 ]
机构
[1] Inha Univ, Dept Elect & Comp Engn, Incheon 22212, South Korea
[2] Univ Danang, Vietnam Korea Univ Informat & Commun Technol, Fac Comp Engn & Elect, Da Nang 50000, Vietnam
关键词
Computer architecture; Digital signatures; Standards; NIST; Arithmetic; Low latency communication; Quantum computing; Cryptography; Lattices; Public key cryptography; Field programmable gate arrays; Security management; Hardware security; Post-quantum cryptography (PQC); module-lattice-based digital signature standard (ML-DSA); crystals-Dilithium; lattice-based cryptography (LBC); number theoretic transform (NTT);
D O I
10.1109/ACCESS.2024.3370470
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The rapid advancement of powerful quantum computers poses a significant security risk to current public-key cryptosystems, which heavily rely on the computational complexity of problems such as discrete logarithms and integer factorization. As a result, CRYSTALS-Dilithium, a lattice-based digital signature scheme with the potential to be an alternative algorithm that can withstand both quantum and classical attacks, has been standardized as ML-DSA after NIST Post-Quantum Cryptography competition. While prior studies have proposed hardware designs to accelerate this cryptosystem, there is room for further optimization in the tradeoff between performance and hardware consumption. This paper addresses these limitations by presenting an efficient low-latency hardware architecture for ML-DSA, leveraging optimized timing schedules for its three main algorithms. The hardware implementation enables runtime switching main operations in ML-DSA with various security levels. We design flexible arithmetic and hash modules tailored for ML-DSA, the most time-consuming submodules and key determinants of the scheme implementation. Combined with efficient operation scheduling to maximize the utilized time of submodules, our design achieves the best latency among FPGA-based implementations, outperforming stateof-the-art works by 1.27 similar to 2.58x in terms of the area-time tradeoff metric. Therefore, the proposed hardware architecture demonstrates its practical applicability for digital signature cryptosystems in post-quantum era.
引用
收藏
页码:32395 / 32407
页数:13
相关论文
共 50 条
  • [21] A Scalable Low-Latency FPGA Architecture for Spin Qubit Control through Direct Digital Synthesis
    Toubeix, Mathieu
    Guthmuller, Eric
    Evans, Adrian
    Meunier, Tristan
    2024 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2024,
  • [22] MELOPPR: Software/Hardware Co-design for Memory-efficient Low-latency Personalized PageRank
    Li, Lixiang
    Chen, Yao
    Zirnheld, Zacharie
    Li, Pan
    Hao, Cong
    2021 58TH ACM/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2021, : 601 - 606
  • [23] An Efficient Hardware Design for a Low-Latency Traffic Flow Prediction System Using an Online Neural Network
    Hanafy, Yasmin Adel
    Mashaly, Maggie
    Abd El Ghany, Mohamed A.
    ELECTRONICS, 2021, 10 (16)
  • [24] Low-Latency Area-Efficient Decoding Architecture for Shortened Reed-Solomon Codes
    Yoo, Hoyoung
    Lee, Youngjoo
    Park, In-Cheol
    2012 INTERNATIONAL SOC DESIGN CONFERENCE (ISOCC), 2012, : 223 - 226
  • [25] A low-latency and bandwidth-efficient distributed optical burst switching architecture for metro ring
    Fumagalli, A
    Krishnamoorthy, P
    2003 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, VOLS 1-5: NEW FRONTIERS IN TELECOMMUNICATIONS, 2003, : 1340 - 1344
  • [26] FPGA-based Low-Latency Digital Servo for Optical Physics Experiments
    Pomponio, Marco
    Hati, Archita
    Nelson, Craig
    2020 JOINT CONFERENCE OF THE IEEE INTERNATIONAL FREQUENCY CONTROL SYMPOSIUM AND INTERNATIONAL SYMPOSIUM ON APPLICATIONS OF FERROELECTRICS (IFCS-ISAF), 2020,
  • [27] Energy Efficient and Low-latency Data Collection in TDMA-based WSN
    Zhu, Wenxiang
    Xu, Pingping
    You, Xingmiao
    Yang, Chencheng
    Bui, ThiOanh
    2015 IEEE 26TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR, AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2015, : 2020 - 2025
  • [28] Area-Optimized Low-Latency Approximate Multipliers for FPGA-based Hardware Accelerators
    Ullah, Salim
    Rehman, Semeen
    Prabakaran, Bharath Srinivas
    Kriebel, Florian
    Hanif, Muhammad Abdullah
    Shafique, Muhammad
    Kumar, Akash
    2018 55TH ACM/ESDA/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2018,
  • [29] STELLAR: Energy-Efficient and Low-Latency SNN Algorithm and Hardware Co-design with Spatiotemporal Computation
    Mao, Ruixin
    Tang, Lin
    Yuan, Xingyu
    Liu, Ye
    Zhou, Jun
    2024 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, HPCA 2024, 2024, : 172 - 185
  • [30] Hardware-Algorithm Codesigned Low-Latency and Resource-Efficient OMP Accelerator for DOA Estimation on FPGA
    Jiang, Ruichang
    Ye, Wenbin
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2025, 33 (02) : 421 - 434