Efficient Additions and Montgomery Reductions of Large Integers for SIMD

08/31/2023
by   Pengchang Ren, et al.
0

This paper presents efficient algorithms, designed to leverage SIMD for performing Montgomery reductions and additions on integers larger than 512 bits. The existing algorithms encounter inefficiencies when parallelized using SIMD due to extensive dependencies in both operations, particularly noticeable in costly operations like ARM's SVE. To mitigate this problem, a novel addition algorithm is introduced that simulates the addition of large integers using a smaller addition, quickly producing the same set of carries. These carries are then utilized to perform parallel additions on large integers. For Montgomery reductions, serial multiplications are replaced with precomputations that can be effectively calculated using SIMD extensions. Experimental evidence demonstrates that these proposed algorithms substantially enhance the performance of state-of-the-art implementations of several post-quantum cryptography algorithms. Notably, they deliver a 30 CTIDH implementation, an 11 AVX-512 processors, and a 7 for SIKEp503 on A64FX.

READ FULL TEXT

page 1

page 13

research
05/10/2018

The Hidden Subgroup Problem and Post-quantum Group-based Cryptography

In this paper we discuss the Hidden Subgroup Problem (HSP) in relation t...
research
04/27/2018

On Basing One-way Permutations on NP-hard Problems under Quantum Reductions

A fundamental pursuit in complexity theory concerns reducing worst-case ...
research
11/19/2017

Verification of PCP-Related Computational Reductions in Coq

We formally verify several computational reductions concerning the Post ...
research
08/06/2020

Design of Reconfigurable Multi-Operand Adder for Massively Parallel Processing

The paper presents a systematic study and implementation of a reconfigur...
research
01/18/2018

Scheduling and Tiling Reductions on Realistic Machines

Computations, where the number of results is much smaller than the input...
research
02/17/2022

MeNTT: A Compact and Efficient Processing-in-Memory Number Theoretic Transform (NTT) Accelerator

Lattice-based cryptography (LBC) exploiting Learning with Errors (LWE) p...

Please sign up or login with your details

Forgot password? Click here to reset