Automatic Library Generation for Modular Polynomial Multiplication

09/05/2016
by Lingchuan Meng, et al.

Polynomial multiplication is a key algorithm underlying computer algebra systems (CAS), and its efficient implementation is crucial for CAS performance. In this paper we design and implement algorithms for polynomial multiplication using approaches based on the fast Fourier transform (FFT) and the truncated Fourier transform (TFT). We improve on the state of the art in both theoretical and practical performance. The library generation system is extended and used to automatically generate and tune a polynomial multiplication library that is optimized for the memory hierarchy, vectorization, and multi-threading, using both new and existing algorithms. Performance tuning is aided by automation: many code variants are generated, and intelligent search is used to find the "best" implementation on a given architecture. The performance of the autotuned implementations is comparable to, and in some cases better than, that of the best hand-tuned code.
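To make the FFT-based approach concrete, the following is a minimal sketch of modular polynomial multiplication using a radix-2 number-theoretic transform (the FFT over a prime field). It is not the paper's generated code and says nothing about its tuning. The modulus `P = 998244353`, the primitive root `G = 3`, and the function names `ntt` and `poly_mul_mod` are assumptions chosen for illustration only.

```python
# Illustrative sketch only: the modulus, root, and function names are assumptions,
# not taken from the paper or its generated library.
P = 998244353  # prime equal to 119 * 2**23 + 1, so it supports power-of-two transforms
G = 3          # a primitive root modulo P


def ntt(a, invert=False):
    """In-place iterative radix-2 transform of list a (length a power of two) modulo P."""
    n = len(a)
    # Reorder the input into bit-reversed index order.
    j = 0
    for i in range(1, n):
        bit = n >> 1
        while j & bit:
            j ^= bit
            bit >>= 1
        j ^= bit
        if i < j:
            a[i], a[j] = a[j], a[i]
    # Butterfly passes over blocks of size 2, 4, ..., n.
    length = 2
    while length <= n:
        w_len = pow(G, (P - 1) // length, P)  # primitive length-th root of unity
        if invert:
            w_len = pow(w_len, P - 2, P)      # modular inverse via Fermat's little theorem
        half = length // 2
        for start in range(0, n, length):
            w = 1
            for k in range(start, start + half):
                u = a[k]
                v = a[k + half] * w % P
                a[k] = (u + v) % P
                a[k + half] = (u - v) % P
                w = w * w_len % P
        length <<= 1
    if invert:
        n_inv = pow(n, P - 2, P)
        for i in range(n):
            a[i] = a[i] * n_inv % P


def poly_mul_mod(f, g):
    """Multiply coefficient lists f and g (lowest degree first) modulo P."""
    if not f or not g:
        return []
    result_len = len(f) + len(g) - 1
    n = 1
    while n < result_len:
        n <<= 1  # pad to the next power of two
    fa = [c % P for c in f] + [0] * (n - len(f))
    ga = [c % P for c in g] + [0] * (n - len(g))
    ntt(fa)
    ntt(ga)
    prod = [x * y % P for x, y in zip(fa, ga)]  # pointwise product in the transform domain
    ntt(prod, invert=True)
    return prod[:result_len]
```

For example, `poly_mul_mod([1, 2, 3], [4, 5])` multiplies 1 + 2x + 3x^2 by 4 + 5x. The padding to the next power of two in `poly_mul_mod` is the kind of overhead that truncated transforms are intended to reduce; this sketch does not implement a TFT.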


research
09/04/2016

Automatic Generation of Vectorized Montgomery Algorithm

Modular arithmetic is widely used in cryptography and symbolic computatio...
research
05/10/2019

Implementations of efficient univariate polynomial matrix algorithms and application to bivariate resultants

Complexity bounds for many problems on matrices with univariate polynomi...
research
11/05/2018

Putting Fürer Algorithm into Practice with the BPAS Library

Fast algorithms for integer and polynomial multiplication play an import...
research
04/05/2023

FourierPIM: High-Throughput In-Memory Fast Fourier Transform and Polynomial Multiplication

The Discrete Fourier Transform (DFT) is essential for various applicatio...
research
11/23/2020

Speeding up decimal multiplication

Decimal multiplication is the task of multiplying two numbers in base 10...
research
11/14/2022

FLUPS – a flexible and performant massively parallel Fourier transform library

Massively parallel Fourier transforms are widely used in computational s...
research
03/04/2016

Optimized Polynomial Evaluation with Semantic Annotations

In this paper we discuss how semantic annotations can be used to introdu...
