2000 character limit reached
Automatic Generation of Vectorized Montgomery Algorithm (1609.00999v1)
Published 4 Sep 2016 in cs.MS
Abstract: Modular arithmetic is widely used in crytography and symbolic computation. This paper presents a vectorized Montgomery algorithm for modular multiplication, the key to fast modular arithmetic, that fully utilizes the SIMD instructions. We further show how the vectorized algorithm can be automatically generated by the {\SPIRAL} system, as part of the effort for automatic generation of a modular polynomial multiplication library.