Rust has an experimental portable SIMD abstraction that could be particularly useful for speeding up some of the vector modular operations and NTTs in a generic way, i.e., without specializing for AVX2 or Neon. I did a minimal investigation in the branch `simd-2` and observed some speedup in a virtual machine with an Intel CPU when enabling `avx2` and `avx512f` in `RUSTFLAGS`. Such a feature may require a lot of care to handle alignment properly, so I am creating this issue to centralize the discussion about SIMD.
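To make the discussion concrete, here is a minimal sketch of the kind of vector modular operation in question. It is not the code from the `simd-2` branch and it deliberately does not use the nightly-only portable SIMD API: it is plain stable Rust, written lane-wise and branch-free so that each fixed-size chunk has the shape a portable SIMD vector would take. The names `LANES`, `add_mod_lane`, and `add_mod_vec` are hypothetical, and the sketch assumes that inputs are already reduced modulo `p` and that `p < 2^63`.

```rust
/// Hypothetical lane count per chunk: 8 x u64 = 512 bits, the width of one AVX-512 register.
const LANES: usize = 8;

/// Branch-free modular addition of a single lane.
/// Assumes `a < p`, `b < p`, and `p < 2^63`, so `a + b` cannot overflow.
#[inline(always)]
fn add_mod_lane(a: u64, b: u64, p: u64) -> u64 {
    let s = a + b;
    // All-ones mask when s >= p, all-zeros otherwise (no data-dependent branch).
    let mask = ((s >= p) as u64).wrapping_neg();
    s - (p & mask)
}

/// Computes `a[i] = (a[i] + b[i]) mod p` in place.
/// Full chunks of `LANES` elements go through a fixed-width inner loop;
/// the remaining elements are handled by a scalar tail.
pub fn add_mod_vec(a: &mut [u64], b: &[u64], p: u64) {
    assert_eq!(a.len(), b.len(), "operands must have the same length");
    assert!(p > 0 && p < (1u64 << 63), "modulus must be below 2^63");

    let mut a_chunks = a.chunks_exact_mut(LANES);
    let mut b_chunks = b.chunks_exact(LANES);

    // Fixed-width chunks: this loop body is what a SIMD vector type would replace.
    for (ca, cb) in (&mut a_chunks).zip(&mut b_chunks) {
        for i in 0..LANES {
            ca[i] = add_mod_lane(ca[i], cb[i], p);
        }
    }

    // Scalar tail for lengths that are not a multiple of LANES.
    let a_tail = a_chunks.into_remainder();
    let b_tail = b_chunks.remainder();
    for (x, y) in a_tail.iter_mut().zip(b_tail) {
        *x = add_mod_lane(*x, *y, p);
    }
}
```

Whether the compiler actually vectorizes the inner loop depends on the enabled target features. One way to enable them, which may differ from what was used for the measurements above, is `RUSTFLAGS="-C target-feature=+avx2,+avx512f"`. Note that `chunks_exact` makes no alignment guarantee, so this sketch sidesteps the alignment question rather than answering it.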