Open
Labels
- optimization: Code optimization and algorithmic improvements
- performance: Speed improvements and optimization work
- priority: medium: Important improvements that enhance functionality or performance
- simd: SIMD optimizations (AVX2, AVX-512, NEON)
Description
AVX2 is working great (42% speedup on ML-DSA), but newer CPUs support AVX-512, which should give a further boost.
Targets for AVX-512:
- ML-DSA (all variants)
- Hash functions: SHA-2, SHA-3, BLAKE2/3
- Elliptic curves: field arithmetic, point multiplication
- AES (if there are gains to be had)
Requirements:
- Runtime CPU detection (can't assume everyone has AVX-512)
- Benchmark against AVX2 baseline
- Ensure graceful fallback to AVX2/scalar
- Document which CPUs benefit and by how much
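The runtime detection and fallback requirements above could look roughly like the sketch below. It assumes a GCC or Clang build on x86 and uses the compilers' `__builtin_cpu_supports` builtin; on any other toolchain or architecture it simply reports scalar. The names `simd_backend`, `detect_backend`, `resolve_add`, and `add_scalar` are hypothetical and not taken from this codebase, and the "kernel" is a placeholder word-wise add rather than real ML-DSA or hash code.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical backend tiers, ordered from least to most capable. */
typedef enum {
    BACKEND_SCALAR = 0,
    BACKEND_AVX2   = 1,
    BACKEND_AVX512 = 2
} simd_backend;

/* Pick the best backend the running CPU reports.
 * Falls through to scalar when neither feature is present or when the
 * compiler/architecture does not provide the detection builtins. */
static simd_backend detect_backend(void)
{
#if (defined(__GNUC__) || defined(__clang__)) && \
    (defined(__x86_64__) || defined(__i386__))
    __builtin_cpu_init();
    if (__builtin_cpu_supports("avx512f")) {
        return BACKEND_AVX512;
    }
    if (__builtin_cpu_supports("avx2")) {
        return BACKEND_AVX2;
    }
#endif
    return BACKEND_SCALAR;
}

/* Signature shared by every implementation of the placeholder kernel. */
typedef void (*add_fn)(uint32_t *dst, const uint32_t *a,
                       const uint32_t *b, size_t n);

/* Portable reference implementation; also the final fallback. */
static void add_scalar(uint32_t *dst, const uint32_t *a,
                       const uint32_t *b, size_t n)
{
    assert(dst != NULL && a != NULL && b != NULL);
    for (size_t i = 0; i < n; i++) {
        dst[i] = a[i] + b[i];
    }
}

/* Map a backend to a kernel. In a real build the AVX2 and AVX-512 cases
 * would return intrinsics-based functions compiled in their own
 * translation units (e.g. with -mavx2 / -mavx512f). Here they return the
 * scalar code so the sketch runs on any machine. */
static add_fn resolve_add(simd_backend backend)
{
    switch (backend) {
    case BACKEND_AVX512: return add_scalar; /* placeholder for an AVX-512 kernel */
    case BACKEND_AVX2:   return add_scalar; /* placeholder for an AVX2 kernel */
    case BACKEND_SCALAR: /* fall through */
    default:             return add_scalar;
    }
}
```

A caller would run `detect_backend()` once at startup, cache the pointer returned by `resolve_add`, and call through it afterwards. Keeping every tier behind the same function signature also makes the AVX2 baseline comparison straightforward, since a benchmark can request each backend explicitly instead of relying on detection.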