fastlanes: bit-packed compare-constant fast path + bitpack_constant kernel #8013
1 benchmark regressed
⚠️ Unknown Walltime execution environment detected
Using the Walltime instrument on standard Hosted Runners will lead to inconsistent data.
For the most accurate results, we recommend using CodSpeed Macro Runners: bare-metal machines fine-tuned for performance measurement consistency.
⚠️ Different runtime environments detected
Some benchmarks with significant performance changes were compared across different runtime environments,
which may affect the accuracy of the results.
⚡ 12 improved benchmarks
❌ 1 regressed benchmark
✅ 1224 untouched benchmarks
🆕 8 new benchmarks
Warning
Please fix the performance issues or acknowledge them on CodSpeed.
Performance Changes
| Mode | Benchmark | BASE |
HEAD |
Efficiency | |
|---|---|---|---|---|---|
| ⚡ | Simulation | chunked_varbinview_canonical_into[(1000, 10)] |
197.9 µs | 162 µs | +22.19% |
| ⚡ | Simulation | chunked_varbinview_into_canonical[(100, 100)] |
358.4 µs | 323.5 µs | +10.78% |
| ⚡ | Simulation | chunked_varbinview_into_canonical[(1000, 10)] |
211.2 µs | 175.8 µs | +20.11% |
| ⚡ | Simulation | chunked_varbinview_opt_canonical_into[(1000, 10)] |
224.8 µs | 188.6 µs | +19.23% |
| 🆕 | Simulation | full_encode[4, 1024] |
N/A | 19.2 µs | N/A |
| 🆕 | Simulation | fast_encode[4, 65536] |
N/A | 30.5 µs | N/A |
| 🆕 | Simulation | full_encode[16, 1024] |
N/A | 17.3 µs | N/A |
| 🆕 | Simulation | full_encode[16, 65536] |
N/A | 358 µs | N/A |
| 🆕 | Simulation | full_encode[4, 65536] |
N/A | 313.6 µs | N/A |
| ⚡ | Simulation | fast_eq_out_of_range[4, 1024] |
67 µs | 26.9 µs | ×2.5 |
| ❌ | Simulation | baseline_lt[4, 1024] |
64.1 µs | 79 µs | -18.86% |
| ⚡ | Simulation | fast_eq_out_of_range[16, 1024] |
67.7 µs | 26.8 µs | ×2.5 |
| ⚡ | Simulation | fast_eq_out_of_range[4, 65536] |
246 µs | 35.2 µs | ×7 |
| ⚡ | Simulation | fast_lt_out_of_range[4, 1024] |
87.5 µs | 32.8 µs | ×2.7 |
| 🆕 | Simulation | fast_encode[16, 65536] |
N/A | 81.5 µs | N/A |
| ⚡ | Simulation | fast_lt_out_of_range[16, 1024] |
67.8 µs | 25.7 µs | ×2.6 |
| ⚡ | Simulation | fast_eq_out_of_range[16, 65536] |
291.1 µs | 35.6 µs | ×8.2 |
| ⚡ | Simulation | fast_lt_out_of_range[4, 65536] |
262 µs | 35.1 µs | ×7.5 |
| 🆕 | Simulation | fast_encode[4, 1024] |
N/A | 11.6 µs | N/A |
| ⚡ | Simulation | fast_lt_out_of_range[16, 65536] |
306.3 µs | 35.2 µs | ×8.7 |
| ... | ... | ... | ... | ... | ... |
ℹ️ Only the first 20 benchmarks are displayed. Go to the app to view all benchmarks.
Tip
Investigate this regression by commenting @codspeedbot fix this regression on this PR, or directly use the CodSpeed MCP with your agent.
Comparing claude/bitpack-compare-speedup-KGPS3 (3b1b8cf) with develop (7b47788)