Skip to content

Commit 120c5be

Browse files
committed
Add a note about scalar loads
1 parent fb3f9d5 commit 120c5be

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

src/avx2/ycbcr.rs

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -22,6 +22,7 @@ macro_rules! ycbcr_image_avx2 {
2222
#[target_feature(enable = "avx2")]
2323
fn fill_buffers_avx2(&self, y: u16, buffers: &mut [Vec<u8>; 4]) {
2424
// TODO: this compiles to many separate scalar loads and could be optimized further.
25+
// But the gains are no more than 3% end to end and it doesn't seem to be worth the complexity.
2526
#[inline]
2627
#[target_feature(enable = "avx2")]
2728
fn load3(data: &[u8]) -> __m256i {

0 commit comments

Comments
 (0)