Commit 5c4728d8 authored by Martin Storsjö's avatar Martin Storsjö Committed by Henrik Gramner

aarch64: Fix integral_init4/8h_neon

The stride is the number of uint16_t elements and thus needs
to be shifted.

This issue had slipped unnoticed since checkasm didn't actually
verify the output of these functions.
parent 67076513
......@@ -1403,7 +1403,7 @@ endfunc
.endm
function integral_init4h_neon, export=1
sub x3, x0, x2
sub x3, x0, x2, lsl #1
ld1 {v6.8b,v7.8b}, [x1], #16
1:
subs x2, x2, #16
......@@ -1438,7 +1438,7 @@ endfunc
.endm
function integral_init8h_neon, export=1
sub x3, x0, x2
sub x3, x0, x2, lsl #1
ld1 {v16.8b,v17.8b}, [x1], #16
1:
subs x2, x2, #16
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment