-
The existing functions could easily be used by just calling them twice - this would give the following cycle numbers from checkasm: var2_8x8_c: 4110 var2_8x8_neon: 1505 var2_8x16_c: 8019 var2_8x16_neon: 2545 However, by merging both passes into the same function, we get the following speedup: var2_8x8_neon: 1205 var2_8x16_neon: 2327
98e9543b