-
This effectively reverts a0692eb8 for other architectures. The order that is beneficial for x86 SIMD is not beneficial for other architectures. For a NEON implementation of the warp filter, reordering the filter coefficients back in the right order took 1/4 of the filter runtime.
8abcf5dc