Commit 0d18b15a authored by Martin Storsjö's avatar Martin Storsjö

arm64: cdef: NEON optimized cdef filter function

Speedup vs C code:     Cortex A53    A72    A73
cdef_filter_4x4_8bpc_neon:   4.62   4.48   4.76
cdef_filter_4x8_8bpc_neon:   4.82   4.80   5.08
cdef_filter_8x8_8bpc_neon:   5.29   5.33   5.79
parent 109ee513
Pipeline #4493 passed with stages
in 7 minutes and 21 seconds