    arm64: filmgrain: Add NEON implementation of the fguv function · 90bcb331
    Martin Storsjö authored
    Relative speedup over C code:
                                  Cortex A53    A72    A73   Apple M1
    fguv_32x32xn_8bpc_420_csfl0_neon:   4.51   2.87   3.88   6.51
    fguv_32x32xn_8bpc_420_csfl1_neon:   3.74   2.96   2.96   3.49
    fguv_32x32xn_8bpc_422_csfl0_neon:   4.49   3.18   4.07   5.00
    fguv_32x32xn_8bpc_422_csfl1_neon:   3.74   3.03   3.04   2.67
    fguv_32x32xn_8bpc_444_csfl0_neon:   6.68   4.24   5.66   5.02
    fguv_32x32xn_8bpc_444_csfl1_neon:   5.40   3.69   4.22   3.61