• Martin Storsjö's avatar
    arm64: mc: Optimize the mul_mla_8_* macros for Cortex A53 · fc5a3728
    Martin Storsjö authored
    Before:                      Cortex A53   Snapdragon 835
    mc_8tap_regular_w2_v_8bpc_neon:   155.1   131.8
    mc_8tap_regular_w4_v_8bpc_neon:   199.6   148.1
    mc_8tap_regular_w8_v_8bpc_neon:   286.2   225.5
    After:
    mc_8tap_regular_w2_v_8bpc_neon:   134.1   129.5
    mc_8tap_regular_w4_v_8bpc_neon:   157.6   146.5
    mc_8tap_regular_w8_v_8bpc_neon:   208.0   225.0
    fc5a3728
mc.S 83.4 KB