    arm/mc: Add 8 bit neon asm for avg, w_avg and mask · 515e2667
    Martin Storsjö authored
    checkasm --bench numbers from a Snapdragon 835:
    nop: 23.0
    avg_w4_8bpc_c: 385.0
    avg_w4_8bpc_neon: 34.0
    avg_w8_8bpc_c: 590.5
    avg_w8_8bpc_neon: 65.5
    avg_w16_8bpc_c: 1304.4
    avg_w16_8bpc_neon: 161.3
    avg_w32_8bpc_c: 4098.4
    avg_w32_8bpc_neon: 589.2
    avg_w64_8bpc_c: 8405.0
    avg_w64_8bpc_neon: 1367.1
    avg_w128_8bpc_c: 19667.9
    avg_w128_8bpc_neon: 3409.0
    w_avg_w4_8bpc_c: 453.8
    w_avg_w4_8bpc_neon: 50.0
    w_avg_w8_8bpc_c: 749.0
    w_avg_w8_8bpc_neon: 105.7
    w_avg_w16_8bpc_c: 1851.2
    w_avg_w16_8bpc_neon: 283.7
    w_avg_w32_8bpc_c: 5991.5
    w_avg_w32_8bpc_neon: 1080.9
    w_avg_w64_8bpc_c: 12763.5
    w_avg_w64_8bpc_neon: 2544.4
    w_avg_w128_8bpc_c: 30311.3
    w_avg_w128_8bpc_neon: 6350.5
    mask_w4_8bpc_c: 492.9
    mask_w4_8bpc_neon: 57.7
    mask_w8_8bpc_c: 1108.5
    mask_w8_8bpc_neon: 123.0
    mask_w16_8bpc_c: 2880.3
    mask_w16_8bpc_neon: 349.2
    mask_w32_8bpc_c: 8996.4
    mask_w32_8bpc_neon: 1368.1
    mask_w64_8bpc_c: 19570.3
    mask_w64_8bpc_neon: 3263.5
    mask_w128_8bpc_c: 46757.4
    mask_w128_8bpc_neon: 8743.1
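The listing above gives raw checkasm timings but no ratios. As a rough illustration, the sketch below computes the C-to-NEON ratio for the smallest and largest block width of each function, using numbers copied from the listing. The names `BENCH`, `speedup` and `SPEEDUPS` are made up for this example. The source does not say whether the `nop` overhead is already excluded from these figures, so the results are plain ratios of the reported values.

```python
# Timings copied from the checkasm listing above: (C reference, NEON).
# Only the smallest and largest widths are included to keep the sketch short.
BENCH = {
    "avg_w4": (385.0, 34.0),
    "avg_w128": (19667.9, 3409.0),
    "w_avg_w4": (453.8, 50.0),
    "w_avg_w128": (30311.3, 6350.5),
    "mask_w4": (492.9, 57.7),
    "mask_w128": (46757.4, 8743.1),
}


def speedup(c_time: float, neon_time: float) -> float:
    """Return how many times faster the NEON version is than the C version."""
    return c_time / neon_time


# Ratio per function, rounded to one decimal place.
SPEEDUPS = {name: round(speedup(c, n), 1) for name, (c, n) in BENCH.items()}

for name, ratio in SPEEDUPS.items():
    print(f"{name}: {ratio}x")
```

On these numbers the ratio is largest for the narrow blocks (about 11x for avg_w4) and settles to roughly 5x to 6x at width 128.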
Name
doc
include
src
tests
tools
.gitignore
.gitlab-ci.yml
CONTRIBUTING.md
COPYING
NEWS
README.md
THANKS.md
meson.build
meson_options.txt