• Liwei Wang's avatar
    Add SSSE3 implementation for the 16x32,32x16 and 32x32 blocks in itx · bd12b1ec
    Liwei Wang authored
    Cycle times:
    inv_txfm_add_16x32_dct_dct_0_8bpc_c: 2464.6
    inv_txfm_add_16x32_dct_dct_0_8bpc_ssse3: 121.6
    inv_txfm_add_16x32_dct_dct_1_8bpc_c: 24751.6
    inv_txfm_add_16x32_dct_dct_1_8bpc_ssse3: 1101.9
    inv_txfm_add_16x32_dct_dct_2_8bpc_c: 24377.0
    inv_txfm_add_16x32_dct_dct_2_8bpc_ssse3: 1117.2
    inv_txfm_add_16x32_dct_dct_3_8bpc_c: 24155.6
    inv_txfm_add_16x32_dct_dct_3_8bpc_ssse3: 2349.3
    inv_txfm_add_16x32_dct_dct_4_8bpc_c: 24175.6
    inv_txfm_add_16x32_dct_dct_4_8bpc_ssse3: 1642.0
    inv_txfm_add_16x32_identity_identity_0_8bpc_c: 10304.7
    inv_txfm_add_16x32_identity_identity_0_8bpc_ssse3: 137.7
    inv_txfm_add_16x32_identity_identity_1_8bpc_c: 10341.6
    inv_txfm_add_16x32_identity_identity_1_8bpc_ssse3: 137.9
    inv_txfm_add_16x32_identity_identity_2_8bpc_c: 10299.9
    inv_txfm_add_16x32_identity_identity_2_8bpc_ssse3: 253.9
    inv_txfm_add_16x32_identity_identity_3_8bpc_c: 10331.4
    inv_txfm_add_16x32_identity_identity_3_8bpc_ssse3: 369.7
    inv_txfm_add_16x32_identity_identity_4_8bpc_c: 10360.4
    inv_txfm_add_16x32_identity_identity_4_8bpc_ssse3: 484.0
    inv_txfm_add_32x16_dct_dct_0_8bpc_c: 2288.4
    inv_txfm_add_32x16_dct_dct_0_8bpc_ssse3: 142.3
    inv_txfm_add_32x16_dct_dct_1_8bpc_c: 23819.9
    inv_txfm_add_32x16_dct_dct_1_8bpc_ssse3: 1740.1
    inv_txfm_add_32x16_dct_dct_2_8bpc_c: 23755.8
    inv_txfm_add_32x16_dct_dct_2_8bpc_ssse3: 1641.4
    inv_txfm_add_32x16_dct_dct_3_8bpc_c: 23839.9
    inv_txfm_add_32x16_dct_dct_3_8bpc_ssse3: 1559.0
    inv_txfm_add_32x16_dct_dct_4_8bpc_c: 23757.7
    inv_txfm_add_32x16_dct_dct_4_8bpc_ssse3: 1579.0
    inv_txfm_add_32x16_identity_identity_0_8bpc_c: 10381.7
    inv_txfm_add_32x16_identity_identity_0_8bpc_ssse3: 126.3
    inv_txfm_add_32x16_identity_identity_1_8bpc_c: 10402.5
    inv_txfm_add_32x16_identity_identity_1_8bpc_ssse3: 126.5
    inv_txfm_add_32x16_identity_identity_2_8bpc_c: 10429.2
    inv_txfm_add_32x16_identity_identity_2_8bpc_ssse3: 244.9
    inv_txfm_add_32x16_identity_identity_3_8bpc_c: 10382.0
    inv_txfm_add_32x16_identity_identity_3_8bpc_ssse3: 491.0
    inv_txfm_add_32x16_identity_identity_4_8bpc_c: 10381.0
    inv_txfm_add_32x16_identity_identity_4_8bpc_ssse3: 468.0
    inv_txfm_add_32x32_dct_dct_0_8bpc_c: 4168.2
    inv_txfm_add_32x32_dct_dct_0_8bpc_ssse3: 204.0
    inv_txfm_add_32x32_dct_dct_1_8bpc_c: 46306.2
    inv_txfm_add_32x32_dct_dct_1_8bpc_ssse3: 2216.0
    inv_txfm_add_32x32_dct_dct_2_8bpc_c: 46300.2
    inv_txfm_add_32x32_dct_dct_2_8bpc_ssse3: 2194.2
    inv_txfm_add_32x32_dct_dct_3_8bpc_c: 46350.1
    inv_txfm_add_32x32_dct_dct_3_8bpc_ssse3: 3484.4
    inv_txfm_add_32x32_dct_dct_4_8bpc_c: 46318.1
    inv_txfm_add_32x32_dct_dct_4_8bpc_ssse3: 3440.9
    inv_txfm_add_32x32_identity_identity_0_8bpc_c: 14663.1
    inv_txfm_add_32x32_identity_identity_0_8bpc_ssse3: 179.0
    inv_txfm_add_32x32_identity_identity_1_8bpc_c: 14737.0
    inv_txfm_add_32x32_identity_identity_1_8bpc_ssse3: 179.2
    inv_txfm_add_32x32_identity_identity_2_8bpc_c: 14640.4
    inv_txfm_add_32x32_identity_identity_2_8bpc_ssse3: 179.1
    inv_txfm_add_32x32_identity_identity_3_8bpc_c: 14638.5
    inv_txfm_add_32x32_identity_identity_3_8bpc_ssse3: 663.8
    inv_txfm_add_32x32_identity_identity_4_8bpc_c: 14635.6
    inv_txfm_add_32x32_identity_identity_4_8bpc_ssse3: 663.9
    bd12b1ec
Name
Last commit
Last update
doc Loading commit data...
include Loading commit data...
snap Loading commit data...
src Loading commit data...
tests Loading commit data...
tools Loading commit data...
.gitignore Loading commit data...
.gitlab-ci.yml Loading commit data...
CONTRIBUTING.md Loading commit data...
COPYING Loading commit data...
NEWS Loading commit data...
README.md Loading commit data...
THANKS.md Loading commit data...
meson.build Loading commit data...
meson_options.txt Loading commit data...