
Add SSSE3 implementation for the 8x8 blocks in itx

Liwei Wang requested to merge liwei/dav1d:itx_ssse3 into master

Cycle times:

inv_txfm_add_8x8_adst_adst_0_8bpc_c: 2165.6
inv_txfm_add_8x8_adst_adst_0_8bpc_ssse3: 194.5
inv_txfm_add_8x8_adst_adst_1_8bpc_c: 2158.3
inv_txfm_add_8x8_adst_adst_1_8bpc_ssse3: 194.7
inv_txfm_add_8x8_adst_dct_0_8bpc_c: 2241.0
inv_txfm_add_8x8_adst_dct_0_8bpc_ssse3: 165.1
inv_txfm_add_8x8_adst_dct_1_8bpc_c: 2242.6
inv_txfm_add_8x8_adst_dct_1_8bpc_ssse3: 164.2
inv_txfm_add_8x8_adst_flipadst_0_8bpc_c: 2178.2
inv_txfm_add_8x8_adst_flipadst_0_8bpc_ssse3: 194.4
inv_txfm_add_8x8_adst_flipadst_1_8bpc_c: 2183.0
inv_txfm_add_8x8_adst_flipadst_1_8bpc_ssse3: 194.2
inv_txfm_add_8x8_adst_identity_0_8bpc_c: 1592.1
inv_txfm_add_8x8_adst_identity_0_8bpc_ssse3: 125.2
inv_txfm_add_8x8_adst_identity_1_8bpc_c: 1597.7
inv_txfm_add_8x8_adst_identity_1_8bpc_ssse3: 126.3
inv_txfm_add_8x8_dct_adst_0_8bpc_c: 2214.1
inv_txfm_add_8x8_dct_adst_0_8bpc_ssse3: 162.0
inv_txfm_add_8x8_dct_adst_1_8bpc_c: 2221.5
inv_txfm_add_8x8_dct_adst_1_8bpc_ssse3: 161.9
inv_txfm_add_8x8_dct_dct_0_8bpc_c: 2247.8
inv_txfm_add_8x8_dct_dct_0_8bpc_ssse3: 34.0
inv_txfm_add_8x8_dct_dct_1_8bpc_c: 2243.1
inv_txfm_add_8x8_dct_dct_1_8bpc_ssse3: 133.7
inv_txfm_add_8x8_dct_flipadst_0_8bpc_c: 2255.1
inv_txfm_add_8x8_dct_flipadst_0_8bpc_ssse3: 161.2
inv_txfm_add_8x8_dct_flipadst_1_8bpc_c: 2244.6
inv_txfm_add_8x8_dct_flipadst_1_8bpc_ssse3: 161.8
inv_txfm_add_8x8_dct_identity_0_8bpc_c: 1632.3
inv_txfm_add_8x8_dct_identity_0_8bpc_ssse3: 41.3
inv_txfm_add_8x8_dct_identity_1_8bpc_c: 1629.6
inv_txfm_add_8x8_dct_identity_1_8bpc_ssse3: 97.7
inv_txfm_add_8x8_flipadst_adst_0_8bpc_c: 2185.6
inv_txfm_add_8x8_flipadst_adst_0_8bpc_ssse3: 191.0
inv_txfm_add_8x8_flipadst_adst_1_8bpc_c: 2165.7
inv_txfm_add_8x8_flipadst_adst_1_8bpc_ssse3: 191.6
inv_txfm_add_8x8_flipadst_dct_0_8bpc_c: 2246.4
inv_txfm_add_8x8_flipadst_dct_0_8bpc_ssse3: 162.8
inv_txfm_add_8x8_flipadst_dct_1_8bpc_c: 2252.1
inv_txfm_add_8x8_flipadst_dct_1_8bpc_ssse3: 163.9
inv_txfm_add_8x8_flipadst_flipadst_0_8bpc_c: 2180.9
inv_txfm_add_8x8_flipadst_flipadst_0_8bpc_ssse3: 196.3
inv_txfm_add_8x8_flipadst_flipadst_1_8bpc_c: 2192.2
inv_txfm_add_8x8_flipadst_flipadst_1_8bpc_ssse3: 194.5
inv_txfm_add_8x8_flipadst_identity_0_8bpc_c: 1600.9
inv_txfm_add_8x8_flipadst_identity_0_8bpc_ssse3: 126.6
inv_txfm_add_8x8_flipadst_identity_1_8bpc_c: 1600.5
inv_txfm_add_8x8_flipadst_identity_1_8bpc_ssse3: 126.4
inv_txfm_add_8x8_identity_adst_0_8bpc_c: 1558.0
inv_txfm_add_8x8_identity_adst_0_8bpc_ssse3: 120.7
inv_txfm_add_8x8_identity_adst_1_8bpc_c: 1556.7
inv_txfm_add_8x8_identity_adst_1_8bpc_ssse3: 121.0
inv_txfm_add_8x8_identity_dct_0_8bpc_c: 1600.8
inv_txfm_add_8x8_identity_dct_0_8bpc_ssse3: 37.9
inv_txfm_add_8x8_identity_dct_1_8bpc_c: 1599.5
inv_txfm_add_8x8_identity_dct_1_8bpc_ssse3: 90.3
inv_txfm_add_8x8_identity_flipadst_0_8bpc_c: 1584.9
inv_txfm_add_8x8_identity_flipadst_0_8bpc_ssse3: 120.2
inv_txfm_add_8x8_identity_flipadst_1_8bpc_c: 1584.3
inv_txfm_add_8x8_identity_flipadst_1_8bpc_ssse3: 120.5
inv_txfm_add_8x8_identity_identity_0_8bpc_c: 975.9
inv_txfm_add_8x8_identity_identity_0_8bpc_ssse3: 54.7
inv_txfm_add_8x8_identity_identity_1_8bpc_c: 975.7
inv_txfm_add_8x8_identity_identity_1_8bpc_ssse3: 54.7
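
To read the list above as speedups rather than raw cycle counts, each `_c` entry can be divided by its `_ssse3` counterpart. The following is a minimal sketch, not part of the merge request: the helper name `speedups` and the three-pair sample are illustrative assumptions, and the only input format assumed is the `name: cycles` layout shown above.

```python
def speedups(lines):
    """Return {base_name: c_cycles / ssse3_cycles} for every C/SSSE3 pair.

    Hypothetical helper: expects lines of the form "<name>_c: <cycles>" and
    "<name>_ssse3: <cycles>", as in the listing above.
    """
    cycles = {}
    for line in lines:
        name, sep, value = line.partition(":")
        if not sep:
            continue  # no "name: value" separator on this line
        try:
            cycles[name.strip()] = float(value)
        except ValueError:
            continue  # header or other non-numeric line

    ratios = {}
    for name, c_cycles in cycles.items():
        if name.endswith("_c"):
            base = name[: -len("_c")]
            simd_cycles = cycles.get(base + "_ssse3")
            if simd_cycles:
                ratios[base] = c_cycles / simd_cycles
    return ratios


# Three pairs copied from the listing above.
SAMPLE = """\
inv_txfm_add_8x8_adst_adst_0_8bpc_c: 2165.6
inv_txfm_add_8x8_adst_adst_0_8bpc_ssse3: 194.5
inv_txfm_add_8x8_dct_dct_0_8bpc_c: 2247.8
inv_txfm_add_8x8_dct_dct_0_8bpc_ssse3: 34.0
inv_txfm_add_8x8_identity_identity_0_8bpc_c: 975.9
inv_txfm_add_8x8_identity_identity_0_8bpc_ssse3: 54.7
"""

for base, ratio in speedups(SAMPLE.splitlines()).items():
    print(f"{base}: {ratio:.1f}x")
# inv_txfm_add_8x8_adst_adst_0_8bpc: 11.1x
# inv_txfm_add_8x8_dct_dct_0_8bpc: 66.1x
# inv_txfm_add_8x8_identity_identity_0_8bpc: 17.8x
```

Feeding the full listing through the same helper gives the ratio for every transform pair; lines that are not `name: cycles` entries are skipped.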
