Skip to content

x86/itx: Add {16x32,32x16,32x32} 12bpc AVX2 identity transforms

Matthias Dressel requested to merge another/dav1d:itx-12-avx2-idtx into master
inv_txfm_add_16x32_identity_identity_0_12bpc_c:      4287.9 ( 1.00x)
inv_txfm_add_16x32_identity_identity_0_12bpc_avx2:     31.4 (136.66x)
inv_txfm_add_16x32_identity_identity_1_12bpc_c:      4293.7 ( 1.00x)
inv_txfm_add_16x32_identity_identity_1_12bpc_avx2:     30.9 (139.07x)
inv_txfm_add_16x32_identity_identity_2_12bpc_c:      4273.8 ( 1.00x)
inv_txfm_add_16x32_identity_identity_2_12bpc_avx2:     97.3 (43.92x)
inv_txfm_add_16x32_identity_identity_3_12bpc_c:      4269.0 ( 1.00x)
inv_txfm_add_16x32_identity_identity_3_12bpc_avx2:    165.2 (25.83x)
inv_txfm_add_16x32_identity_identity_4_12bpc_c:      4284.4 ( 1.00x)
inv_txfm_add_16x32_identity_identity_4_12bpc_avx2:    235.2 (18.22x)

inv_txfm_add_32x16_identity_identity_0_12bpc_c:      4138.7 ( 1.00x)
inv_txfm_add_32x16_identity_identity_0_12bpc_avx2:     30.4 (136.26x)
inv_txfm_add_32x16_identity_identity_1_12bpc_c:      4147.5 ( 1.00x)
inv_txfm_add_32x16_identity_identity_1_12bpc_avx2:     30.7 (135.25x)
inv_txfm_add_32x16_identity_identity_2_12bpc_c:      4138.2 ( 1.00x)
inv_txfm_add_32x16_identity_identity_2_12bpc_avx2:     98.9 (41.84x)
inv_txfm_add_32x16_identity_identity_3_12bpc_c:      4136.6 ( 1.00x)
inv_txfm_add_32x16_identity_identity_3_12bpc_avx2:    167.7 (24.67x)
inv_txfm_add_32x16_identity_identity_4_12bpc_c:      4156.3 ( 1.00x)
inv_txfm_add_32x16_identity_identity_4_12bpc_avx2:    242.1 (17.17x)

inv_txfm_add_32x32_identity_identity_0_12bpc_c:      5785.8 ( 1.00x)
inv_txfm_add_32x32_identity_identity_0_12bpc_avx2:     20.7 (279.65x)
inv_txfm_add_32x32_identity_identity_1_12bpc_c:      5896.9 ( 1.00x)
inv_txfm_add_32x32_identity_identity_1_12bpc_avx2:     20.7 (285.01x)
inv_txfm_add_32x32_identity_identity_2_12bpc_c:      5799.5 ( 1.00x)
inv_txfm_add_32x32_identity_identity_2_12bpc_avx2:     68.9 (84.20x)
inv_txfm_add_32x32_identity_identity_3_12bpc_c:      5798.1 ( 1.00x)
inv_txfm_add_32x32_identity_identity_3_12bpc_avx2:    140.6 (41.25x)
inv_txfm_add_32x32_identity_identity_4_12bpc_c:      5803.3 ( 1.00x)
inv_txfm_add_32x32_identity_identity_4_12bpc_avx2:    308.2 (18.83x)

Merge request reports