• Victorien Le Couviour--Tuffet's avatar
    x86: add SSSE3 cdef filters implementation · 791ec219
    Victorien Le Couviour--Tuffet authored
    AVX2 adaption
    
    ---------------------
    x86_64:
    ------------------------------------------
    cdef_filter_4x4_8bpc_c: 1370.2
    cdef_filter_4x4_8bpc_ssse3: 142.3
    cdef_filter_4x4_8bpc_avx2: 106.7
    ------------------------------------------
    cdef_filter_4x8_8bpc_c: 2749.3
    cdef_filter_4x8_8bpc_ssse3: 257.2
    cdef_filter_4x8_8bpc_avx2: 178.8
    ------------------------------------------
    cdef_filter_8x8_8bpc_c: 5609.5
    cdef_filter_8x8_8bpc_ssse3: 438.1
    cdef_filter_8x8_8bpc_avx2: 250.6
    ------------------------------------------
    
    ---------------------
    x86_32:
    ------------------------------------------
    cdef_filter_4x4_8bpc_c: 1548.7
    cdef_filter_4x4_8bpc_ssse3: 179.8
    ------------------------------------------
    cdef_filter_4x8_8bpc_c: 3128.2
    cdef_filter_4x8_8bpc_ssse3: 328.1
    ------------------------------------------
    cdef_filter_8x8_8bpc_c: 6454.5
    cdef_filter_8x8_8bpc_ssse3: 584.4
    ------------------------------------------
    791ec219
cdef_init_tmpl.c 2.3 KB