• Victorien Le Couviour--Tuffet's avatar
    x86: cdef_filter: use a better constant for SSE4 · 22c3594d
    Victorien Le Couviour--Tuffet authored
    Port of dc2ae517 for AVX-2
    from Kyle Siefring.
    
    ---------------------
    x86_64:
    ------------------------------------------
    cdef_filter_4x4_8bpc_ssse3: 141.7
    cdef_filter_4x4_8bpc_sse4: 128.3
    ------------------------------------------
    cdef_filter_4x8_8bpc_ssse3: 253.4
    cdef_filter_4x8_8bpc_sse4: 228.5
    ------------------------------------------
    cdef_filter_8x8_8bpc_ssse3: 429.6
    cdef_filter_8x8_8bpc_sse4: 379.9
    ------------------------------------------
    
    ---------------------
    x86_32:
    ------------------------------------------
    cdef_filter_4x4_8bpc_ssse3: 184.3
    cdef_filter_4x4_8bpc_sse4: 168.9
    ------------------------------------------
    cdef_filter_4x8_8bpc_ssse3: 335.3
    cdef_filter_4x8_8bpc_sse4: 305.1
    ------------------------------------------
    cdef_filter_8x8_8bpc_ssse3: 579.1
    cdef_filter_8x8_8bpc_sse4: 517.0
    ------------------------------------------
    22c3594d
meson.build 7.11 KB