    x86: cdef_dir: optimize best cost finding for SSE · 91568b2a
    Victorien Le Couviour--Tuffet authored
    Port of 65ee1233 (the AVX2 version, by Kyle Siefring) to SSE4.1,
    and optimize SSSE3.
    
    ---------------------
    x86_64:
    ------------------------------------------
    before: cdef_dir_8bpc_ssse3: 110.3
     after: cdef_dir_8bpc_ssse3: 105.9
       new: cdef_dir_8bpc_sse4:   96.4
    ------------------------------------------
    
    ---------------------
    x86_32:
    ------------------------------------------
    before: cdef_dir_8bpc_ssse3: 120.6
     after: cdef_dir_8bpc_ssse3: 110.7
       new: cdef_dir_8bpc_sse4:  106.5
    ------------------------------------------
Name
doc
include
snap
src
tests
tools
.gitignore
.gitlab-ci.yml
CONTRIBUTING.md
COPYING
NEWS
README.md
THANKS.md
meson.build
meson_options.txt