• Ronald S. Bultje's avatar
    Add a 4x4 cdef_filter AVX2 implementation · 46a3fd20
    Ronald S. Bultje authored
    cdef_filter_4x4_8bpc_c: 2273.6
    cdef_filter_4x4_8bpc_avx2: 113.6
    
    Decoding time reduces to 15.51s for first 1000 frames of chimera 1080p,
    from 23.1 before cdef_filter SIMD or 17.86 with only 8x8 cdef_filter
    SIMD.
    46a3fd20
Name
Last commit
Last update
doc Loading commit data...
include Loading commit data...
src Loading commit data...
tests Loading commit data...
tools Loading commit data...
.gitignore Loading commit data...
.gitlab-ci.yml Loading commit data...
CONTRIBUTING.md Loading commit data...
COPYING Loading commit data...
NEWS Loading commit data...
README.md Loading commit data...
THANKS.md Loading commit data...
meson.build Loading commit data...
meson_options.txt Loading commit data...