• Janne Grunau's avatar
    aarch64: Optimize various intra_predict asm functions · aec81efd
    Janne Grunau authored
    Make them at least as fast as the compiled C version (tested on
    cortex-a53 vs. gcc 4.9.2).
    
                            C     NEON (before)   NEON (after)
    intra_predict_4x4_dc:   260   335             260
    intra_predict_4x4_dct:  210   265             200
    intra_predict_8x8c_dc:  497   548             493
    intra_predict_8x8c_v:   232   309             179 (arm64)
    intra_predict_8x16c_dc: 795   830             790
    aec81efd
Name
Last commit
Last update
common Loading commit data...
doc Loading commit data...
encoder Loading commit data...
extras Loading commit data...
filters Loading commit data...
input Loading commit data...
output Loading commit data...
tools Loading commit data...
.gitignore Loading commit data...
AUTHORS Loading commit data...
COPYING Loading commit data...
Makefile Loading commit data...
config.guess Loading commit data...
config.sub Loading commit data...
configure Loading commit data...
example.c Loading commit data...
version.sh Loading commit data...
x264.c Loading commit data...
x264.h Loading commit data...
x264cli.h Loading commit data...
x264dll.c Loading commit data...
x264res.rc Loading commit data...