    aarch64: NEON asm for integral init · be7e5fa6
    Janne Grunau authored
    integral_init4h_neon and integral_init8h_neon are 3-4 times faster than
    the C versions. integral_init8v_neon is 6 times faster and
    integral_init4v_neon is 10 times faster.
mc-a.S 43.4 KB