    x86: optimize and clean up predictor checking · 6371c3a5
    Fiona Glaser authored
    Branchlessly handle elimination of candidates in MMX roundclip asm.
    Add a new asm function, similar to roundclip, except without the round part.
    Optimize and organize the C code, and make both subme>=3 and subme<3 consistent.
    Add lots of explanatory comments and try to make things a little more understandable.
    ~5-10% faster with subme>=3, ~15-20% faster with subme<3.
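To illustrate what "branchlessly handle elimination of candidates" can mean, here is a minimal C sketch, not the code from this commit. It assumes candidates are stored in quarter-pel units and rounded to full-pel, that a candidate is dropped when its rounded value is zero or equals the predicted vector, and that the comparison happens before clipping. The names `predictor_roundclip_sketch`, `clip3`, `pack_mv`, and the `mv_limit` layout are hypothetical.

```c
#include <stdint.h>

/* Hypothetical helper: clamp v to the range [lo, hi]. */
static int clip3(int v, int lo, int hi)
{
    return v < lo ? lo : v > hi ? hi : v;
}

/* Hypothetical helper: pack x and y into one 32-bit word so that a
 * whole vector can be compared with a single integer comparison. */
static uint32_t pack_mv(int mx, int my)
{
    return (uint32_t)(uint16_t)mx | ((uint32_t)(uint16_t)my << 16);
}

/* Round each candidate (assumed quarter-pel) to full-pel, clip it to
 * mv_limit (row 0 = minimum x/y, row 1 = maximum x/y), and keep it only
 * if the rounded vector is neither zero nor equal to pmv.
 * dst must have room for i_mvc entries. Returns the number kept.
 * Note: >> on a negative int is assumed to be an arithmetic shift. */
static int predictor_roundclip_sketch(int16_t (*dst)[2], int16_t (*mvc)[2],
                                      int i_mvc, int16_t mv_limit[2][2],
                                      uint32_t pmv)
{
    int cnt = 0;
    for (int i = 0; i < i_mvc; i++) {
        int mx = (mvc[i][0] + 2) >> 2;
        int my = (mvc[i][1] + 2) >> 2;
        uint32_t mv = pack_mv(mx, my);

        /* Always store into the next free slot... */
        dst[cnt][0] = (int16_t)clip3(mx, mv_limit[0][0], mv_limit[1][0]);
        dst[cnt][1] = (int16_t)clip3(my, mv_limit[0][1], mv_limit[1][1]);

        /* ...and advance the slot only for candidates worth keeping.
         * A rejected candidate is simply overwritten by the next one,
         * so no "skip this candidate" branch is needed. */
        cnt += (mv != 0) & (mv != pmv);
    }
    return cnt;
}
```

The unconditional store followed by a conditional counter increment replaces the usual `if (...) continue;`. A clip-only variant, like the new function the message mentions, would follow the same pattern without the rounding step.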