1. 21 Jan, 2017 1 commit
  2. 20 Sep, 2016 2 commits
  3. 16 Jan, 2016 1 commit
  4. 11 Oct, 2015 2 commits
    • Martin Storsjö's avatar
      arm: Implement x264_pixel_sa8d_satd_16x16_neon · e8b95e92
      Martin Storsjö authored
      This requires spilling some registers to the stack,
      contray to the aarch64 version.
      
      checkasm timing        Cortex-A7      A8     A9
      sa8d_satd_16x16_neon          12936   6365   7492
      sa8d_satd_16x16_separate_neon 14841   6605   8324
      e8b95e92
    • Martin Storsjö's avatar
      arm: Add neon versions of vsad, asd8 and ssd_nv12_core · 8feb733e
      Martin Storsjö authored
      These are straight translations of the aarch64 versions.
      
      checkasm timing      Cortex-A7      A8      A9
      vsad_c                      16234   10984   9850
      vsad_neon                   2132    1020    789
      
      asd8_c                      5859    3561    3543
      asd8_neon                   1407    1279    1250
      
      ssd_nv12_c                  608096  591072  426285
      ssd_nv12_neon               72752   33549   41347
      8feb733e
  5. 25 Jul, 2015 1 commit
  6. 23 Feb, 2015 1 commit
  7. 16 Dec, 2014 6 commits
  8. 17 Oct, 2014 1 commit
  9. 26 Aug, 2014 2 commits
  10. 20 Jul, 2014 2 commits
  11. 22 Apr, 2014 5 commits
  12. 13 Mar, 2014 1 commit
  13. 12 Mar, 2014 2 commits
  14. 08 Jan, 2014 2 commits
  15. 23 Aug, 2013 1 commit
  16. 05 Jul, 2013 1 commit
    • Henrik Gramner's avatar
      x86: Remove X264_CPU_SSE_MISALIGN functions · ff41804e
      Henrik Gramner authored
      Prevents a crash if the misaligned exception mask bit is cleared for some reason.
      
      Misaligned SSE functions are only used on AMD Phenom CPUs and the benefit is miniscule.
      They also require modifying the MXCSR control register and by removing those functions
      we can get rid of that complexity altogether.
      
      VEX-encoded instructions also supports unaligned memory operands. I tried adding AVX
      implementations of all removed functions but there were no performance improvements on
      Ivy Bridge. pixel_sad_x3 and pixel_sad_x4 had significant code size reductions though
      so I kept them and added some minor cosmetics fixes and tweaks.
      ff41804e
  17. 20 May, 2013 2 commits
  18. 17 May, 2013 2 commits
  19. 23 Apr, 2013 5 commits