  1. 21 Jan, 2017 1 commit
  2. 21 Nov, 2016 1 commit
    • arm: Don't use vcmp.f64 for testing for an all-zeros register · 834e1b11
      Martin Storsjö authored
      On iOS, vcmp.f64 can behave as if the register were zero when the
      register, interpreted as an f64, holds a denormal number.
      
      vcmp.f64 (like other VFP instructions) traps to the kernel if the
      value is a denormal. The kernel is supposed to implement the FP
      operation, which it apparently doesn't do properly on iOS. When the
      trap happens, the whole comparison also ends up far more costly.
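
The hazard described above can be illustrated in portable C. This is only a sketch of the idea, not the NEON code from the commit: the helper names `is_all_zero_bits` and `is_denormal_f64` are hypothetical, and the commit message does not say which instruction sequence replaced `vcmp.f64`. The point is that a 64-bit register image holding small packed integers can have an all-zero exponent field and a nonzero mantissa, which is a denormal when read as an f64, while a plain integer test never involves the FP unit.

```c
#include <stdint.h>

/* Integer test: true only when every bit of the 64-bit register image
 * is zero. No floating-point interpretation is involved. */
static int is_all_zero_bits(uint64_t bits)
{
    return bits == 0;
}

/* True when the pattern, read as an IEEE 754 binary64, is a denormal:
 * exponent field (bits 62..52) all zero, mantissa (bits 51..0) nonzero. */
static int is_denormal_f64(uint64_t bits)
{
    uint64_t exponent = (bits >> 52) & 0x7ff;
    uint64_t mantissa = bits & ((UINT64_C(1) << 52) - 1);
    return exponent == 0 && mantissa != 0;
}
```

For example, the hypothetical pattern `0x00000000000000ff` is nonzero under the integer test but is a denormal when viewed as an f64, which is exactly the kind of value for which a floating-point comparison against zero becomes unreliable or slow on the affected platform.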
  3. 16 Jan, 2016 1 commit
  4. 11 Oct, 2015 5 commits
    • arm: Implement luma intra deblocking · 654901df
      Martin Storsjö authored
      checkasm timing              Cortex-A7      A8      A9
      deblock_luma_intra[0]_c           5988    4653    4316
      deblock_luma_intra[0]_neon        3103    2170    2128
      deblock_luma_intra[1]_c           7119    5905    5347
      deblock_luma_intra[1]_neon        2068    1381    1412
      
      This includes extra optimizations by Janne Grunau.
      
      Timings from a separate build, on Exynos 5422:
      
                                   Cortex-A7     A15
      deblock_luma_intra[0]_c           6627    3300
      deblock_luma_intra[0]_neon        3059    1128
      deblock_luma_intra[1]_c           7314    4128
      deblock_luma_intra[1]_neon        2038     720
    • arm: Implement chroma intra deblock · 3902ae02
      Martin Storsjö authored
      checkasm timing                       Cortex-A7      A8      A9
      deblock_chroma_420_intra_mbaff_c           1469    1276    1181
      deblock_chroma_420_intra_mbaff_neon         981     717     644
      deblock_chroma_intra[1]_c                  2954    2402    2321
      deblock_chroma_intra[1]_neon                947     581     575
      deblock_h_chroma_420_intra_c               2859    2509    2264
      deblock_h_chroma_420_intra_neon            1480    1119    1028
      deblock_h_chroma_422_intra_c               6211    5030    4792
      deblock_h_chroma_422_intra_neon            2894    1990    2077
    • arm: Implement x264_deblock_h_chroma_mbaff_neon · 6bbaa275
      Martin Storsjö authored
      checkasm timing                 Cortex-A7      A8      A9
      deblock_chroma_420_mbaff_c           1944    1706    1526
      deblock_chroma_420_mbaff_neon        1210     873     865
    • arm: Implement x264_deblock_h_chroma_422_neon · 3c66591e
      Martin Storsjö authored
      checkasm timing             Cortex-A7      A8      A9
      deblock_h_chroma_422_c           6953    6269    5145
      deblock_h_chroma_422_neon        3905    2569    2551
    • arm: Optimize x264_deblock_h_chroma_neon · 89439b2c
      Martin Storsjö authored
      Shuffle both chroma components together as a 16-bit unit, and
      don't write the unchanged columns (as in x264_deblock_h_luma_neon
      and in the aarch64 version of the function).
      
      This causes a minor slowdown for x264_deblock_v_chroma_neon, but
      it is negligible compared to the speedup of the horizontal version.
      
      checkasm timing             Cortex-A7      A8      A9
      deblock_chroma[1]_c              4817    4057    3601
      deblock_chroma[1]_neon           1249     716     817  (before)
      deblock_chroma[1]_neon           1249     766     845  (after)
      
      deblock_h_chroma_420_c           3699    3275    2830
      deblock_h_chroma_420_neon        2068    1414    1400  (before)
      deblock_h_chroma_420_neon        1838    1355    1291  (after)
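
The two ideas in this commit message (moving each U/V pair as one 16-bit unit, and skipping stores for columns the filter does not change) can be sketched in scalar C. This is an illustration under stated assumptions, not the NEON implementation: it assumes interleaved chroma (U,V,U,V,...) and a hypothetical row layout of four pixel positions p1, p0, q0, q1 around a vertical edge, and the helper names are invented for the example. The normal chroma deblocking filter modifies only p0 and q0, so only the two middle pairs need to be written back.

```c
#include <stdint.h>
#include <string.h>

enum { PAIRS_PER_ROW = 4 };  /* p1, p0, q0, q1 (assumed layout) */

/* Load one row of interleaved chroma as four 16-bit units, so that the
 * U and V samples of each pixel position always move together. */
static void load_row_pairs(const uint8_t *row, uint16_t pairs[PAIRS_PER_ROW])
{
    memcpy(pairs, row, PAIRS_PER_ROW * sizeof pairs[0]);
}

/* Write back only p0 and q0 (pairs 1 and 2). The p1 and q1 columns are
 * not modified by the filter, so they are never stored. */
static void store_changed_pairs(uint8_t *row, const uint16_t pairs[PAIRS_PER_ROW])
{
    memcpy(row + sizeof pairs[0], &pairs[1], 2 * sizeof pairs[0]);
}
```

Because both helpers go through `memcpy`, the byte order inside each 16-bit unit is preserved regardless of endianness; the unit is only a container that keeps U and V side by side.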
  5. 23 Feb, 2015 1 commit
  6. 26 Aug, 2014 2 commits
  7. 22 Apr, 2014 1 commit
  8. 08 Jan, 2014 1 commit
  9. 09 Jan, 2013 1 commit
  10. 04 Feb, 2012 1 commit
  11. 22 Oct, 2011 1 commit
  12. 25 Jan, 2011 1 commit
  13. 18 Sep, 2010 1 commit
    • Update source file headers · 213a99d0
      Fiona Glaser authored
      Update dates, improve file descriptions, make things more consistent.
      Also add information about commercial licensing.
  14. 09 Nov, 2009 1 commit
    • Various ARM-related fixes · 53a5772a
      David Conrad authored
      Fix comment for mc_copy_neon.
      Fix memzero_aligned_neon prototype.
      Update NEON (i)dct_dc prototypes.
      Duplicate x86 behavior for global+hidden functions.
  15. 02 Sep, 2009 1 commit
  16. 24 Aug, 2009 1 commit