1. 10 May, 2009 3 commits
  2. 21 Apr, 2009 2 commits
  3. 18 Apr, 2009 2 commits
  4. 17 Apr, 2009 1 commit
  5. 14 Apr, 2009 1 commit
  6. 09 Apr, 2009 1 commit
    • Fiona Glaser's avatar
      Various CABAC optimizations · 2bcc39fd
      Fiona Glaser authored
      Move calculation of b_intra out of the core residual loop and hardcode it where applicable.
      Inlining cabac_mb_mvd was unnecessary and wasted tremendous amounts of code size.  Inlining only cache_mvd is faster and significantly smaller.
      2bcc39fd
  7. 08 Apr, 2009 1 commit
  8. 05 Apr, 2009 1 commit
    • Fiona Glaser's avatar
      Faster CABAC RDO · be3c3d21
      Fiona Glaser authored
      Since the bypass case is quite unlikely, especially when doing merged sigmap/level coding,
      it's faster to use a branch than a cmov.
      be3c3d21
  9. 31 Mar, 2009 3 commits
  10. 30 Mar, 2009 3 commits
  11. 27 Mar, 2009 1 commit
  12. 19 Mar, 2009 1 commit
  13. 17 Mar, 2009 1 commit
    • Fiona Glaser's avatar
      SSE2 zigzag_interleave · d25d50c9
      Fiona Glaser authored
      Replace PHADD with FastShuffle (more accurate naming).
      This flag represents asm functions that rely on fast SSE2 shuffle units, and thus are only faster on Phenom, Nehalem, and Penryn CPUs.
      d25d50c9
  14. 10 Mar, 2009 1 commit
  15. 09 Mar, 2009 1 commit
  16. 08 Mar, 2009 1 commit
  17. 07 Mar, 2009 3 commits
    • Fiona Glaser's avatar
      SSSE3 hpel_filter_v · f701ebc8
      Fiona Glaser authored
      Optimized using the same method as in r1122.  Patch partially by Holger.
      ~8% faster hpel filter on 64-bit Nehalem
      f701ebc8
    • Fiona Glaser's avatar
      Update some asm copyright headers · 936f76e0
      Fiona Glaser authored
      936f76e0
    • Holger Lubitz's avatar
      Vastly faster SATD/SA8D/Hadamard_AC/SSD/DCT/IDCT · 54e38917
      Holger Lubitz authored
      Heavily optimized for Core 2 and Nehalem, but performance should improve on all modern x86 CPUs.
      16x16 SATD: +18% speed on K8(64bit), +22% on K10(32bit), +42% on Penryn(64bit), +44% on Nehalem(64bit), +50% on P4(32bit), +98% on Conroe(64bit)
      Similar performance boosts in SATD-like functions (SA8D, hadamard_ac) and somewhat less in DCT/IDCT/SSD.
      Overall performance boost is up to ~15% on 64-bit Conroe.
      54e38917
  18. 06 Mar, 2009 1 commit
  19. 04 Mar, 2009 4 commits
  20. 03 Mar, 2009 1 commit
  21. 26 Feb, 2009 1 commit
  22. 16 Feb, 2009 1 commit
  23. 14 Feb, 2009 1 commit
  24. 11 Feb, 2009 2 commits
  25. 10 Feb, 2009 1 commit
    • Manuel Rommel's avatar
      fix 10l in 75b495f2723fcb77f · 65304078
      Manuel Rommel authored
      Original thread:
      date: Mon, Feb 9, 2009 at 9:37 PM
      subject: [x264-devel] commit: Spare a vec_perm and a vec_mergeh though using a LUT of permutation vectors . (Guillaume Poirier )
      65304078
  26. 09 Feb, 2009 1 commit