1. 05 Apr, 2009 1 commit
    • Fiona Glaser's avatar
      Faster CABAC RDO · be3c3d21
      Fiona Glaser authored
      Since the bypass case is quite unlikely, especially when doing merged sigmap/level coding,
      it's faster to use a branch than a cmov.
      be3c3d21
  2. 31 Mar, 2009 3 commits
  3. 30 Mar, 2009 3 commits
  4. 27 Mar, 2009 1 commit
  5. 19 Mar, 2009 1 commit
  6. 17 Mar, 2009 1 commit
    • Fiona Glaser's avatar
      SSE2 zigzag_interleave · d25d50c9
      Fiona Glaser authored
      Replace PHADD with FastShuffle (more accurate naming).
      This flag represents asm functions that rely on fast SSE2 shuffle units, and thus are only faster on Phenom, Nehalem, and Penryn CPUs.
      d25d50c9
  7. 10 Mar, 2009 1 commit
  8. 09 Mar, 2009 1 commit
  9. 08 Mar, 2009 1 commit
  10. 07 Mar, 2009 3 commits
    • Fiona Glaser's avatar
      SSSE3 hpel_filter_v · f701ebc8
      Fiona Glaser authored
      Optimized using the same method as in r1122.  Patch partially by Holger.
      ~8% faster hpel filter on 64-bit Nehalem
      f701ebc8
    • Fiona Glaser's avatar
      Update some asm copyright headers · 936f76e0
      Fiona Glaser authored
      936f76e0
    • Holger Lubitz's avatar
      Vastly faster SATD/SA8D/Hadamard_AC/SSD/DCT/IDCT · 54e38917
      Holger Lubitz authored
      Heavily optimized for Core 2 and Nehalem, but performance should improve on all modern x86 CPUs.
      16x16 SATD: +18% speed on K8(64bit), +22% on K10(32bit), +42% on Penryn(64bit), +44% on Nehalem(64bit), +50% on P4(32bit), +98% on Conroe(64bit)
      Similar performance boosts in SATD-like functions (SA8D, hadamard_ac) and somewhat less in DCT/IDCT/SSD.
      Overall performance boost is up to ~15% on 64-bit Conroe.
      54e38917
  11. 06 Mar, 2009 1 commit
  12. 04 Mar, 2009 4 commits
  13. 03 Mar, 2009 1 commit
  14. 26 Feb, 2009 1 commit
  15. 16 Feb, 2009 1 commit
  16. 14 Feb, 2009 1 commit
  17. 11 Feb, 2009 2 commits
  18. 10 Feb, 2009 1 commit
    • Manuel Rommel's avatar
      fix 10l in 75b495f2723fcb77f · 65304078
      Manuel Rommel authored
      Original thread:
      date: Mon, Feb 9, 2009 at 9:37 PM
      subject: [x264-devel] commit: Spare a vec_perm and a vec_mergeh though using a LUT of permutation vectors . (Guillaume Poirier )
      65304078
  19. 09 Feb, 2009 7 commits
  20. 08 Feb, 2009 1 commit
  21. 04 Feb, 2009 2 commits
  22. 03 Feb, 2009 2 commits