1. 05 Dec, 2018 2 commits
  2. 03 Dec, 2018 1 commit
    • Ronald S. Bultje's avatar
      Make per-width versions of cfl_ac · 70fb01d8
      Ronald S. Bultje authored
      Also use aligned reads and writes in sub_loop, and integrate sum_loop into
      the main loop.
      
      before:
      cfl_ac_420_w4_8bpc_c: 367.4
      cfl_ac_420_w4_8bpc_avx2: 72.8
      cfl_ac_420_w8_8bpc_c: 621.6
      cfl_ac_420_w8_8bpc_avx2: 85.1
      cfl_ac_420_w16_8bpc_c: 983.4
      cfl_ac_420_w16_8bpc_avx2: 141.0
      
      after:
      cfl_ac_420_w4_8bpc_c: 376.2
      cfl_ac_420_w4_8bpc_avx2: 28.5
      cfl_ac_420_w8_8bpc_c: 607.2
      cfl_ac_420_w8_8bpc_avx2: 29.9
      cfl_ac_420_w16_8bpc_c: 962.1
      cfl_ac_420_w16_8bpc_avx2: 48.8
      70fb01d8
  3. 23 Nov, 2018 1 commit
  4. 12 Nov, 2018 1 commit
    • Ronald S. Bultje's avatar
      Add a max_width/height argument to angular_ipred_fn · 2f251bd1
      Ronald S. Bultje authored
      This is used in z2 to limit the number of pixels over which the
      filter is applied, as per "numPx" in 7.11.2.4 point 4 in the AV1
      specification. This only applies to z2, because in z1/3, the edge
      filter is (incomprehensibly) lengtened by the opposite side's edge
      length, which undoes the limit on the filter length (like a bug
      undoing another bug).
      
      I admit the code is getting rather complex, so we may want to
      redesign this to make writing SIMD easier.
      2f251bd1
  5. 20 Oct, 2018 2 commits
  6. 19 Oct, 2018 1 commit
  7. 08 Oct, 2018 1 commit