1. 18 Dec, 2018 1 commit
  2. 11 Dec, 2018 2 commits
  3. 10 Dec, 2018 1 commit
  4. 08 Dec, 2018 1 commit
  5. 07 Dec, 2018 1 commit
  6. 05 Dec, 2018 3 commits
  7. 03 Dec, 2018 1 commit
    • Make per-width versions of cfl_ac · 70fb01d8
      Ronald S. Bultje authored
      Also use aligned reads and writes in sub_loop, and integrate sum_loop into
      the main loop.
      
      before:
      cfl_ac_420_w4_8bpc_c: 367.4
      cfl_ac_420_w4_8bpc_avx2: 72.8
      cfl_ac_420_w8_8bpc_c: 621.6
      cfl_ac_420_w8_8bpc_avx2: 85.1
      cfl_ac_420_w16_8bpc_c: 983.4
      cfl_ac_420_w16_8bpc_avx2: 141.0
      
      after:
      cfl_ac_420_w4_8bpc_c: 376.2
      cfl_ac_420_w4_8bpc_avx2: 28.5
      cfl_ac_420_w8_8bpc_c: 607.2
      cfl_ac_420_w8_8bpc_avx2: 29.9
      cfl_ac_420_w16_8bpc_c: 962.1
      cfl_ac_420_w16_8bpc_avx2: 48.8
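
      The idea of folding the sum into the main loop can be sketched in
      plain C. This is a simplified illustration, not the dav1d C or AVX2
      code: the function name cfl_ac_420_sketch is hypothetical, edge
      padding is left out, and it assumes the AC values are kept at 8x
      luma scale (the 2x2 luma sum shifted left by one).

      ```c
      #include <assert.h>
      #include <stddef.h>
      #include <stdint.h>

      /* Hypothetical helper, not the dav1d function: fills the 4:2:0 CfL AC
       * buffer for a w x h chroma block (w, h powers of two). No edge padding. */
      static void cfl_ac_420_sketch(int16_t *ac, const uint8_t *luma,
                                    ptrdiff_t luma_stride, int w, int h)
      {
          assert(w > 0 && h > 0 && !(w & (w - 1)) && !(h & (h - 1)));

          int sum = 0;
          int16_t *row = ac;
          for (int y = 0; y < h; y++) {
              const uint8_t *top = luma + (ptrdiff_t)(2 * y) * luma_stride;
              const uint8_t *bot = top + luma_stride;
              for (int x = 0; x < w; x++) {
                  /* 2x2 luma sum, scaled by 2 (assumed 8x luma scale) */
                  const int v = (top[2 * x] + top[2 * x + 1] +
                                 bot[2 * x] + bot[2 * x + 1]) << 1;
                  row[x] = (int16_t)v;
                  sum += v; /* accumulated in the same pass, no separate sum loop */
              }
              row += w;
          }

          /* Rounded mean over the w * h samples */
          int log2sz = 0;
          for (int n = w * h; n > 1; n >>= 1)
              log2sz++;
          const int avg = (sum + ((1 << log2sz) >> 1)) >> log2sz;

          /* Second pass: remove the DC component */
          for (int i = 0; i < w * h; i++)
              ac[i] = (int16_t)(ac[i] - avg);
      }
      ```

      Fixing w at compile time (one function per width, as the commit
      title describes) lets the inner loop be fully unrolled for that
      width, which is harder to do when w is a run-time argument.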
  8. 28 Nov, 2018 1 commit
  9. 27 Nov, 2018 1 commit
    • Reset the random seed when testing each CPU type. · 560dc684
      Nathan Egge authored
      Any benchmark that uses random data as input gives bunk results, as it
      currently uses different random data on each run.
      This now makes any non-determinism in the tests repeatable across each
      call to check_cpu_flags() and checkasm_check_func().
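
      A minimal sketch of the reseeding idea, assuming a small xorshift32
      generator. All sketch_* names are hypothetical stand-ins and are not
      checkasm's own functions. The point is only that reseeding before
      each implementation is exercised gives every one of them identical
      input.

      ```c
      #include <assert.h>
      #include <stddef.h>
      #include <stdint.h>

      /* Hypothetical stand-in for a test harness PRNG state */
      static uint32_t sketch_rng_state = 1;

      static void sketch_seed(uint32_t seed)
      {
          assert(seed != 0); /* xorshift32 must not start from zero */
          sketch_rng_state = seed;
      }

      static uint32_t sketch_rand(void)
      {
          /* xorshift32 step */
          uint32_t x = sketch_rng_state;
          x ^= x << 13;
          x ^= x >> 17;
          x ^= x << 5;
          return sketch_rng_state = x;
      }

      static void sketch_fill(uint8_t *buf, size_t n)
      {
          for (size_t i = 0; i < n; i++)
              buf[i] = (uint8_t)(sketch_rand() >> 24);
      }

      /* Prepare the input for one implementation under test. Reseeding
       * first means every implementation is fed the same bytes. */
      static void sketch_prepare_input(uint8_t *buf, size_t n, uint32_t seed)
      {
          sketch_seed(seed);
          sketch_fill(buf, n);
      }
      ```

      Without the sketch_seed() call, the second implementation would
      continue from wherever the generator was left and be timed on
      different data than the first.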
  10. 26 Nov, 2018 1 commit
  11. 23 Nov, 2018 1 commit
  12. 20 Nov, 2018 2 commits
  13. 19 Nov, 2018 1 commit
    • build: Only add libdav1d_nasm_objs if needed · b3c522d5
      Marvin Scholz authored
      Current versions of Meson have a bug that makes it necessary to add
      the nasm-generated objects to checkasm, even though they should
      already be covered by the extract_all_objects() for libdav1d.
      Meson versions >= 0.48.999 (that is, Meson 0.49 and the development
      versions leading up to it on git) fixed this issue, so adding
      them is no longer needed.
      Adding them regardless would actually cause an error because of
      symbols being present twice.
  14. 16 Nov, 2018 1 commit
  15. 15 Nov, 2018 1 commit
  16. 14 Nov, 2018 2 commits
  17. 12 Nov, 2018 1 commit
    • Add a max_width/height argument to angular_ipred_fn · 2f251bd1
      Ronald S. Bultje authored
      This is used in z2 to limit the number of pixels over which the
      filter is applied, as per "numPx" in 7.11.2.4 point 4 in the AV1
      specification. This only applies to z2, because in z1/3, the edge
      filter is (incomprehensibly) lengthened by the opposite side's edge
      length, which undoes the limit on the filter length (like a bug
      undoing another bug).
      
      I admit the code is getting rather complex, so we may want to
      redesign this to make writing SIMD easier.
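
      The interaction described above can be sketched as a length
      computation. This is only an illustration of the commit message:
      the helper name and its parameters are hypothetical, and any
      constant offsets in the specification's exact numPx expression are
      deliberately left out.

      ```c
      #include <assert.h>

      static int sketch_min(int a, int b) { return a < b ? a : b; }

      /* Hypothetical helper: how many edge pixels the filter covers.
       *   edge_len     block size along this edge (w for top, h for left)
       *   max_len      pixels along this edge that lie inside the frame
       *   opposite_len block size along the other edge
       *   extend       nonzero for z1/z3, zero for z2 */
      static int sketch_edge_filter_len(int edge_len, int max_len,
                                        int opposite_len, int extend)
      {
          assert(edge_len > 0 && max_len > 0 && opposite_len > 0);
          /* The clamp is what max_width/height feeds; the extension in
           * z1/z3 is added on top of the clamped length. */
          return sketch_min(edge_len, max_len) + (extend ? opposite_len : 0);
      }
      ```

      For a 16x16 block with only 4 pixels inside the frame, the z2 case
      gives 4 while the z1/z3 case gives 20, which is the sense in which
      the extension undoes the limit.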
  18. 10 Nov, 2018 1 commit
    • Split MC blend · 58fc5165
      Henrik Gramner authored
      The mstride == 0, mstride == 1, and mstride == w cases are very different
      from each other, and splitting them into separate functions makes it easier
      to optimize them.
      
      Also add some further optimizations to the AVX2 asm that became possible
      after this change.
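
      A rough C sketch of why the split helps. The sketch_* names are
      hypothetical and this is not the dav1d code. It assumes a 6-bit
      mask (0..64) and reads the three cases as: mstride == w is a full
      per-pixel mask, mstride == 0 reuses one mask row for every line,
      and mstride == 1 uses one mask value per line.

      ```c
      #include <assert.h>
      #include <stddef.h>
      #include <stdint.h>

      /* 6-bit mask blend: m = 0 keeps a, m = 64 takes b (assumed range) */
      static uint8_t sketch_blend_px(int a, int b, int m)
      {
          return (uint8_t)((a * (64 - m) + b * m + 32) >> 6);
      }

      /* Generic form: the mask layout is decided per pixel at run time */
      static void sketch_blend_generic(uint8_t *dst, ptrdiff_t dst_stride,
                                       const uint8_t *tmp, int w, int h,
                                       const uint8_t *mask, ptrdiff_t mstride)
      {
          assert(mstride == 0 || mstride == 1 || mstride == w);
          for (int y = 0; y < h; y++) {
              for (int x = 0; x < w; x++)
                  dst[x] = sketch_blend_px(dst[x], tmp[x],
                                           mask[mstride == 1 ? 0 : x]);
              dst += dst_stride;
              tmp += w;
              mask += mstride;
          }
      }

      /* mstride == w: full per-pixel mask */
      static void sketch_blend_2d(uint8_t *dst, ptrdiff_t dst_stride,
                                  const uint8_t *tmp, int w, int h,
                                  const uint8_t *mask)
      {
          for (int y = 0; y < h; y++) {
              for (int x = 0; x < w; x++)
                  dst[x] = sketch_blend_px(dst[x], tmp[x], mask[x]);
              dst += dst_stride;
              tmp += w;
              mask += w;
          }
      }

      /* mstride == 0: the same mask row is applied to every line */
      static void sketch_blend_row_mask(uint8_t *dst, ptrdiff_t dst_stride,
                                        const uint8_t *tmp, int w, int h,
                                        const uint8_t *mask)
      {
          for (int y = 0; y < h; y++) {
              for (int x = 0; x < w; x++)
                  dst[x] = sketch_blend_px(dst[x], tmp[x], mask[x]);
              dst += dst_stride;
              tmp += w;
          }
      }

      /* mstride == 1: one mask value per line, hoisted out of the x loop */
      static void sketch_blend_col_mask(uint8_t *dst, ptrdiff_t dst_stride,
                                        const uint8_t *tmp, int w, int h,
                                        const uint8_t *mask)
      {
          for (int y = 0; y < h; y++) {
              const int m = mask[y];
              for (int x = 0; x < w; x++)
                  dst[x] = sketch_blend_px(dst[x], tmp[x], m);
              dst += dst_stride;
              tmp += w;
          }
      }
      ```

      Each specialized function has a fixed mask access pattern, so a
      SIMD version can load the mask once per call, once per line, or
      once per vector without branching on mstride.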
  19. 08 Nov, 2018 2 commits
  20. 07 Nov, 2018 2 commits
  21. 06 Nov, 2018 2 commits
  22. 05 Nov, 2018 3 commits
  23. 30 Oct, 2018 4 commits
  24. 29 Oct, 2018 2 commits
  25. 28 Oct, 2018 2 commits