  1. 19 Feb, 2019 2 commits
    • x86: Fix buffer overread in ipred_filter_avx2.w32 · dce4e788
      Henrik Gramner authored
      Eliminates segfault with clusterfuzz-testcase-minimized-dav1d_fuzzer-5697181166600192.
      
      Credit to OSS-Fuzz.
    • Add SSSE3 implementation for ipred_cfl, ipred_cfl_top, ipred_cfl_left and ipred_cfl_128 · 9caabc7b
      Xuefeng Jiang authored
      cfl_pred_cfl_128_w4_8bpc_c: 291.3
      cfl_pred_cfl_128_w4_8bpc_ssse3: 34.2
      cfl_pred_cfl_128_w8_8bpc_c: 838.8
      cfl_pred_cfl_128_w8_8bpc_ssse3: 55.4
      cfl_pred_cfl_128_w16_8bpc_c: 1640.6
      cfl_pred_cfl_128_w16_8bpc_ssse3: 91.8
      cfl_pred_cfl_128_w32_8bpc_c: 4073.5
      cfl_pred_cfl_128_w32_8bpc_ssse3: 206.6
      cfl_pred_cfl_left_w4_8bpc_c: 321.6
      cfl_pred_cfl_left_w4_8bpc_ssse3: 40.0
      cfl_pred_cfl_left_w8_8bpc_c: 887.8
      cfl_pred_cfl_left_w8_8bpc_ssse3: 62.9
      cfl_pred_cfl_left_w16_8bpc_c: 1932.6
      cfl_pred_cfl_left_w16_8bpc_ssse3: 97.4
      cfl_pred_cfl_left_w32_8bpc_c: 4795.4
      cfl_pred_cfl_left_w32_8bpc_ssse3: 213.8
      cfl_pred_cfl_top_w4_8bpc_c: 359.7
      cfl_pred_cfl_top_w4_8bpc_ssse3: 38.2
      cfl_pred_cfl_top_w8_8bpc_c: 1007.1
      cfl_pred_cfl_top_w8_8bpc_ssse3: 58.5
      cfl_pred_cfl_top_w16_8bpc_c: 1873.4
      cfl_pred_cfl_top_w16_8bpc_ssse3: 97.9
      cfl_pred_cfl_top_w32_8bpc_c: 4523.7
      cfl_pred_cfl_top_w32_8bpc_ssse3: 216.9
      cfl_pred_cfl_w4_8bpc_c: 537.4
      cfl_pred_cfl_w4_8bpc_ssse3: 44.9
      cfl_pred_cfl_w8_8bpc_c: 1031.1
      cfl_pred_cfl_w8_8bpc_ssse3: 66.7
      cfl_pred_cfl_w16_8bpc_c: 1751.5
      cfl_pred_cfl_w16_8bpc_ssse3: 99.9
      cfl_pred_cfl_w32_8bpc_c: 4814.5
      cfl_pred_cfl_w32_8bpc_ssse3: 219.8
  2. 17 Feb, 2019 2 commits
  3. 16 Feb, 2019 2 commits
  4. 15 Feb, 2019 3 commits
  5. 14 Feb, 2019 5 commits
  6. 13 Feb, 2019 13 commits
  7. 12 Feb, 2019 7 commits
  8. 11 Feb, 2019 1 commit
  9. 10 Feb, 2019 1 commit
  10. 09 Feb, 2019 2 commits
  11. 08 Feb, 2019 2 commits
    • looprestoration: add SSSE3 implementation · ab3da909
      Victorien Le Couviour--Tuffet authored
      AVX2 code adaptation
      
      ---------------------
      x86_64:
      ------------------------------------------
      selfguided_3x3_8bpc_c: 308692.1
      selfguided_3x3_8bpc_ssse3: 112436.5
      selfguided_3x3_8bpc_avx2: 61749.8
      ------------------------------------------
      selfguided_5x5_8bpc_c: 312132.8
      selfguided_5x5_8bpc_ssse3: 79513.3
      selfguided_5x5_8bpc_avx2: 45947.3
      ------------------------------------------
      selfguided_mix_8bpc_c: 588951.9
      selfguided_mix_8bpc_ssse3: 196751.5
      selfguided_mix_8bpc_avx2: 109091.6
      ------------------------------------------
      wiener_chroma_8bpc_c: 258874.8
      wiener_chroma_8bpc_ssse3: 28172.4
      wiener_chroma_8bpc_avx2: 16910.5
      ------------------------------------------
      wiener_luma_8bpc_c: 264432.3
      wiener_luma_8bpc_ssse3: 27958.3
      wiener_luma_8bpc_avx2: 17303.8
      ------------------------------------------
      
      ---------------------
      x86_32:
      ------------------------------------------
      selfguided_3x3_8bpc_c: 350430.5
      selfguided_3x3_8bpc_ssse3: 128850.8
      ------------------------------------------
      selfguided_5x5_8bpc_c: 313963.6
      selfguided_5x5_8bpc_ssse3: 81988.8
      ------------------------------------------
      selfguided_mix_8bpc_c: 630584.2
      selfguided_mix_8bpc_ssse3: 211802.0
      ------------------------------------------
      wiener_chroma_8bpc_c: 288928.5
      wiener_chroma_8bpc_ssse3: 30336.7
      ------------------------------------------
      wiener_luma_8bpc_c: 284500.6
      wiener_luma_8bpc_ssse3: 29521.9
      ------------------------------------------
    • Victorien Le Couviour--Tuffet's avatar