Commits · gles · Pablo Stebler / dav1d

Jun 10, 2019
- Performance improvements · e048bfe2
  Pablo Stebler authored Jun 08, 2019
  
  e048bfe2
- Add copyright notice · 7a431f13
  Pablo Stebler authored Jun 06, 2019
  
  7a431f13
- It works · 321ed5bb
  Pablo Stebler authored Jun 06, 2019
  
  321ed5bb
Jun 09, 2019
- meson: simplify a few checks for x86 targets · e0623286
  James Almer authored Jun 05, 2019
  
  e0623286
- x86: include config.asm in x86inc instead of every asm file · 1df18164
  James Almer authored Jun 05, 2019
  
  1df18164
Jun 07, 2019

checkasm: Check for __ARM_ARCH >= 7 for the arm cpu timer inline assembly · 13067916

Martin Storsjö authored Jun 06, 2019 and

Janne Grunau committed Jun 07, 2019

This fixes building with raspbian compilers, that default to armv6.
The isb instruction is unavailable on armv6, and the cycle counter
register is accessed differently there as well.

This fixes issue #282.

13067916

Jun 06, 2019
- CI: Added ppc64le build and test jobs · 6c90f005
  Konstantin Pavlov authored Jun 06, 2019
  
  6c90f005
Jun 05, 2019
- Update NEWS for 0.4.0 · 3e3855bf
  Jean-Baptiste Kempf authored May 22, 2019
  
  3e3855bf
- output: automatically use null muxer for /dev/null · 75c3f4a4
  Tristan Matthews authored Jun 05, 2019
  
  75c3f4a4
Jun 04, 2019

meson: Fix nasm detection · 098a565c

Marvin Scholz authored Jun 04, 2019

nasm -v can actually fail for example on macOS, where nasm could be a
stub executable that forwards commands to the real nasm, but if the real
nasm is not installed, fails.
This would lead to a confusing error message due to the out of bounds
array access, to avoid that, explicitly check the exit code.

098a565c

Jun 01, 2019
- checkasm: Fix out-of-bounds read in warp8x8 tests · 0040d92b
  Henrik Gramner authored Jun 01, 2019 and Henrik Gramner committed Jun 01, 2019
  
  0040d92b
May 31, 2019
- x86: Optimize warp8x8 AVX2 asm · 5bc43169
  Henrik Gramner authored May 31, 2019 and Henrik Gramner committed May 31, 2019
  
  5bc43169
May 24, 2019
- build: add option for fuzzer specific LDFLAGS · 785f00fe
  Janne Grunau authored May 24, 2019
```
Needed for oss-fuzz after switching to '-fsanitize=fuzzer' for the
libfuzzer based build. Adding '-fsanitize=fuzzer' for all oss-fuzz based
build breaks afl.
```
  785f00fe
- arm: Mark the stack as non-executable on ELF · 63eef332
  Martin Storsjö authored May 23, 2019
  
  63eef332
May 23, 2019
- Optimize coefficient decoding · 2cce131e
  Henrik Gramner authored May 23, 2019 and Henrik Gramner committed May 23, 2019
  
  2cce131e
May 21, 2019
- dav1d: reserve some bytes in Dav1dSettings · e88c8eed
  James Almer authored May 21, 2019
```
This way adding new fields in the future will not require breaking ABI
```
  e88c8eed
- build: Enable SSE2 by default on x86-32 · 3e0ec4cd
  Henrik Gramner authored May 16, 2019
  
  3e0ec4cd
- x86: Enable msac asm on x86-32 · 75558f8b
  Henrik Gramner authored May 16, 2019
  
  75558f8b
- Update THANKS.md · 664c6a5f
  Justin Bull authored May 20, 2019 and Jean-Baptiste Kempf committed May 21, 2019
  
  664c6a5f
- Hard wrap contribs. Add self for logo · bfd4ee57
  Justin Bull authored May 18, 2019 and Jean-Baptiste Kempf committed May 21, 2019
  
  bfd4ee57
May 19, 2019

ci: Add full testdata tests on aarch64 · a690e548
Martin Storsjö authored May 19, 2019
```
The armv7 runner doesn't seem to cope well with the testdata though.
```
a690e548
checkasm: Update the mc test to check all valid heights · 7d5f0d0c
Henrik Gramner authored May 19, 2019 and Martin Storsjö committed May 19, 2019

7d5f0d0c

arm: mc: Fix 8tap_v w8 with OBMC 3/4 heights · bf920fba

Martin Storsjö authored May 19, 2019

Also make sure that the w4 case can exit after processing 12 pixels,
where it is convenient.

This gives a small slowdown for in-order cores like A7, A8, A53, but
acutally seems to give a small speedup for out-of-order cores like
A9, A72 and A73.

AArch64:
Before:                      Cortex A53     A72     A73
mc_8tap_regular_w8_v_8bpc_neon:   223.8   247.3   228.5
After:
mc_8tap_regular_w8_v_8bpc_neon:   232.5   243.9   223.4

AArch32:
Before:                       Cortex A7      A8      A9     A53     A72     A73
mc_8tap_regular_w8_v_8bpc_neon:   550.2   470.7   520.5   257.0   256.4   248.2
After:
mc_8tap_regular_w8_v_8bpc_neon:   554.3   474.2   511.6   267.5   252.6   246.8

bf920fba

May 18, 2019
- Optimize obmc blend · f64fdae5
  Henrik Gramner authored May 18, 2019 and Henrik Gramner committed May 18, 2019
```
The last 1/4 of the mask is always zero, so we can skip some
calculations that doesn't change the output.
```
  f64fdae5
May 17, 2019
- Remove one multiply in Z2 filter top left · 3d6479ce
  Luc Trudeau authored May 17, 2019
  
  3d6479ce
- Reduce branching in intra angle to mode · d04d0a6c
  Luc Trudeau authored May 17, 2019
  
  d04d0a6c
- small code cleaning in intra_edge init_mode · af0375ca
  Luc Trudeau authored May 17, 2019
  
  af0375ca
May 16, 2019
- Fix unused function warning on parse_proc_cpuinfo() for Android · fc3777b4
  Dale Curtis authored Apr 25, 2019 and Dale Curtis committed May 16, 2019
  
  fc3777b4
- Use size_t for the msac window size · 60519f04
  Henrik Gramner authored May 16, 2019
```
Improves performance on 32-bit platforms over using uint64_t.
```
  60519f04
May 15, 2019

arm64: msac: Add handwritten versions of msac_decode_bool functions · 2e8a3a21

Martin Storsjö authored May 14, 2019

GCC                     Cortex A53   A72   A73
msac_decode_bool_c:           29.9  17.9  23.2
msac_decode_bool_neon:        27.4  15.3  20.4
msac_decode_bool_adapt_c:     49.2  26.8  31.0
msac_decode_bool_adapt_neon:  38.2  22.2  25.4
msac_decode_bool_equi_c:      26.6  16.8  19.4
msac_decode_bool_equi_neon:   23.9  13.7  15.7

Clang                   Cortex A53   A72   A73
msac_decode_bool_c:           28.0  16.4  23.1
msac_decode_bool_neon:        26.9  14.6  21.0
msac_decode_bool_adapt_c:     46.8  25.1  31.4
msac_decode_bool_adapt_neon:  36.2  19.0  26.2
msac_decode_bool_equi_c:      23.7  13.4  18.8
msac_decode_bool_equi_neon:   23.7  11.3  14.2

This is as fast as, or faster than, what either GCC or Clang
produces.

2e8a3a21

arm64: msac: Fix a typo in a comment · 84f938ec
Martin Storsjö authored May 15, 2019 and Jean-Baptiste Kempf committed May 15, 2019

84f938ec

May 14, 2019
- x86-64: Add msac_decode_bool and msac_decode_bool_adapt asm · e16e2726
  Henrik Gramner authored May 14, 2019 and Henrik Gramner committed May 14, 2019
  
  e16e2726
- Add dav1d logo · e25ed555
  Justin Bull authored May 14, 2019 and Jean-Baptiste Kempf committed May 14, 2019
```
Closes #274
```
  e25ed555
- x86-64: Add msac_decode_bool_equi asm · b20a2d63
  Henrik Gramner authored May 14, 2019
  
  b20a2d63
- Add a hard upper frame size limit on 32-bit systems · 30d5f486
  Henrik Gramner authored May 10, 2019 and Henrik Gramner committed May 14, 2019
```
Prevents overflows in malloc size calculations.
```
  30d5f486
- Add an option to limit the maximum frame size · 046188e4
  Henrik Gramner authored May 10, 2019 and Henrik Gramner committed May 14, 2019
  
  046188e4
May 12, 2019
- obu: add missing break to the default case of a switch statement · ed35b5ba
  James Almer authored May 12, 2019
  
  ed35b5ba
- obu: don't abort on unknown OBUs · d0e29420
  James Almer authored May 12, 2019
```
The spec states that a decoder should instead ignore them. Otherwise, streams
compliant with an hypothetical future revision of the spec may be rejected when
backwards compatibility is expected.
```
  d0e29420
May 11, 2019
- Update NEWS and version for 0.3.1 · c9427fd4
  Jean-Baptiste Kempf authored May 11, 2019
  
  c9427fd4
- tools: Add a cast to silence an MSVC warning · 6bb75a9d
  Martin Storsjö authored May 11, 2019
  
  6bb75a9d