dav1dplay improvements (handle other layouts, colorspaces, bit depths) and on-GPU AV1 film grain support
For a very dumb non-benchmark, I timed
dav1dplay --untimed --framethreads 8 --tilethreads 4 on the 10-bit Chimera sample with
- no settings: 1389.41s user 10.28s system 566% cpu 4:06.89 total
- --gpugrain: 1307.27s user 9.42s system 554% cpu 3:57.60 total
- apply_grain=0: 1285.95s user 11.23s system 547% cpu 3:57.01 total
This is a non-benchmark because film grain synthesis is obviously not the decoding bottleneck here, and seeing as I have 16 cores that are all mostly idle I would imagine that CPU film grain synthesis would mostly run concurrently with decoding anyway. But nonetheless, now it's fully free, rather than mostly free. It's the small victories that count, right?
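
For anyone who wants the differences spelled out, here is a small Python sketch that turns the timings quoted above into savings relative to the default run. The helper names (`wall_seconds`, `savings`, `report`) are made up for illustration and are not part of dav1d. Each configuration was only run once, so the results are rough.

```python
# Timings copied from the `time` output quoted above (one run each).
RUNS = {
    "no settings":   {"user": 1389.41, "wall": "4:06.89"},
    "--gpugrain":    {"user": 1307.27, "wall": "3:57.60"},
    "apply_grain=0": {"user": 1285.95, "wall": "3:57.01"},
}


def wall_seconds(text):
    """Convert a 'minutes:seconds' wall-clock string to seconds."""
    minutes, seconds = text.split(":")
    return int(minutes) * 60 + float(seconds)


def savings(baseline, value):
    """Return (absolute saving, saving as a percentage of the baseline)."""
    saved = baseline - value
    return saved, 100.0 * saved / baseline


def report(runs, baseline_name="no settings"):
    """Compare every run against the baseline run."""
    base = runs[baseline_name]
    rows = []
    for name, run in runs.items():
        if name == baseline_name:
            continue
        user_saved, user_pct = savings(base["user"], run["user"])
        wall_saved, wall_pct = savings(
            wall_seconds(base["wall"]), wall_seconds(run["wall"])
        )
        rows.append((name, user_saved, user_pct, wall_saved, wall_pct))
    return rows


if __name__ == "__main__":
    for name, user_saved, user_pct, wall_saved, wall_pct in report(RUNS):
        print(
            f"{name}: user -{user_saved:.2f}s ({user_pct:.1f}%), "
            f"wall -{wall_saved:.2f}s ({wall_pct:.1f}%)"
        )
```

On these numbers, --gpugrain saves about 82 s of user time (roughly 6%) and about 9 s of wall time (under 4%) compared to the default, which lands close to the apply_grain=0 run.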