Commits · d63eeb1ecc204 · VideoLAN / libplacebo

Jun 19, 2020

shaders/colorspace: introduce colorimetric gamut clipping · d63eeb1e

Niklas Haas authored 4 years ago

This change adds some smarter logic than the dumb per-channel clip. We
first get rid of negative (e.g. super-red) components by mixing in
enough gray until the result is no longer oversaturated, and then we
linearly scale all components down to get rid of too-bright values.

I experimented briefly with using e.g. BT.2390 tone-mapping instead of a
linear scale for the second step, but the results were not significantly
better enough to justify the performance hit of BT.2390.

I tested this by inspecting the resulting gamut-clipped values on a Yxy
chromaticity diagram and confirmed that the resulting color was indeed
at the same angle (relative to the white point) as the original, but
contained within the gamut (with Y preserved).

Subjective comparisons using a wide gamut monitor between the original
(unclipped) source and the clipped BT.709 versions of various test clips
seem visually pleasing, so I enabled it by default.

Fixes #101

d63eeb1e

shaders: enclose pl_shader_tone_map inside scope · 77e9c80d

Niklas Haas authored 4 years ago

For consistency with other functions, and in case we ever do end up
re-using pl_shader_tone_map multiple times within the shader (e.g. for
soft gamut mapping).

77e9c80d

Jun 17, 2020

spirv/glslang: fix build on 8.13.3743 · 82e3be18

Niklas Haas authored 4 years ago

The API change referenced in 52305682 was introduced in between patch
level 3743 and patch level 3763. The current check breaks the tagged
release 8.13.3743 which does not include the change yet.

Bump the check up to 3763 to get around this. In classic glslang style,
this means that there are a few commits which will always fail to
compile no matter what we do... but that's the reality we have to live
with when building against glslang. It's either that or they figure out
how to bump their patch level on breaking API changes.

82e3be18

spirv/glslang: fix compatibility with glslang >= 8.13.3743 · 52305682
Timo Gurr authored 4 years ago

52305682

shaders/colorspace: actually implement pl_get_detected_peak · 17938be3

Niklas Haas authored 4 years ago

Seems like I completely forgot about this function. Implement and test
it. Also bump the API version, not just because the signature changed,
but also because this is effectively the first version to actually
introduce the function - it's as though it didn't exist prior.

17938be3

dispatch: allow dispatching compute shaders with outputs · ac230145

Niklas Haas authored 4 years ago

I realized that for the purposes of e.g. `pl_shader_detect_peak`,
dispatching it as a compute shader while ignoring its output can be
useful. Based on this shader alone, there's no reason to force compute
shaders to have no output.

Not bumping the API version for this because it doesn't represent any
change in the signature, and only enables extra semantics that are
currently useless anyway. (The next commit will fix that)

ac230145

gpu: add pl_buf_copy · 7ed91730

Niklas Haas authored 4 years ago

This function can be used to copy bytes from one buffer to another,
which may be helpful in the case of e.g. needing to read-back from a
device-local shader storage buffer.

7ed91730

gpu: synchronize all pl_buf_* commands · ae15d28c

Niklas Haas authored 4 years ago

I didn't like the confusing way random pl_buf_* commands either required
host synchronization or didn't. This change makes thins uniform by
making it so that synchronization is *only* required for "external"
access to buffers (e.g. host-mapped buffers or external API interop).

In the vulkan implementation of these semantics, we expand the
`buf_barrier` mechanism to keep track of writes separately, so that we
don't unnecessary block simply due to reading from a buffer twice.
(Although in theory we could make pl_buf_write non-blocking by always
queueing a command buffer write even for host-mapped buffers.)

For OpenGL, we switch to just always using the GL commands, which should
be doing the equivalent internally already.

ae15d28c

gpu: relax multiple-of-4 requirement on pl_buf_read · 3e443350

Niklas Haas authored 4 years ago

Turns out with the way this is implemented there is effectively no need
for the multiple-of-4 restriction, since the actual access to the buffer
is via the host. So drop that requirement.

Not bumping the API version because it's such a minor change in
functionality.

3e443350

Jun 14, 2020

gpu: dispatch: vulkan: allow formatless storage images · 7832801e

Niklas Haas authored 4 years ago

Using the ReadWithoutFormat/WriteWithoutFormat device features. This
suppresses the annoying "Storable format has no matching format
qualifier" warning spam, and makes it a proper warning.

Technically an API break but I decided it's not major enough to bump the
API version, because it's sort of a strict upgrade, and I don't think
this API has any users besides mpv (which handles it correctly already).

7832801e

vulkan: refcount pl_tex · 781b2b06

Niklas Haas authored 4 years ago

See #87 for a
discussion of why.

Upon re-consideration I've determined that I'm not comfortable exposing
`pl_tex_poll` because I still fail to see a sane use case for this. The
only reasonable scenario is when using it together with `pl_tex_export`,
but that function already requires synchronization using semaphores.

I'd have to change the signature to make the `sync` optional, but I'm
not comfortable doing this without a good motivating example, because I
think omitting the `sync` and using CPU-based waits is a last resort.
(And that said, it would have to be done in both directions or risk UB)

781b2b06

vulkan: don't unnecessarily delay pl_buf_destroy · 29eb2f6a

Niklas Haas authored 4 years ago

For some reason this was done using a lazy destructor even though the
whole point of refcounting is to let commands hold on to a reference of
the buffer while it's in use. No need for the extra laziness, so just
directly deref buffers on pl_buf_destroy.

29eb2f6a

vulkan: improve surface format selection logic · ad72fc6f

Niklas Haas authored 4 years ago

Rather than the first compatible surface format, we pick the surface
with the highest score. That means: The highest total bit depth, the
best format (UNORM > SNORM > FLOAT > *), and, depending on user
preference, HDR > SDR (or vice versa).

ad72fc6f

vulkan: opengl: infringe on namespaces less · 3b528417
Niklas Haas authored 4 years ago
```
I should really stop doing this.
```
3b528417

Jun 13, 2020

dispatch: allow saving/loading the entire pl_dispatch cache state · 100d12e3

Niklas Haas authored 4 years ago

We sort of hackily recreate the concept of a program cache containing a
cache for multiple shaders, because of the way the `pl_pass` program
cache works.

In theory this is somewhat redundant with the existing concept of e.g. a
VkShaderModule, but we do things this way because it allows us to also
save other resources related to programs, for example the cached SPIR-V
(and also allows us to work with APIs that only cache programs on a
per-program basis)

We also expose this to the `pl_renderer` API.

Closes #99

100d12e3

dispatch: remove unused/leftover field · 0ad7955e

Niklas Haas authored 4 years ago

This was left over from a development version, but the actual logic was
to use the existence of `pass->pass` as a way to check if the pass is
failed or not.

Also, one of the checks was actually referencing this unused field.

0ad7955e

dispatch: cosmetic · 04eb4507
Niklas Haas authored 4 years ago
```
This is more in theme with the rest of the libplacebo code.
```
04eb4507

Jun 12, 2020

renderer: deprecate/remove pl_image.signature · 19e7008b

Niklas Haas authored 4 years ago

I decided the current semantics of this struct don't really make sense.
There's no point in forcing _every_ user to constantly be setting unique
signatures. Most users don't care, and in particular, I've made the same
mistake in my own code more often than not.

Furthermore, it's incredibly disingenuous to imply that libplacebo
caches images to speed up redraws when it effectively does not. I'm not
even sure if I think this is a good thing. It's probably a better idea
for the user to do this, because they're the only ones that have a clear
understanding of which frames will be redraws and which will not.

The bottom line is, I've decided that pl_render_image does not and
should not need to cache. So remove all references to the frame cache.

19e7008b

Jun 08, 2020

renderer: perform some basic parameter validity checks · d2c18e10

Niklas Haas authored 4 years ago

Fixes #96 by
making it clear to API users that such invocations are not supported by
the API. (Probably doesn't fix the actual bug there, but that seems to
be downstream)

d2c18e10

Jun 06, 2020

opengl: clean up EXT/version checks · d1fa963a
Niklas Haas authored 4 years ago
```
There were some FIXMEs that needed addressing.
```
d1fa963a
vulkan: rename internal constant (cosmetic) · 706a8d5a
Niklas Haas authored 4 years ago
```
VK_ is the vulkan namespace, so we shouldn't introduce upon it.
```
706a8d5a

opengl: implement pl_timer objects · b97f4d16

Niklas Haas authored 4 years ago

Using GL_ARB_timer_query and GL_TIME_ELAPSED, which matches the
semantics of pl_timer almost exactly identically.

b97f4d16

opengl: fix and refine memory barriers · 191dfbce

Niklas Haas authored 4 years ago

The current memory barriers were either used illegally
(tex_gl->target??), overspecified (e.g. the forced barrier after
glDispatchCompute), or just plain missing (e.g. barriers for storage
images that might also be used in other ways).

Fix this mess by making the barrier usage consistent:
1. Only submit memory barriers on descriptors when unbinding them from
   shaders that might write to the resource
2. Skip unnecessary memory barriers for readonly resources
3. Submit *all* the barriers required for the object's usage flags

191dfbce

dispatch: attach the appropriate memory qualifiers · eb859c1a

Niklas Haas authored 4 years ago

We consider all descriptors to be 'restrict'-qualified, because there
just plain is no use case we have for aliasing different descriptors.
Readonly/writeonly is alreayd handled by the descriptor access mode, so
no need to duplicate those memory qualifiers either.

The only ones left are 'coherent', which is required for the peak
detection shader's memory barriers to work, and 'volatile', which we
don't use. Might have a use for it in the future though, especially for
development purposes.

Slight performance gain for compute shaders (especially AV1 grain), but
more importantly, fixes a bug.

eb859c1a

Jun 05, 2020

opengl: suppress compiler warning · 4ac60af5

Niklas Haas authored 4 years ago

This warns about the cast from anonymous enum to named enum, so use a
preprocessor symbol for them instead.

4ac60af5

Jun 04, 2020

renderer: fix rect scaling bug for mismatch w/h ratio · 4b49bfde

Niklas Haas authored 4 years ago

This was done using one scale, but it should have been done with a
separate scale factor for the width and height.

Suggested by @jojo61

Fixes https://github.com/haasn/libplacebo/issues/80

4b49bfde

Jun 03, 2020
- opengl: don't treat gl_fb_query failure as a hard error · af341730
  Niklas Haas authored 4 years ago
```
Instead, implement a fallback path.
```
  af341730
- opengl: skip gl_fb_query on GLES 2's default fbo · fd33b958
  Niklas Haas authored 4 years ago
```
GLES 2.0 doesn't allow querying the default framebuffer like this.
```
  fd33b958
Jun 02, 2020
- opengl: test swapchain using pbuffers · d9b29c77
  Niklas Haas authored 4 years ago
```
We request the pbuffer surface type anyway, so might as well actually
create a pbuffer surface and test the swapchain, right?
```
  d9b29c77
- README: update coverage badge URL · 99907629
  Niklas Haas authored 4 years ago
```
Make it point at the actual HTML coverage report.
```
  99907629
- ci: coverage improvements · 208dd011
  Niklas Haas authored 4 years ago
```
Use `expose_as` to link to the HTML report, and also attach the xml as a
cobertura artifact
```
  208dd011
- ci: enable building coverage-html pages · 75124dc4
  Niklas Haas authored 4 years ago
  
  75124dc4
Jun 01, 2020

meson: always use the python version meson is using · 27087b85

Niklas Haas authored 4 years ago

Turns out:
1) utils_gen.py works on both py3 and py2,
2) meson can only use py3 anyway

This commit should guarantee it exists.

27087b85

meson: fall-back for systems without python3 symlink · 88dae4c5

Niklas Haas authored 4 years ago

I'm not entirely convinced this is sane or needed, and that this isn't a
meson bug, but whatever, it's an easy work-around to carry in order to
solve a very real bug.

Fixes #97

88dae4c5

May 31, 2020

vulkan: python3.6 compat · d6487db0

Niklas Haas authored 4 years ago

This style of matching elements based on tag contents was introduced in
python3.7, breaking builds for some users stuck with older python
versions. Fix it by simply turning the xpath expression into a manual
loop/check.

Fixes https://github.com/haasn/libplacebo/issues/77

d6487db0

May 29, 2020

vulkan: fix typo in utils_gen.py · 999d9465
Niklas Haas authored 4 years ago
```
xmfile -> xmlfile
```
999d9465

vulkan: provide better error message for missing `mako` · 4a346cc1

Niklas Haas authored 4 years ago

Users complained about being confronted with a python stack trace. So
just catch the failure and give a better error message to more clearly
communicate to the user what must be done.

Fixes #93

4a346cc1

vulkan: do a better job finding the vulkan xml · c747430c

Niklas Haas authored 4 years ago

Hard-code some common alternative locations to the vulkan XML, and
improve the error message otherwise.

Fixes https://github.com/haasn/libplacebo/issues/75
Fixes videolan/libplacebo#95

c747430c

May 28, 2020

shaders/sampling: generalize PL_SHADER_SIG_SAMPLER2D · bacc24a0

Niklas Haas authored 4 years ago

This allows us to accept sampler types other than sampler2D. In
principle, any sampler type is supported, but due to shaders/sampling
being the only user of this API, we can restrict ourselves to only 2D
samplers in the implementation. Notably, though, we also support e.g.
PL_SAMPLER_RECT.

bacc24a0

opengl: add raw texture interop API · 663c8cb3

Niklas Haas authored 4 years ago

VLC needs this in order to integrate the pl_opengl abstraction into
their OpenGL-based video output module.

Closes #91

663c8cb3