src/arm/64/mc.S · master · VideoLAN / dav1d

AArch64: Trim Armv8.0 Neon path of 6-tap and 8-tap MC functions · 82e9155c

Arpad Panyik authored Sep 10, 2024 and

Martin Storsjö committed Sep 12, 2024

There are some instruction sequences we could merge after the lane
load/store patch (ec5c3052).

This change will simplify the loading of filter weights to save 288
bytes in the Armv8.0 Neon path of 6-tap and 8-tap MC functions.

82e9155c