SVE2 SIMD
SVE2, or Scalable Vector Extension version two, is a SIMD extension of the Arm AArch64 architecture. SVE2 is a superset of both SVE and NEON. It scales from the same 128-bit width as NEON, up to 2048 bits wide, using one set of SIMD instructions.
Currently, the ARM Cortex-A510, Cortex-A710, Cortex-A715, Cortex-X2, Cortex-X3 and Neoverse N2 CPU cores support SVE2. Known chips to support SVE2 are Qualcomm Snapdragon 7 Gen 1, Snapdragon 8 Gen 1 and 8+ Gen 1, MediaTek Dimensity 9000 and 9000+, Samsung Exynos 2200 and Nvidia Grace.
Since the at least the current Snapdragon chips don't support AV1 decoding in hardware, there is a group of devices for which SVE2 SIMD could provide performance benefits. Since SVE2 is a superset of NEON, the current NEON assembly could be used as a starting point.
This issue can be used as a tracking issue like #215 or #316, if prefererend.