wiener simd unit test failure
https://code.videolan.org/ePirat/dav1d/-/jobs/4195
The issue is an overflow, need to split the pmullw in 2 stages (pmullw of the lower part, and left-shift of the higher part) and paddsw separately, that will resolve the overflow.