Skip to content
Snippets Groups Projects

ppc: itx pwr9

Merged Luca Barbato requested to merge lu_zero/dav1d:itx-pwr9 into master
3 unresolved threads

It is on top of the loopfilter work since also this rely on pwr9-only instructions

  • 4x4
  • 4x8
  • 8x4
  • 8x8
  • 16x4
  • 4x16

Tested with the argon samples (thank you @unlord for pointing me to it)

Edited by Luca Barbato

Merge request reports

Loading
Loading

Activity

Filter activity
  • Approvals
  • Assignees & reviewers
  • Comments (from bots)
  • Comments (from users)
  • Commits & branches
  • Edits
  • Labels
  • Lock status
  • Mentions
  • Merge request status
  • Tracking
  • Matthias Dressel
  • Matthias Dressel
  • Luca Barbato added 5 commits

    added 5 commits

    Compare with previous version

  • Luca Barbato added 1 commit

    added 1 commit

    Compare with previous version

  • Luca Barbato resolved all threads

    resolved all threads

  • Luca Barbato changed the description

    changed the description

  • Henrik Gramner
  • Luca Barbato added 28 commits

    added 28 commits

    Compare with previous version

  • Luca Barbato added 16 commits

    added 16 commits

    Compare with previous version

  • Henrik Gramner
  • 923 STORE_4(dst + 4 * stride, stride, e, f, g, hh)
    924 }
    925
    926
    927 #define inv_txfm_fn4x8(type1, type2) \
    928 void dav1d_inv_txfm_add_##type1##_##type2##_4x8_8bpc_pwr9(uint8_t *dst, const ptrdiff_t stride, \
    929 int16_t *const coeff, const int eob) \
    930 { \
    931 i16x8 v = vec_splats((int16_t)(2896*8)); \
    932 LOAD_SCALE_COEFF_4x8(coeff, v) \
    933 type1##_4_in(c0, c1, c2, c3, c01, c23) \
    934 type1##_4_in(c4, c5, c6, c7, c45, c67) \
    935 memset(coeff, 0, sizeof(*coeff) * 4 * 8); \
    936 TRANSPOSE4_I32(c0, c1, c2, c3); \
    937 TRANSPOSE4_I32(c4, c5, c6, c7); \
    938 type2##_8_out(c0, c1, c2, c3, c4, c5, c6, c7, c01, c23, c45, c67) \
  • Henrik Gramner
  • Henrik Gramner
    Henrik Gramner @gramner started a thread on an outdated change in commit 6618c007
  • 65 assign_itx16_fn( , 8, 8, pwr9);
    66 assign_itx16_fn(R, 8, 16, pwr9);
    67 assign_itx2_fn (R, 8, 32, pwr9);
    68 assign_itx16_fn(R, 16, 4, pwr9);
    69 assign_itx16_fn(R, 16, 8, pwr9);
    70 assign_itx12_fn( , 16, 16, pwr9);
    71 assign_itx2_fn (R, 16, 32, pwr9);
    72 assign_itx1_fn (R, 16, 64, pwr9);
    73 assign_itx2_fn (R, 32, 8, pwr9);
    74 assign_itx2_fn (R, 32, 16, pwr9);
    75 assign_itx2_fn ( , 32, 32, pwr9);
    76 assign_itx1_fn (R, 32, 64, pwr9);
    77 assign_itx1_fn (R, 64, 16, pwr9);
    78 assign_itx1_fn (R, 64, 32, pwr9);
    79 assign_itx1_fn ( , 64, 64, pwr9);
    80 */
  • Luca Barbato added 7 commits

    added 7 commits

    Compare with previous version

  • Luca Barbato added 2 commits

    added 2 commits

    • 715b2d44 - ppc: itx 8x8 pwr9
    • 9c3cb099 - ppc: Remove high bitdepth macros from the 8bit-only code

    Compare with previous version

  • changed milestone to %1.5.0

  • Luca Barbato added 6 commits

    added 6 commits

    • 805d9e5a - 1 commit from branch videolan:master
    • e4a091f4 - ppc: itx 4x4 pwr9
    • c8d030df - ppc: itx 4x8 and 8x4 pwr9
    • fd048759 - ppc: itx 8x8 pwr9
    • 37546944 - ppc: Remove high bitdepth macros from the 8bit-only code
    • 7e888715 - ppc: itx 4x16 pwr9

    Compare with previous version

  • Luca Barbato added 1 commit

    added 1 commit

    Compare with previous version

  • Luca Barbato added 21 commits

    added 21 commits

    Compare with previous version

  • Ronald S. Bultje
  • Ronald S. Bultje mentioned in merge request !1723 (merged)

    mentioned in merge request !1723 (merged)

  • Luca Barbato added 45 commits

    added 45 commits

    Compare with previous version

  • Luca Barbato changed the description

    changed the description

  • Luca Barbato added 1 commit

    added 1 commit

    Compare with previous version

  • Ronald S. Bultje
  • 19 * DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR CONTRIBUTORS BE LIABLE FOR
    20 * ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES
    21 * (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES;
    22 * LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND
    23 * ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
    24 * (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
    25 * SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
    26 */
    27
    28 #include "src/ppc/dav1d_types.h"
    29 #include "src/ppc/itx.h"
    30 #include "src/ppc/utils.h"
    31
    32 #include "src/itx_1d.h"
    33
    34 #if BITDEPTH == 8
  • Ronald S. Bultje approved this merge request

    approved this merge request

  • Luca Barbato added 9 commits

    added 9 commits

    Compare with previous version

  • Please register or sign in to reply
    Loading