- 21 Sep, 2011 5 commits
-
-
Loren Merritt authored
i4x4 analysis cycles (per partition): penryn sandybridge 184-> 75 157-> 54 preset=superfast (sad) 281->165 225->124 preset=faster (satd with early termination) 332->165 263->124 preset=medium 379->165 297->124 preset=slower (satd without early termination) This is the first code in x264 that intentionally produces different behavior on different cpus: satd_x9 is implemented only on ssse3+ and checks all intra directions, whereas the old code (on fast presets) may early terminate after checking only some of them. There is no systematic difference on slow presets, though they still occasionally disagree about tiebreaks. For ease of debugging, add an option "--cpu-independent" to disable satd_x9 and any analogous future code.
-
Loren Merritt authored
Select asm subroutines more intelligently in the wrapper functions.
-
Loren Merritt authored
High bit depth Penryn, Sandybridge cycles: 4x4_ddl: 11->10, 9-> 8 4x4_ddr: 15->13, 12->11 4x4_hd: , 15->12 4x4_hu: , 14->13 4x4_vr: 15->14, 14->12 8x8_ddl: 32->19, 19->14 8x8_ddr: 42->19, 21->14 8x8_hd: , 15->13 8x8_hu: 21->17, 16->12 8x8_vr: 33->19, 8-bit Penryn, Sandybridge cycles: 4x4_ddr: 24->15, 4x4_hd: 24->16, 4x4_hu: 23->15, 4x4_vr: 23->16, 4x4_vl: 10-> 9, 8x8_ddl: 23->15, 8x8_hd: , 17->14 8x8_hu: , 15->14 8x8_vr: 20->16, 17->13
-
Loren Merritt authored
-
Yusuke Nakamura authored
According to the spec, when frame_packing_arrangement_type is equal to 0, quincunx_sampling_flag shall be equal to 1.
-
- 09 Sep, 2011 1 commit
-
-
Oka Motofumi authored
-
- 24 Aug, 2011 17 commits
-
-
Reinhard Tartler authored
-
Loren Merritt authored
~7% faster.
-
Loren Merritt authored
~40% faster. Also some other minor asm cosmetics.
-
Loren Merritt authored
Slightly improves compression and fixes a Valgrind error.
-
Loren Merritt authored
-
Anton Mitrofanov authored
Slightly faster.
-
Loren Merritt authored
-
Fiona Glaser authored
-
Fiona Glaser authored
Passed wrong idx to trellis.
-
Anton Mitrofanov authored
-
Anton Mitrofanov authored
-
Henrik Gramner authored
-
Loren Merritt authored
-
Loren Merritt authored
-
Henrik Gramner authored
Prevents mc from using uninitialized source pixels.
-
Henrik Gramner authored
-
Fiona Glaser authored
-
- 10 Aug, 2011 3 commits
-
-
Fiona Glaser authored
Should give much better quality, but still doesn't support MB-tree yet. Also check for the same interlaced options between passes. Various minor ratecontrol cosmetics.
-
Loren Merritt authored
-
Loren Merritt authored
pixel_ads1_avx, predict_8x8_hd_avxx High bit depth mc_copy_w8_sse2, denoise_dct_avx, prefetch_fenc/ref, and several pixel*sse4.
-
- 09 Aug, 2011 7 commits
-
-
Loren Merritt authored
Unused frame_sort. Unused x86_64 dequant_4x4dc_mmx2, predict_8x8_vr_mmx2. Unused and broken high_depth integral_init*h_sse4, optimize_chroma_*, dequant_flat_*, sub8x8_dct_dc_*, zigzag_sub_*. Useless high_depth dequant_sse4, dequant_dc_sse4.
-
Loren Merritt authored
-
Loren Merritt authored
These changes were split out of the cpuflags commit because they change the output executable.
-
Loren Merritt authored
Reduces the number of macro args that need to be passed around. Allows multiple implementations of a given macro (e.g. PALIGNR) to check cpuflags at the location where the macro is defined, instead of having to select implementations by %define at toplevel. Remove INIT_AVX, as it's replaced by "INIT_XMM avx". This commit does not change the stripped executable.
-
Loren Merritt authored
-
Loren Merritt authored
-
Henrik Gramner authored
Caused slightly worse compression.
-
- 05 Aug, 2011 3 commits
-
-
Loren Merritt authored
Previously required "--asm sse2fast,fastshuffle,sse4.2,avx".
-
Anton Mitrofanov authored
-
Yasuhiro Ikeda authored
Replace av_find_stream_info with avformat_find_stream_info. Now requires libavformat 53.3.0 or newer.
-
- 29 Jul, 2011 4 commits
-
-
Henrik Gramner authored
-
Anton Mitrofanov authored
This makes them available to external programs as part of the public API.
-
Henrik Gramner authored
-
Yusuke Nakamura authored
Fix crash when failed to parse timecode file before malloc pts. Fix detection of user timebase considered to be exceeding H.264 maximum.
-