- 28 Jul, 2009 1 commit
-
-
Fiona Glaser authored
Cache chroma MC during refine_bidir_rd and use both the luma and chroma caches to skip MC in macroblock_encode. Fix incorrect call to rd_cost_part; refine_bidir_rd output was incorrect for i8>0. Remove some redundant clips. ~12% faster refine_bidir_rd.
-
- 27 Jul, 2009 2 commits
-
-
Fiona Glaser authored
It does what it says it does.
-
Fiona Glaser authored
fprofile settings now actually fprofile QPRD. Don't use i_mbrd before initializing it.
-
- 26 Jul, 2009 4 commits
-
-
Fiona Glaser authored
Trellis used wrong lambda with trellis=1
-
Fiona Glaser authored
Also add a few more checks to eliminate the need for spel_border.
-
Fiona Glaser authored
Refactor trellis lambda selection to be done in analyse_init instead of in trellis. This will allow easier adaptation of lambda later on; for now it allows constant lambda across variable QPs. QPRD is only available with adaptive quantization enabled and generally improves SSIM and visual quality. Additionally, weight the SSD values from RD based on the relative QP offset for chroma; this helps visually at high QPs where chroma has a lower QP than luma. This fixes some visual artifacts created by QPRD at high QPs. Note that this generally hurts PSNR and SSIM, and so is only on when psy-RD is on.
-
Fiona Glaser authored
Palignr-based solution for the most commonly used qpel function. 1-1.5% faster overall on Core 2 chips.
-
- 22 Jul, 2009 1 commit
-
-
Loren Merritt authored
-
- 20 Jul, 2009 1 commit
-
-
Anton Mitrofanov authored
"Auto-variance" uses log(var)^2 instead of log(var) and attempts to adapt strength per-frame. Generates significantly better SSIM; on by default with --tune ssim. Whether it generates visually better quality is still up for debate. Available as --aq-mode 2.
-
- 17 Jul, 2009 1 commit
-
-
Fiona Glaser authored
~70% faster chroma MC on 32-bit Conroe. Also slightly faster SSSE3 intra_sad_8x8c.
-
- 12 Jul, 2009 1 commit
-
-
Fiona Glaser authored
-
- 10 Jul, 2009 1 commit
-
-
Fiona Glaser authored
Faster lossless, cleaner code. SSSE3 version of zigzag_sub_4x4_field, faster lossless interlaced coding.
-
- 09 Jul, 2009 1 commit
-
-
James Darnley authored
For some types of input file, x264 did the adjustment before width/height were known.
-
- 07 Jul, 2009 3 commits
-
-
Fiona Glaser authored
Also add b-adapt 2 to fprofile.
-
Fiona Glaser authored
Eliminate some redundant shifts.
-
Fiona Glaser authored
Other new features include "tune" and "profile" settings; see --help for more details. Unlike most other settings, "preset" and "tune" act before all other options. However, "profile" acts afterwards, overriding all other options. Our defaults have also changed: new defaults are --subme 7 --bframes 3 --8x8dct --no-psnr --no-ssim --threads auto --ref 3 --mixed-refs --trellis 1 --weightb --crf 23 --progress. Users will hopefully find these changes to greatly improve usability.
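The ordering described above (preset and tune first, explicit options next, profile last) can be sketched as a small self-contained C model. This is not x264's actual option parser: the `settings_t` struct, the function names, and the `demo-fast` preset values are hypothetical. The default values come from the list above (CABAC being on by default is an added assumption), and the Baseline restrictions shown (no B-frames, no CABAC, no 8x8 transform) follow the H.264 profile definition. Tune is omitted for brevity; it would act at the same stage as the preset.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical subset of encoder settings, for illustration only. */
typedef struct {
    int subme;
    int bframes;
    int ref;
    int trellis;
    int dct8x8;
    int cabac;
} settings_t;

/* Defaults taken from the commit message; cabac = 1 is an assumption. */
static void settings_default(settings_t *s)
{
    assert(s != NULL);
    s->subme = 7;
    s->bframes = 3;
    s->ref = 3;
    s->trellis = 1;
    s->dct8x8 = 1;
    s->cabac = 1;
}

/* Step 1: a preset rewrites the defaults. The values here are invented. */
static void apply_preset(settings_t *s, const char *preset)
{
    if (preset && !strcmp(preset, "demo-fast")) {
        s->subme = 4;
        s->ref = 1;
        s->trellis = 0;
    }
}

/* Step 3: the profile is applied last and overrides anything that conflicts. */
static void apply_profile(settings_t *s, const char *profile)
{
    if (profile && !strcmp(profile, "baseline")) {
        s->bframes = 0;
        s->dct8x8 = 0;
        s->cabac = 0;
    }
}

/* user_bframes < 0 means the option was not given explicitly. */
static settings_t configure(const char *preset, int user_bframes,
                            const char *profile)
{
    settings_t s;
    settings_default(&s);
    apply_preset(&s, preset);          /* step 1: preset (and tune) */
    if (user_bframes >= 0)
        s.bframes = user_bframes;      /* step 2: explicit options */
    apply_profile(&s, profile);        /* step 3: profile */
    return s;
}
```

With this model, an explicit B-frame count survives the preset but is still forced to zero when a Baseline profile is requested, which is the behaviour the commit message describes.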
-
- 03 Jul, 2009 2 commits
-
-
Fiona Glaser authored
-
Fiona Glaser authored
Faster chroma encoding by terminating early if heuristics indicate that the block will be DC-only. This works because the vast majority of inter chroma blocks have no coefficients at all, and those that do are almost always DC-only. Add two new helper DSP functions for this: dct_dc_8x8 and var2_8x8. mmx/sse2/ssse3 versions of each. Early termination is disabled at very low QPs because it is not useful there. Performance increase is ~1-2% without trellis, up to 5-6% with trellis=2. The increase is greater at lower bitrates.
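A rough sketch of the kind of heuristic described above, in plain C. It is not the code from this commit: `residual_var_8x8` and `chroma_likely_dc_only` are hypothetical names, and the threshold is an assumed parameter that a real encoder would presumably derive from QP. The idea shown is that if the residual of an 8x8 chroma block has very little energy left once its mean (the DC part) is removed, the AC coefficients are unlikely to survive quantization.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical var2-style helper: accumulates the sum and the sum of squares
 * of the 8x8 residual (src - pred). Stores the SSD in *ssd and returns the
 * sum of squared deviations from the mean, i.e. 64 times the variance. */
static int residual_var_8x8(const uint8_t *src, int src_stride,
                            const uint8_t *pred, int pred_stride, int *ssd)
{
    int sum = 0, sqr = 0;
    assert(ssd != NULL);
    for (int y = 0; y < 8; y++) {
        for (int x = 0; x < 8; x++) {
            int d = src[y * src_stride + x] - pred[y * pred_stride + x];
            sum += d;
            sqr += d * d;
        }
    }
    *ssd = sqr;
    return sqr - (sum * sum) / 64;
}

/* Returns 1 when the non-DC energy is below the (assumed) threshold, so the
 * caller could skip the full transform and code at most a DC coefficient. */
static int chroma_likely_dc_only(const uint8_t *src, int src_stride,
                                 const uint8_t *pred, int pred_stride,
                                 int threshold)
{
    int ssd;
    int ac_energy = residual_var_8x8(src, src_stride, pred, pred_stride, &ssd);
    return ac_energy < threshold;
}
```

A block whose residual is a constant offset has zero non-DC energy and passes the check for any positive threshold, while a checkerboard residual of the same magnitude does not.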
-
- 26 Jun, 2009 3 commits
-
-
David Conrad authored
frame_init_lowres_core check didn't check the C plane. However, all x86 and PPC assembly was correct regardless of the unit test being incorrect.
-
Fiona Glaser authored
Improves sub-p8x8 mode decision.
-
Fiona Glaser authored
Also clean up a lot of pointless code duplication in CAVLC MV coding.
-
- 22 Jun, 2009 1 commit
-
-
Fiona Glaser authored
Faster CABAC CBF context calculation for inter blocks. Add x264_constant_p(), will probably be useful in the future as well. Simpler subpartition functions. Clean up and optimize mvd_cpn a bit more. Various other minor optimizations.
-
- 20 Jun, 2009 1 commit
-
-
David Wolstencroft authored
-
- 19 Jun, 2009 4 commits
-
-
Fiona Glaser authored
Faster CABAC mvd coding.
-
Fiona Glaser authored
Smaller code size, plus I get to use goto.
-
Fiona Glaser authored
ssim_end4_sse2 requires aligned sums
-
Fiona Glaser authored
The real reason I wrote this was an excuse to use shufpd.
-
- 11 Jun, 2009 2 commits
-
-
Mike Frysinger authored
-
Loren Merritt authored
-
- 27 May, 2009 4 commits
-
-
Loren Merritt authored
configure check for -mno-cygwin, since it's removed from gcc4.
-
Loren Merritt authored
2-4% faster dia, hex, and umh.
-
Loren Merritt authored
This change is useless on its own, but segregates the bitstream-changing part out of my next optimization.
-
Loren Merritt authored
x264 will now correctly warn about maxrate specified without bufsize even when a level is not set.
-
- 25 May, 2009 1 commit
-
-
Loren Merritt authored
-
- 24 May, 2009 5 commits
-
-
Fiona Glaser authored
Broke --me esa/tesa due to forgetting to add handling for x264_cost_mv_fpel.
-
Fiona Glaser authored
Faster subme<=1 with i4x4 enabled.
-
Fiona Glaser authored
-
Fiona Glaser authored
Have quantizers that use the same lambda share the same cost array.
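The sharing idea above can be illustrated with a small C sketch. Everything here is hypothetical (the `demo_` names, the table sizes, the qp-to-lambda map, and the placeholder cost model); it only shows the general technique of keying lazily built cost arrays by lambda rather than by quantizer, so that quantizers mapping to the same lambda get the same array.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

#define DEMO_QP_COUNT   8
#define DEMO_MAX_LAMBDA 4
#define DEMO_COST_LEN   16

/* Hypothetical qp -> lambda map in which several low QPs share lambda = 1. */
static const int demo_lambda_tab[DEMO_QP_COUNT] = { 1, 1, 1, 1, 2, 2, 3, 4 };

/* One lazily built cost array per distinct lambda, not one per QP.
 * The arrays are never freed in this sketch. */
static int *demo_cost_by_lambda[DEMO_MAX_LAMBDA + 1];

/* Returns the cost array for a quantizer, building it on first use.
 * Returns NULL for an out-of-range qp or on allocation failure. */
static int *demo_cost_table(int qp)
{
    if (qp < 0 || qp >= DEMO_QP_COUNT)
        return NULL;

    int lambda = demo_lambda_tab[qp];
    assert(lambda >= 0 && lambda <= DEMO_MAX_LAMBDA);

    if (!demo_cost_by_lambda[lambda]) {
        int *t = malloc(DEMO_COST_LEN * sizeof(*t));
        if (!t)
            return NULL;
        /* Placeholder cost model: lambda times a stand-in bit count. */
        for (int i = 0; i < DEMO_COST_LEN; i++)
            t[i] = lambda * i;
        demo_cost_by_lambda[lambda] = t;
    }
    return demo_cost_by_lambda[lambda];
}
```

In this model quantizers 0 through 3 all resolve to the same pointer, so the table is built and stored once instead of four times.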
-
Fiona Glaser authored
Backport CAVLC partial-inlining early termination to CABAC (~2-4% faster CABAC residual coding)
-
- 19 May, 2009 1 commit
-
-
Loren Merritt authored
-