1. 09 Nov, 2009 7 commits
    • Dylan Yudaken's avatar
      Weighted P-frame prediction · ccac8546
      Dylan Yudaken authored
      Merge Dylan's Google Summer of Code 2009 tree.
      Detect fades and use weighted prediction to improve compression and quality.
      "Blind" mode provides a small overall quality increase by using a -1 offset without doing any analysis, as described in JVT-AB033.
      "Smart", the default mode, also performs fade detection and decides weights accordingly.
      MB-tree takes into account the effects of "smart" analysis in lookahead, even further improving quality in fades.
      If psy is on, mbtree is on, interlaced is off, and weightp is off, fade detection will still be performed.
      However, it will be used to adjust quality instead of create actual weights.
      This will improve quality in fades when encoding in Baseline profile.
      
      Doesn't add support for interlaced encoding with weightp yet.
      Only adds support for luma weights, not chroma weights.
      Internal code for chroma weights is in, but there's no analysis yet.
      Baseline profile requires that weightp be off.
      All weightp modes may cause minor breakage in non-compliant decoders that take shortcuts in deblocking reference frame checks.
      "Smart" may cause serious breakage in non-compliant decoders that take shortcuts in handling of duplicate reference frames.
      
      Thanks to Google for sponsoring our most successful Summer of Code yet!
      ccac8546
    • Steven Walters's avatar
    • David Conrad's avatar
      Various ARM-related fixes · 53a5772a
      David Conrad authored
      Fix comment for mc_copy_neon.
      Fix memzero_aligned_neon prototype.
      Update NEON (i)dct_dc prototypes.
      Duplicate x86 behavior for global+hidden functions.
      53a5772a
    • Fiona Glaser's avatar
      Fix miscompilation with gcc 4.3 on ARM · 30b3825e
      Fiona Glaser authored
      Aliasing violation in spatial prediction caused nasty artifacts.
      Shut up two other GCC warnings while we're at it.
      30b3825e
    • Anton Mitrofanov's avatar
      Fix large file support, broken in r1302 · f3c9e6f3
      Anton Mitrofanov authored
      f3c9e6f3
    • Fiona Glaser's avatar
      Dramatically reduce size of pixel_ssd_* asm functions · 99cf5bf6
      Fiona Glaser authored
      ~10k of code size eliminated.
      99cf5bf6
    • Loren Merritt's avatar
      fix bottom-right pixel of lowres planes, which was uninitialized. · 3ddc66cc
      Loren Merritt authored
      weirdly, valgrind reported this only with --no-asm.
      3ddc66cc
  2. 29 Oct, 2009 2 commits
  3. 25 Oct, 2009 5 commits
  4. 19 Oct, 2009 1 commit
    • Lamont Alston's avatar
      Make B-pyramid spec-compliant · cf5ba813
      Lamont Alston authored
      The rules of the specification with regard to picture buffering for pyramid coding are widely ignored.
      x264's b-pyramid implementation, despite being practically identical to that proposed by the original paper, was technically not compliant.
      Now it is.
      Two modes are now available:
      1) strict b-pyramid, while worse for compression, follows the rule mandated by Blu-ray (no P-frames can reference B-frames)
      2) normal b-pyramid, which is like the old mode except fully compliant.
      This patch also adds MMCO support (necessary for compliant pyramid in some cases).
      MB-tree still doesn't support b-pyramid (but will soon).
      cf5ba813
  5. 18 Oct, 2009 1 commit
  6. 12 Oct, 2009 3 commits
    • Loren Merritt's avatar
      change all dct arrays to 1d. · 1fbba0ca
      Loren Merritt authored
      the C standard doesn't allow you to iterate 1-dimensionally over 2d arrays, and nothing other than the dsp functions themselves cares about the 2dness of dct.
      this fixes a miscompilation in x264_mb_optimize_chroma_dc.
      1fbba0ca
    • Anton Mitrofanov's avatar
      Optimize exp2fix8 · 1a1b9c6f
      Anton Mitrofanov authored
      Slightly faster and more accurate rounding.
      1a1b9c6f
    • Fiona Glaser's avatar
      Avoid scenecuts in flashes and similar situations · c695f524
      Fiona Glaser authored
      "Flashes" are defined as any scene which lasts a very short period before a previous scene returns.
      A common example of this is of course a camera flash.
      Accordingly, look ahead during scenecut analysis and rule out the possibility of certain frames being scenecuts.
      Also handles cases of tons of short scenes in sequence and avoids making those scenecuts as well.
      Can only catch flashes of 1 frame in length with b-adapt 1.
      With b-adapt 2, can catch flashes of length --bframes.
      Speed cost should be negligible.
      c695f524
  7. 07 Oct, 2009 3 commits
  8. 23 Sep, 2009 1 commit
  9. 21 Sep, 2009 2 commits
    • Fiona Glaser's avatar
      Add intra prediction modes to output stats · bbf573c7
      Fiona Glaser authored
      Also eliminate some NANs in stat output with intra-only encoding.
      Marginal speedup: disable stat calculation if log level is below X264_LOG_INFO.
      Various minor cosmetics.
      bbf573c7
    • Fiona Glaser's avatar
      Major API change: encapsulate NALs within libx264 · 7a0fbed7
      Fiona Glaser authored
      libx264 now returns NAL units instead of raw data.  x264_nal_encode is no longer a public function.
      See x264.h for full documentation of changes.
      New parameter: b_annexb, on by default.  If disabled, startcodes are replaced by sizes as in mp4.
      x264's VBV now works on a NAL level, taking into account escape codes.
      VBV will also take into account the bit cost of SPS/PPS, but only if b_repeat_headers is set.
      Add an overhead tracking system to VBV to better predict the constant overhead of frames (headers, NALU overhead, etc).
      7a0fbed7
  10. 14 Sep, 2009 1 commit
    • Fiona Glaser's avatar
      Make MV costs global instead of static · b1eac265
      Fiona Glaser authored
      Fixes some extremely rare threading race conditions and makes the code cleaner.
      Downside: slightly higher memory usage when calling multiple encoders from the same application.
      b1eac265
  11. 02 Sep, 2009 3 commits
    • David Conrad's avatar
      Compile fixes for pre-ARMv6T2 and/or PIC · de4c39b7
      David Conrad authored
      de4c39b7
    • Steven Walters's avatar
      Change priority handling on some OSs · bc120190
      Steven Walters authored
      Instead of setting the lookahead thread to max priority, lower all the other threads' priorities instead.
      This is particularly useful when the "max priority" is "realtime", as in Windows, which can cause some problems.
      bc120190
    • Steven Walters's avatar
      Threaded lookahead · 6940dcae
      Steven Walters authored
      Move lookahead into a separate thread, set to higher priority than the other threads, for optimal performance.
      Reduces the amount that lookahead bottlenecks encoding, greatly increasing performance with lookahead-intensive settings (e.g. b-adapt 2) on many-core CPUs.
      Buffer size can be controlled with --sync-lookahead, which defaults to auto (threads+bframes buffer size).
      Note that this buffer is separate from the rc-lookahead value.
      Note also that this does not split lookahead itself into multiple threads yet; this may be added in the future.
      Additionally, split frames into "fdec" and "fenc" frame types and keep the two separate.
      This split greatly reduces memory usage, which helps compensate for the larger lookahead size.
      Extremely special thanks to Michael Kazmier and Alex Giladi of Avail Media, the original authors of this patch.
      6940dcae
  12. 31 Aug, 2009 1 commit
    • Fiona Glaser's avatar
      Multi-slice encoding support · 4ccbb199
      Fiona Glaser authored
      Slicing support is available through three methods (which can be mixed):
      --slices sets a number of slices per frame and ensures rectangular slices (required for Blu-ray).  Overridden by either of the following options:
      --slice-max-mbs sets a maximum number of macroblocks per slice.
      --slice-max-size sets a maximum slice size, in bytes (includes NAL overhead).
      Implement macroblock re-encoding support to allow highly accurate slice size limitation.  Might be useful for other things in the future, too.
      4ccbb199
  13. 28 Aug, 2009 1 commit
  14. 27 Aug, 2009 2 commits
  15. 24 Aug, 2009 6 commits
  16. 23 Aug, 2009 1 commit
    • David Conrad's avatar
      GSOC merge part 2: ARM stack alignment · ca7da1ae
      David Conrad authored
      Neither GCC nor ARMCC support 16 byte stack alignment despite the fact that NEON loads require it.
      These macros only work for arrays, but fortunately that covers almost all instances of stack alignment in x264.
      ca7da1ae