summaryrefslogtreecommitdiff
path: root/libavcodec/x86
Commit message (Collapse)AuthorAge
...
* H.264: Add more x86 assembly for 10-bit H.264 predict functionsDaniel Kang2011-07-13
| | | | | | | | Mainly ported from 8-bit H.264 predict. Some code ported from x264. LGPL ok by author. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* H.264: add filter_mb_fast support for >8-bit decodingJason Garrett-Glaser2011-07-11
| | | | Much faster high bit depth deblocking.
* dsputil: remove ff_emulated_edge_mc macro used in one placeMans Rullgard2011-07-10
| | | | | | | | | | This macro can cause problems in conjunction with the bitdepth template expansion. It was presumably added to keep source compatibility when high bitdepth support was added. However, emulated_edge_mc is a dsputil pointer and should not be called directly, so there is little reason to keep such a macro. Signed-off-by: Mans Rullgard <mans@mansr.com>
* H.264: Add x86 assembly for 10-bit H.264 predict functionsDaniel Kang2011-07-08
| | | | | | | | Mainly ported from 8-bit H.264 predict. Some code ported from x264. LGPL ok by author. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* YASM: Shut up unused variable compiler warning with --disable-yasm.Daniel Kang2011-07-04
| | | | Signed-off-by: Diego Biurrun <diego@biurrun.de>
* x86_32: Fix build on x86_32 with --disable-yasm.Daniel Kang2011-07-04
| | | | Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* Fix build with --disable-yasm.Daniel Kang2011-07-03
| | | | Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* H.264: Add x86 assembly for 10-bit H.264 qpel functions.Daniel Kang2011-07-03
| | | | | | | | Mainly ported from 8-bit H.264 qpel. Some code ported from x264. LGPL ok by author. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* ac3dsp: add x86-optimized versions of ac3dsp.extract_exponents().Justin Ruggles2011-07-01
|
* ac3enc: add int32_t array clipping function to DSPUtil, including x86 versions.Justin Ruggles2011-07-01
|
* configure: Drop check for availability of ten assembler operands.Diego Biurrun2011-06-28
| | | | | This was done to support gcc 2.95, which is an old legacy compiler that fails to compile the current codebase anyway.
* doxygen: Consistently use '@' instead of '\' for Doxygen markup.Diego Biurrun2011-06-24
| | | | Signed-off-by: Diego Biurrun <diego@biurrun.de>
* h264: Add x86 assembly for 10-bit weight/biweight H.264 functions.Daniel Kang2011-06-21
| | | | | | Mainly ported from 8-bit H.264 weight/biweight. Signed-off-by: Diego Biurrun <diego@biurrun.de>
* x86: cabac: fix register constraints for 32-bit modeMans Rullgard2011-06-20
| | | | | | | | Some operands need to be accessed in byte mode, which restricts the available registers in 32-bit mode. Using the 'q' constraint selects a suitable register. Signed-off-by: Mans Rullgard <mans@mansr.com>
* cabac: move x86 asm to libavcodec/x86/cabac.hMans Rullgard2011-06-20
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* x86: h264: cast pointers to intptr_t rather than intMans Rullgard2011-06-20
| | | | | | | Only the low-order bits are used here so the type is not important, but this avoids a compiler warning. Signed-off-by: Mans Rullgard <mans@mansr.com>
* x86: h264: remove hardcoded edi in decode_significance_8x8_x86()Mans Rullgard2011-06-20
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* x86: h264: remove hardcoded esi in decode_significance[_8x8]_x86()Mans Rullgard2011-06-20
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* x86: h264: remove hardcoded edx in decode_significance[_8x8]_x86()Mans Rullgard2011-06-20
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* x86: h264: remove hardcoded eax in decode_significance[_8x8]_x86()Mans Rullgard2011-06-20
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* x86: cabac: remove hardcoded ebx in inline asmMans Rullgard2011-06-20
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* x86: cabac: remove hardcoded struct offsets from inline asmMans Rullgard2011-06-20
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* Fix build if yasm is not available.Ronald S. Bultje2011-06-18
|
* H.264: Add x86 assembly for 10-bit MC Chroma H.264 functions.Daniel Kang2011-06-18
| | | | | | Mainly ported from 8-bit H.264 MC Chroma. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* 4:4:4 H.264 decoding supportJason Garrett-Glaser2011-06-13
| | | | Note: this is 4:4:4 from the 2007 spec revision, not the previous (now deprecated) 4:4:4 mode in H.264.
* Roll back 4:4:4 H.264 for nowJason Garrett-Glaser2011-06-13
| | | | Needs some ARM/PPC asm modifications.
* 4:4:4 H.264 decoding supportJason Garrett-Glaser2011-06-13
| | | | Note: this is 4:4:4 from the 2007 spec revision, not the previous (now deprecated) 4:4:4 mode in H.264.
* h264: Fix 10-bit H.264 x86 chroma v loopfilter asm.Oskar Arvidsson2011-06-10
| | | | | | The tc variable was not splatted correctly. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* H264: x86 predict init cosmetics.Daniel Kang2011-06-08
| | | | | | Change indentation and whitespace; also move HAVE_YASM blocks. Signed-off-by: Diego Biurrun <diego@biurrun.de>
* Add x86 assembly for some 10-bit H.264 intra predict functions.Daniel Kang2011-06-06
| | | | | | | Parts are inspired from the 8-bit H.264 predict code in Libav. Other parts ported from x264 with relicensing permission from author. Signed-off-by: Diego Biurrun <diego@biurrun.de>
* Cosmetic changes to h264_idct_10bit.asm.Loren Merritt2011-06-02
| | | | | | Removes redundant dword tags and whitespace changes. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* 2x faster h264_idct_add8_10.Loren Merritt2011-06-02
| | | | Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* h264: remove CONFIG_GPL from x86 intra prediction code.Ronald S. Bultje2011-06-02
| | | | | The authors permitted relicensing to LGPL a long time ago (Holger, Loren and Jason).
* h264/10bit: add HAVE_ALIGNED_STACK checks.Daniel Kang2011-05-31
| | | | | | | Fixes regression in 836f47d34b49e8ba9883e738a42f154130421caa in ICC-10.x, since ICC<=11.0 doesn't align stack upon function calls. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* Update 8-bit H.264 IDCT function names to reflect bit-depth.Daniel Kang2011-05-31
| | | | Signed-off-by: Ronald S. Bultje <rbultje@google.com>
* Add IDCT functions for 10-bit H.264.Daniel Kang2011-05-31
| | | | | | | | Ports the majority of IDCT functions for 10-bit H.264. Parts are inspired from 8-bit IDCT code in Libav; other parts ported from x264 with relicensing permission from author. Signed-off-by: Ronald S. Bultje <rbultje@google.com>
* ac3dsp: do not use the ff_* prefix when referencing ff_ac3_bap_bits.Justin Ruggles2011-05-28
| | | | | | this should fix the windows builds Signed-off-by: Martin Storsjö <martin@martin.st>
* ac3enc: modify mantissa bit counting to keep bap counts for all values of bapJustin Ruggles2011-05-28
| | | | | | instead of just 0 to 4. This does all the actual bit counting as a final step.
* x86: Add appropriate ifdefs around certain AVX functions.Diego Biurrun2011-05-27
| | | | | nasm versions prior to 2.09 have trouble assembling some of our AVX code. Protect these sections by preprocessor macros to allow compilation to pass.
* x86 asm: Add SECTION_TEXT to dct32_sse.asm.Dave Yeo2011-05-23
| | | | | | | This fixes the following error on OS/2: error: segment name `.text align=16' not recognized Signed-off-by: Diego Biurrun <diego@biurrun.de>
* dct32_sse: eliminate some spillsLoren Merritt2011-05-22
| | | | 125->104 cycles on penryn (x86_64 only)
* Fix dct32() compilation with --disable-yasmVitor Sessak2011-05-22
| | | | Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* dct32: Add AVX implementation of 32-point DCTVitor Sessak2011-05-21
|
* dct32: Change pass 6 permutation to allow for AVX implementationVitor Sessak2011-05-21
|
* dct32: port SSE 32-point DCT to YASMVitor Sessak2011-05-21
|
* multiple inclusion guard cleanupDiego Biurrun2011-05-21
| | | | | Add missing multiple inclusion guards; clean up #endif comments; add missing library prefixes; keep guard names consistent.
* Add support for a.out object format to assembler macros.Dave Yeo2011-05-20
| | | | | | This format is still used by e.g. OS/2. Signed-off-by: Diego Biurrun <diego@biurrun.de>
* mpegaudiodsp: fix x86 and ppc makefilesMans Rullgard2011-05-19
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* Move some mpegaudio functions to new mpegaudiodsp subsystemMans Rullgard2011-05-19
| | | | | | | | This separation allows these functions to be used in a cleaner fashion from other codecs (e.g. qdm2) and simplifies creating optimised versions of them. Signed-off-by: Mans Rullgard <mans@mansr.com>
* 10l: wrap float_interleave functions in HAVE_YASM.Justin Ruggles2011-05-18
| | | | fixes compilation with --disable-yasm