summaryrefslogtreecommitdiff
path: root/libavcodec/x86
Commit message (Collapse)AuthorAge
* 4:4:4 H.264 decoding supportJason Garrett-Glaser2011-06-13
| | | | Note: this is 4:4:4 from the 2007 spec revision, not the previous (now deprecated) 4:4:4 mode in H.264.
* Roll back 4:4:4 H.264 for nowJason Garrett-Glaser2011-06-13
| | | | Needs some ARM/PPC asm modifications.
* 4:4:4 H.264 decoding supportJason Garrett-Glaser2011-06-13
| | | | Note: this is 4:4:4 from the 2007 spec revision, not the previous (now deprecated) 4:4:4 mode in H.264.
* h264: Fix 10-bit H.264 x86 chroma v loopfilter asm.Oskar Arvidsson2011-06-10
| | | | | | The tc variable was not splatted correctly. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* H264: x86 predict init cosmetics.Daniel Kang2011-06-08
| | | | | | Change indentation and whitespace; also move HAVE_YASM blocks. Signed-off-by: Diego Biurrun <diego@biurrun.de>
* Add x86 assembly for some 10-bit H.264 intra predict functions.Daniel Kang2011-06-06
| | | | | | | Parts are inspired from the 8-bit H.264 predict code in Libav. Other parts ported from x264 with relicensing permission from author. Signed-off-by: Diego Biurrun <diego@biurrun.de>
* Cosmetic changes to h264_idct_10bit.asm.Loren Merritt2011-06-02
| | | | | | Removes redundant dword tags and whitespace changes. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* 2x faster h264_idct_add8_10.Loren Merritt2011-06-02
| | | | Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* h264: remove CONFIG_GPL from x86 intra prediction code.Ronald S. Bultje2011-06-02
| | | | | The authors permitted relicensing to LGPL a long time ago (Holger, Loren and Jason).
* h264/10bit: add HAVE_ALIGNED_STACK checks.Daniel Kang2011-05-31
| | | | | | | Fixes regression in 836f47d34b49e8ba9883e738a42f154130421caa in ICC-10.x, since ICC<=11.0 doesn't align stack upon function calls. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* Update 8-bit H.264 IDCT function names to reflect bit-depth.Daniel Kang2011-05-31
| | | | Signed-off-by: Ronald S. Bultje <rbultje@google.com>
* Add IDCT functions for 10-bit H.264.Daniel Kang2011-05-31
| | | | | | | | Ports the majority of IDCT functions for 10-bit H.264. Parts are inspired from 8-bit IDCT code in Libav; other parts ported from x264 with relicensing permission from author. Signed-off-by: Ronald S. Bultje <rbultje@google.com>
* ac3dsp: do not use the ff_* prefix when referencing ff_ac3_bap_bits.Justin Ruggles2011-05-28
| | | | | | this should fix the windows builds Signed-off-by: Martin Storsjö <martin@martin.st>
* ac3enc: modify mantissa bit counting to keep bap counts for all values of bapJustin Ruggles2011-05-28
| | | | | | instead of just 0 to 4. This does all the actual bit counting as a final step.
* x86: Add appropriate ifdefs around certain AVX functions.Diego Biurrun2011-05-27
| | | | | nasm versions prior to 2.09 have trouble assembling some of our AVX code. Protect these sections by preprocessor macros to allow compilation to pass.
* x86 asm: Add SECTION_TEXT to dct32_sse.asm.Dave Yeo2011-05-23
| | | | | | | This fixes the following error on OS/2: error: segment name `.text align=16' not recognized Signed-off-by: Diego Biurrun <diego@biurrun.de>
* dct32_sse: eliminate some spillsLoren Merritt2011-05-22
| | | | 125->104 cycles on penryn (x86_64 only)
* Fix dct32() compilation with --disable-yasmVitor Sessak2011-05-22
| | | | Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* dct32: Add AVX implementation of 32-point DCTVitor Sessak2011-05-21
|
* dct32: Change pass 6 permutation to allow for AVX implementationVitor Sessak2011-05-21
|
* dct32: port SSE 32-point DCT to YASMVitor Sessak2011-05-21
|
* multiple inclusion guard cleanupDiego Biurrun2011-05-21
| | | | | Add missing multiple inclusion guards; clean up #endif comments; add missing library prefixes; keep guard names consistent.
* Add support for a.out object format to assembler macros.Dave Yeo2011-05-20
| | | | | | This format is still used by e.g. OS/2. Signed-off-by: Diego Biurrun <diego@biurrun.de>
* mpegaudiodsp: fix x86 and ppc makefilesMans Rullgard2011-05-19
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* Move some mpegaudio functions to new mpegaudiodsp subsystemMans Rullgard2011-05-19
| | | | | | | | This separation allows these functions to be used in a cleaner fashion from other codecs (e.g. qdm2) and simplifies creating optimised versions of them. Signed-off-by: Mans Rullgard <mans@mansr.com>
* 10l: wrap float_interleave functions in HAVE_YASM.Justin Ruggles2011-05-18
| | | | fixes compilation with --disable-yasm
* Add float_interleave() to FmtConvertContext with x86-optimized versions.Justin Ruggles2011-05-18
| | | | | Partially based on patches by clsid2 in ffdshow-tryout. ff_float_interleave6() x86 improvements by Loren Merrit.
* Modify x86util.asm to ease transitioning to 10-bit H.264 assembly.Daniel Kang2011-05-17
| | | | | | | Arguments for variable size instructions are added to many macros, along with other various changes. The x86util.asm code was ported from x264. Signed-off-by: Diego Biurrun <diego@biurrun.de>
* h264dsp_mmx: Add #ifdefs around some mmxext functions on x86_64.Gil Pedersen2011-05-16
| | | | | | This fixes linking errors due to undefined symbols on x86_64 OS X. Signed-off-by: Diego Biurrun <diego@biurrun.de>
* Fix FSF address copy paste error in some license headers.Diego Biurrun2011-05-14
|
* 10-bit H.264 x86 chroma v loopfilter asmJason Garrett-Glaser2011-05-11
| | | | Also delete some unused deblock asm macros.
* Port x86 10-bit H.264 deblock asm from x264Jason Garrett-Glaser2011-05-10
|
* Update x86 H.264 deblock asmJason Garrett-Glaser2011-05-10
| | | | Includes AVX versions from x264.
* h264dsp_mmx: place bracket outside #if/#endif block.Ronald S. Bultje2011-05-10
| | | | Should fix compile on systems missing yasm/nasm.
* Adds 8-, 9- and 10-bit versions of some of the functions used by the h264 ↵Oskar Arvidsson2011-05-10
| | | | | | | | | | | | | | | | | decoder. This patch lets e.g. dsputil_init chose dsp functions with respect to the bit depth to decode. The naming scheme of bit depth dependent functions is <base name>_<bit depth>[_<prefix>] (i.e. the old clear_blocks_c is now named clear_blocks_8_c). Note: Some of the functions for high bit depth is not dependent on the bit depth, but only on the pixel size. This leaves some room for optimizing binary size. Preparatory patch for high bit depth h264 decoding support. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* Remove disabled non-optimized code variants.Diego Biurrun2011-04-29
|
* Add AVX FFT implementation.Vitor Sessak2011-04-26
| | | | Signed-off-by: Reinhard Tartler <siretart@tauware.de>
* Update x86inc.asm from x264 to allow AVX emulation using SSE and MMX.Vitor Sessak2011-04-26
| | | | Signed-off-by: Reinhard Tartler <siretart@tauware.de>
* dsputil: allow to skip drawing of top/bottom edges.Alexander Strange2011-03-26
|
* Add apply_window_int16() to DSPContext with x86-optimized versions and use itJustin Ruggles2011-03-22
| | | | in the ac3_fixed encoder.
* Move dct and rdft definitions to separate filesMans Rullgard2011-03-20
| | | | | | | This leaves fft.h with only the core FFT and MDCT definitions thus making it more managable. Signed-off-by: Mans Rullgard <mans@mansr.com>
* Replace FFmpeg with Libav in licence headersMans Rullgard2011-03-19
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* ac3enc: add float_to_fixed24() with x86-optimized versions to AC3DSPContextJustin Ruggles2011-03-17
| | | | and use in scale_coefficients() for the floating-point AC-3 encoder.
* mathops: fix MULL() when the compiler does not inline the function.Justin Ruggles2011-03-15
| | | | | | | If the function is not inlined, an immmediate cannot be used for the shift parameter, so the %cl register must be used instead in that case. This fixes compilation for x86-32 using gcc with --disable-optimizations.
* mathops: change "g" constraint to "rm" in x86-32 version of MUL64().Justin Ruggles2011-03-15
| | | | | The 1-arg imul instruction cannot take an immediate argument, only a register or memory argument.
* mathops: convert MULL/MULH/MUL64 to inline functions rather than macros.Justin Ruggles2011-03-15
| | | | | | This fixes unexpected name collisions that were occurring with variables declared within the macros. It also fixes the fate-acodec-ac3_fixed regression test on x86-32.
* ac3enc: add SIMD-optimized shifting functions for use with the fixed-point ↵Justin Ruggles2011-03-14
| | | | AC3 encoder.
* Add CONFIG_AC3DSP symbol to simplify makefilesMans Rullgard2011-03-12
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* dsputil_mmx.c: remove ff_vector128.Ronald S. Bultje2011-02-19
| | | | Remove ff_vector128, it is identical to ff_pb_80.
* dsputil: move VC1-specific stuff into VC1DSPContext.Ronald S. Bultje2011-02-17
|