path: root/libavcodec/x86/dsputil_mmx.c
Commit message | Author | Age
* Enable SSE2 (put|avg)_pixels_16_sse2 | David Conrad | 2010-02-10
  SVQ1 chroma has been special-cased aligned to 16-bytes since at least r15466
  Other architectures also assume 16-byte alignment here too but set STRIDE_ALIGN to 16.
  Originally committed as revision 21736 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Implement an sse version of scalarproduct_float(). | Alex Converse | 2010-01-22
  Originally committed as revision 21386 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Move array specifiers outside DECLARE_ALIGNED() invocations | Måns Rullgård | 2010-01-22
  Originally committed as revision 21377 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Fix XvMC. XvMCCreateBlocks() may not allocate 16-byte aligned blocks, | Gwenole Beauchesne | 2010-01-04
  so we can't use SSE-optimized routines.
  Originally committed as revision 21011 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Get rid of pointless CONFIG_ANY_H263 preprocessor definition. | Diego Biurrun | 2009-12-30
  Originally committed as revision 20975 to svn://svn.ffmpeg.org/ffmpeg/trunk
* r20739 broke compilation on systems without yasm | Loren Merritt | 2009-12-05
  Originally committed as revision 20742 to svn://svn.ffmpeg.org/ffmpeg/trunk
* refactor and optimize scalarproduct | Loren Merritt | 2009-12-05
  29-105% faster apply_filter, 6-90% faster ape decoding on core2
  (Any x86 other than core2 probably gets much less, since this is mostly due to ssse3 cachesplit avoidance and I haven't written the full gamut of other cachesplit modes.)
  9-123% faster ape decoding on G4.
  Originally committed as revision 20739 to svn://svn.ffmpeg.org/ffmpeg/trunk
* port ape dsp functions from sse2 to mmx | Loren Merritt | 2009-12-03
  now requires yasm
  Originally committed as revision 20722 to svn://svn.ffmpeg.org/ffmpeg/trunk
* huffyuv: add some const qualifiers | Loren Merritt | 2009-10-18
  Originally committed as revision 20290 to svn://svn.ffmpeg.org/ffmpeg/trunk
* simd add_hfyu_left_prediction | Loren Merritt | 2009-10-18
  2.2x faster than C on conroe, 3.6x on penryn.
  4-6% faster huffyuv decoding if using left or plane mode and yuv
  Originally committed as revision 20287 to svn://svn.ffmpeg.org/ffmpeg/trunk
* cosmetics: fix indentation after previous commit | Måns Rullgård | 2009-09-27
  Originally committed as revision 20062 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Drop unused args from vector_fmul_add_add, simplify code, and rename | Måns Rullgård | 2009-09-27
  The src3 and step arguments to vector_fmul_add_add() are always zero and one, respectively. This removes these arguments from the function, simplifies the code accordingly, and renames the function to better match the new operation.
  Originally committed as revision 20061 to svn://svn.ffmpeg.org/ffmpeg/trunk
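The simplification described in the entry above can be sketched in scalar C. This is only an illustration under assumptions: the "before" shape (an additive src3 term and a step stride on the destination) is inferred from the commit text, and the names fmul_add_add_ref and fmul_add_ref are hypothetical, not the functions in dsputil.

```c
/* Hypothetical "before" shape: src3 is added to every element and dst is
 * written with a stride of step. Per the commit text, callers always
 * passed src3 = 0 and step = 1. */
static void fmul_add_add_ref(float *dst, const float *src0, const float *src1,
                             const float *src2, int src3, int len, int step)
{
    for (int i = 0; i < len; i++)
        dst[i * step] = src0[i] * src1[i] + src2[i] + src3;
}

/* Simplified "after" shape: with src3 == 0 and step == 1 the loop body
 * reduces to one multiply and one add per element. */
static void fmul_add_ref(float *dst, const float *src0, const float *src1,
                         const float *src2, int len)
{
    for (int i = 0; i < len; i++)
        dst[i] = src0[i] * src1[i] + src2[i];
}
```

With src3 = 0 and step = 1 both functions write the same values, which is why the two arguments could be dropped without changing behavior.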
* Mark "i" parameter of vector_clipf_sse() as early-clobber | Vitor Sessak | 2009-08-27
  Originally committed as revision 19731 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Mark parameter src of vector_clipf() as const | Vitor Sessak | 2009-08-27
  Originally committed as revision 19729 to svn://svn.ffmpeg.org/ffmpeg/trunk
* SSE optimized vector_clipf(). 10% faster TwinVQ decoding. | Vitor Sessak | 2009-08-27
  Originally committed as revision 19728 to svn://svn.ffmpeg.org/ffmpeg/trunk
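For readers unfamiliar with the operation being optimized, a scalar sketch of clipping a float vector to a range might look like the following. The name clipf_ref and its parameter order are assumptions made for illustration; this is not the SSE routine from the commit.

```c
/* Hypothetical scalar reference: clamp each input sample to [min, max].
 * src is const because the routine only reads it. */
static void clipf_ref(float *dst, const float *src, float min, float max,
                      int len)
{
    for (int i = 0; i < len; i++) {
        float v = src[i];
        if (v < min)
            v = min;
        else if (v > max)
            v = max;
        dst[i] = v;
    }
}
```

A SIMD version would typically process several floats per iteration with packed min/max instructions, which is where a speedup of the kind reported above would come from.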
* Do not check for both CONFIG_VC1_DECODER and CONFIG_WMV3_DECODER, | Diego Biurrun | 2009-07-29
  the former depends upon the latter.
  Originally committed as revision 19533 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Do not redundantly check for both CONFIG_THEORA_DECODER and CONFIG_VP3_DECODER. | Diego Biurrun | 2009-07-22
  The Theora decoder depends on the VP3 decoder.
  Originally committed as revision 19492 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Icc 11.1 still does not align the stack pointer, disable some x264 functions. | Carl Eugen Hoyos | 2009-07-17
  Originally committed as revision 19454 to svn://svn.ffmpeg.org/ffmpeg/trunk
* SSE version of clear_blocks | Jason Garrett-Glaser | 2009-06-16
  Originally committed as revision 19206 to svn://svn.ffmpeg.org/ffmpeg/trunk
* avg_ pixel functions need to use (dst+pix+1)>>1 to average with existing | David Conrad | 2009-04-15
  pixels, not (dst+pix)>>1. This makes the mmx functions bitexact with the C functions.
  Originally committed as revision 18527 to svn://svn.ffmpeg.org/ffmpeg/trunk
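The difference between the two formulas in the entry above shows up whenever dst + pix is odd. A minimal C sketch, with hypothetical helper names avg_trunc and avg_round that do not appear in the source file:

```c
#include <stdint.h>

/* Truncating average: an odd sum is rounded down. */
static inline uint8_t avg_trunc(uint8_t dst, uint8_t pix)
{
    return (uint8_t)((dst + pix) >> 1);
}

/* Rounding average: the +1 rounds an odd sum up. The operands are
 * promoted to int before the addition, so 255 + 255 + 1 cannot overflow. */
static inline uint8_t avg_round(uint8_t dst, uint8_t pix)
{
    return (uint8_t)((dst + pix + 1) >> 1);
}
```

For dst = 1 and pix = 2 the truncating form yields 1 and the rounding form yields 2, so two implementations that disagree on the formula cannot produce bit-identical output.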
* VC1: extend MMX qpel MC to include MMX2 avg qpel | David Conrad | 2009-04-15
  Originally committed as revision 18519 to svn://svn.ffmpeg.org/ffmpeg/trunk
* VC1: add and use avg_no_rnd chroma MC functions | David Conrad | 2009-04-14
  Originally committed as revision 18518 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Rename put_no_rnd_h264_chroma* to reflect its usage in VC1 only | David Conrad | 2009-04-14
  Originally committed as revision 18517 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Rename FF_MM_MMXEXT to FF_MM_MMX2, for both clarity and consistency | Stefano Sabatini | 2009-04-04
  with libswscale.
  Originally committed as revision 18330 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Mark line_skip3 asm argument as output-only instead of using av_uninit. | Reimar Döffinger | 2009-04-03
  Originally committed as revision 18327 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Mark put_signed_pixels_clamped_mmx output operands as early-clobber because | Reimar Döffinger | 2009-04-03
  they are. Hopefully fixes some FATE errors, too.
  Originally committed as revision 18326 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Use DECLARE_ASM_CONST for non-global ff_vector128 constant used via MANGLE | Reimar Döffinger | 2009-04-03
  Originally committed as revision 18325 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Rewrite put_signed_pixels_clamped_mmx() to eliminate mmx.h from dsputil_mmx.c. | Alex Converse | 2009-04-02
  Originally committed as revision 18319 to svn://svn.ffmpeg.org/ffmpeg/trunk
* add SSE2 version of vp6_filter_diag | Zuxy Meng | 2009-02-13
  original patch by Zuxy Meng zuxy.meng _at_ gmail _dot_ com
  Originally committed as revision 17195 to svn://svn.ffmpeg.org/ffmpeg/trunk
* add MMX version of vp6_filter_diag | Sebastien Lucas | 2009-02-12
  original patch by Sebastien Lucas sebastien.lucas _at_ gmail _dot_ com
  Originally committed as revision 17194 to svn://svn.ffmpeg.org/ffmpeg/trunk
* convert ff_pw_64 into an xmm_reg for future use in vp6 sse code | Aurelien Jacobs | 2009-02-12
  Originally committed as revision 17192 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Add check whether the compiler/assembler supports 10 or more operands. | Diego Biurrun | 2009-02-11
  thanks to Loren for some help with the asm statements
  Originally committed as revision 17151 to svn://svn.ffmpeg.org/ffmpeg/trunk
* ff_add_hfyu_median_prediction_mmx2 | Loren Merritt | 2009-02-08
  overall ffvhuff decoding speedup: 28% on core2, 25% on k8.
  Originally committed as revision 17059 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Workaround for gcc 3.4 to align sh properly | David Conrad | 2009-01-26
  Originally committed as revision 16797 to svn://svn.ffmpeg.org/ffmpeg/trunk
* cosmetics: Remove pointless period after copyright statement non-sentences. | Diego Biurrun | 2009-01-19
  Originally committed as revision 16684 to svn://svn.ffmpeg.org/ffmpeg/trunk
* replace all occurrence of ENABLE_ by the corresponding CONFIG_, HAVE_ or ARCH_ | Aurelien Jacobs | 2009-01-14
  and remove all ENABLE_ definitions.
  Originally committed as revision 16600 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Change semantic of CONFIG_*, HAVE_* and ARCH_*. | Aurelien Jacobs | 2009-01-13
  They are now always defined to either 0 or 1.
  Originally committed as revision 16590 to svn://svn.ffmpeg.org/ffmpeg/trunk
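One practical consequence of macros that are always defined to 0 or 1 is that they can be tested with #if or in an ordinary C condition, instead of with #ifdef. The sketch below assumes a made-up option CONFIG_EXAMPLE_DECODER and made-up function names; it does not reproduce any real configuration header.

```c
/* Hypothetical option: always defined, here to 0 (feature disabled). */
#define CONFIG_EXAMPLE_DECODER 0

/* Hypothetical init routine for the optional feature. */
static int example_decoder_init(void)
{
    return 42;
}

static int init_all(void)
{
    int n = 0;

    /* Plain C condition: the branch is still parsed and type-checked even
     * when the option is 0, and the compiler can discard it as dead code. */
    if (CONFIG_EXAMPLE_DECODER)
        n += example_decoder_init();

    /* Preprocessor form: #if works on the value, whereas #ifdef would be
     * true for both 0 and 1 because the macro is always defined. */
#if CONFIG_EXAMPLE_DECODER
    n += 1;
#endif

    return n;
}
```

With the option set to 0 as shown, init_all() returns 0; changing the definition to 1 enables both paths.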
* Add missing 'void' keyword to parameterless function declarations. | Diego Biurrun | 2009-01-05
  Originally committed as revision 16436 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Use H264 MMX chroma functions to accelerate RV40 decoding. | Mathieu Velten | 2009-01-04
  Patch by Mathieu Velten (matmaul A gmail)
  Originally committed as revision 16419 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Add x264 SSE2 iDCT functions to H.264 decoder. | Jason Garrett-Glaser | 2009-01-03
  Originally committed as revision 16409 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Fix h264 decoding on SSE2 cores with icc compilation. | Carl Eugen Hoyos | 2008-12-28
  Originally committed as revision 16373 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Fix compilation without optimization under 64-bit with x264 deblock asm enabled. | Jason Garrett-Glaser | 2008-12-26
  Originally committed as revision 16313 to svn://svn.ffmpeg.org/ffmpeg/trunk
* Rename libavcodec/i386/ --> libavcodec/x86/. | Diego Biurrun | 2008-12-22
  It contains optimizations that are not specific to i386 and libavutil uses this naming scheme already.
  Originally committed as revision 16270 to svn://svn.ffmpeg.org/ffmpeg/trunk