summaryrefslogtreecommitdiff
path: root/libavcodec/x86
Commit message (Expand)AuthorAge
* Revert r24931, it broke Win32 and some BSD compiles (yay fate).Ronald S. Bultje2010-08-25
* Mark xmm6 and xmm7 as clobbered in ff_vp3_idct_sse2(), which is contributingRonald S. Bultje2010-08-25
* VP6: fix vp6_filter_diag4_mmx/sse on 64-bitMåns Rullgård2010-08-25
* Move vp6_filter_diag4() x86 SIMD code from inline ASM to YASM. This shouldRonald S. Bultje2010-08-25
* Move vp6_filter_diag4() from DSPContext to VP56DSPContext.Ronald S. Bultje2010-08-25
* Remove global mm_flags variableMåns Rullgård2010-08-24
* Mark xmm registers as clobbered in simple loopfilter. Should fix the lastRonald S. Bultje2010-08-24
* imdct/x86: Use "s->mdct_size" instead of "1 << s->mdct_bits".Alex Converse2010-08-23
* Fix segfaults in VP8 SIMD code on Win64 (and FATE/win64 failures).Ronald S. Bultje2010-08-23
* Convert ff_imdct_half_sse() to yasm.Alex Converse2010-08-22
* VP5/6/8: ~7% faster arithmetic decodingJason Garrett-Glaser2010-08-12
* Split h264dsp and h264pred in configure.Jason Garrett-Glaser2010-08-07
* Add file missing in r24702Jason Garrett-Glaser2010-08-05
* H.264: SSE2/SSSE3 weighted prediction asmEli Friedman2010-08-05
* Move cavs dsp functions to their own structMåns Rullgård2010-08-03
* VP5/6/8: add one inline missed in r24677Jason Garrett-Glaser2010-08-03
* VP8: move zeroing of luma DC block into the WHTJason Garrett-Glaser2010-08-02
* Use word-writing instead of dword-writing (with two cached but otherwiseRonald S. Bultje2010-07-31
* Remove x86/mmx.h. It is not used anymore and has been deprecated for years.Vitor Sessak2010-07-31
* Convert deinterlacing MMX code to YASMVitor Sessak2010-07-31
* Fix compilation in x86_64. I broke it with r24580.Vitor Sessak2010-07-29
* Translate libmpeg2 MMX IDCT to plain asmVitor Sessak2010-07-29
* Use pmaddubsw for the mbedge_filter (>=ssse3), 6-10 cycles faster.Ronald S. Bultje2010-07-26
* VP8: Much faster SSE2 MCJason Garrett-Glaser2010-07-26
* Enable no-loop memory/register saving for ssse3/sse4 also.Ronald S. Bultje2010-07-26
* Save a register (or regsize of stackspace for x86-32) for the no-loopRonald S. Bultje2010-07-26
* Use nested ifs instead of &&, which appears to not work with %ifidn (i.e. thisRonald S. Bultje2010-07-26
* Split pextrw macro-spaghetti into several opt-specific macros, this will makeRonald S. Bultje2010-07-26
* Fix obvious bug in assignment. Somehow, the test vectors don't test this...Ronald S. Bultje2010-07-25
* Fix SPLATB_REG mess. Used to be a if/elseif/elseif/elseif spaghetti, so thisRonald S. Bultje2010-07-24
* Inline asm for VP56 arith coderEli Friedman2010-07-23
* VP8: optimize DC-only chroma case in the same way as luma.Jason Garrett-Glaser2010-07-23
* VP8 asm: cosmetics (spacing)Jason Garrett-Glaser2010-07-23
* VP8: 30% faster idct_mbJason Garrett-Glaser2010-07-23
* VP8: clear DCT blocks in iDCT instead of using clear_blocks.Jason Garrett-Glaser2010-07-23
* Use pextrw for SSE4 mbedge filter result writing, speedup 5-10cycles onRonald S. Bultje2010-07-22
* Fix and enable horizontal >=SSE2 mbedge loopfilter.Ronald S. Bultje2010-07-22
* relicense h264 deblock sse2 to lgplLoren Merritt2010-07-22
* sync yasm macros from x264Loren Merritt2010-07-21
* Eliminate one instruction in VP8 dc_add_sse4Jason Garrett-Glaser2010-07-21
* Various VP8 x86 deblocking speedupsJason Garrett-Glaser2010-07-21
* Make mmx VP8 WHT fasterJason Garrett-Glaser2010-07-21
* Add header declarations for mmx/sse constants missing themDavid Conrad2010-07-21
* Move ff_pw_* from vc1dsp_mmx.c to dsputil_mmx.cDavid Conrad2010-07-21
* VP8 MBedge loopfilter MMX/MMX2/SSE2 functions for both luma (width=16)Ronald S. Bultje2010-07-20
* Chroma (width=8) inner loopfilter MMX/MMX2/SSE2 for VP8 decoder.Ronald S. Bultje2010-07-20
* Revert r24339 (it causes fate failures on x86-64) - I'll figure out what'sRonald S. Bultje2010-07-19
* Remove FF_MM_SSE2/3 flags for CPUs where this is generally not faster thanRonald S. Bultje2010-07-19
* Implement chroma (width=8) inner loopfilter MMX/MMX2/SSE2 functions.Ronald S. Bultje2010-07-19
* Be more efficient with registers or stack memory. Saves 8/16 bytes stackRonald S. Bultje2010-07-19