libav.git - [no description]

	Commit message (Collapse)	Author	Age
*	Update Fiona's name in copyright statements.	Diego Biurrun	2014-07-01
\|
*	x86: Make function prototype comments in assembly code consistent	Diego Biurrun	2014-03-13
\| \| \| \|	This helps grepping for functions, among other things.
*	x86: vp8dsp: Split loopfilter code into a separate file	Diego Biurrun	2013-11-01
\|
*	dsputil: Make dsputil selectable	Ronald S. Bultje	2013-04-10
\| \| \| \|	Signed-off-by: Martin Storsjö <martin@martin.st>
*	Drop DCTELEM typedef	Diego Biurrun	2013-01-22
\| \| \| \| \| \|	It does not help as an abstraction and adds dsputil dependencies. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
*	x86inc: support stack mem allocation and re-alignment in PROLOGUE	Ronald S. Bultje	2012-12-12
\| \| \| \| \| \| \|	Use this in VP8/H264-8bit loopfilter functions so they can be used if there is no aligned stack (e.g. MSVC 32bit or ICC 10.x). Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
*	x86: mmx2 ---> mmxext in asm constructs	Diego Biurrun	2012-11-14
\|
*	x86: yasm: Use complete source path for macro helper %includes	Diego Biurrun	2012-10-31
\| \| \| \| \|	This is more consistent with the way we handle C #includes and it simplifies the build system.
*	x86: include x86inc.asm in x86util.asm	Diego Biurrun	2012-10-31
\| \| \| \|	This is necessary to allow refactoring some x86util macros with cpuflags.
*	x86: add colons after labels	Mans Rullgard	2012-08-07
\| \| \| \| \| \|	nasm prints a warning if the colon is missing. Signed-off-by: Mans Rullgard <mans@mansr.com>
*	x86inc: add SPLATB_LOAD, SPLATB_REG, PSHUFLW macros	Loren Merritt	2012-07-05
\| \| \| \|	Signed-off-by: Diego Biurrun <diego@biurrun.de>
*	vp8dsp x86: perform rounding shift with a single instruction	Christophe GISQUET	2012-04-04
\| \| \| \|	Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
*	vp8: convert mbedge loopfilter x86 assembly to use named arguments.	Ronald S. Bultje	2012-03-10
\|
*	vp8: convert inner loopfilter x86 assembly to use named arguments.	Ronald S. Bultje	2012-03-10
\|
*	vp8: convert simple loopfilter x86 assembly to use named arguments.	Ronald S. Bultje	2012-03-03
\|
*	vp8: convert idct x86 assembly to use named arguments.	Ronald S. Bultje	2012-03-03
\|
*	vp8: convert mc x86 assembly to use named arguments.	Ronald S. Bultje	2012-03-03
\|
*	vp8: convert loopfilter x86 assembly to use cpuflags().	Ronald S. Bultje	2012-03-03
\|
*	vp8: convert idct/mc x86 assembly to use cpuflags().	Ronald S. Bultje	2012-03-03
\|
*	vp8: disable mmx functions with sse/sse2 counterparts on x86-64.	Ronald S. Bultje	2012-03-02
\| \| \| \| \|	x86-64 is guaranteed to have at least SSE2, therefore the MMX/MMX2 functions will never be used in practice.
*	Fix NASM include directive	Dave Yeo	2011-08-15
\| \| \| \|	Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
*	Move x86util.asm from libavcodec/ to libavutil/.	Ronald S. Bultje	2011-08-12
\| \| \| \|	This allows using it in swscale also.
*	Move x86inc.asm to libavutil/.	Ronald S. Bultje	2011-08-12
\| \| \| \|	This allows using it in libswscale/ also.
*	Modify x86util.asm to ease transitioning to 10-bit H.264 assembly.	Daniel Kang	2011-05-17
\| \| \| \| \| \| \|	Arguments for variable size instructions are added to many macros, along with other various changes. The x86util.asm code was ported from x264. Signed-off-by: Diego Biurrun <diego@biurrun.de>
*	Fix FSF address copy paste error in some license headers.	Diego Biurrun	2011-05-14
\|
*	Replace FFmpeg with Libav in licence headers	Mans Rullgard	2011-03-19
\| \| \| \|	Signed-off-by: Mans Rullgard <mans@mansr.com>
*	Use "d" suffix for general-purpose registers used with movd.	Reimar Döffinger	2010-09-05
\| \| \| \| \| \| \| \|	This increases compatibilty with nasm and is also more consistent, e.g. with h264_intrapred.asm and h264_chromamc.asm that already do it that way. Originally committed as revision 25042 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	Mark xmm registers as clobbered in simple loopfilter. Should fix the last	Ronald S. Bultje	2010-08-24
\| \| \| \| \| \|	two VP8-related fate failures on Win64. Originally committed as revision 24908 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	Fix segfaults in VP8 SIMD code on Win64 (and FATE/win64 failures).	Ronald S. Bultje	2010-08-23
\| \| \| \|	Originally committed as revision 24871 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	VP8: move zeroing of luma DC block into the WHT	Jason Garrett-Glaser	2010-08-02
\| \| \| \| \| \| \|	Lets us do the zeroing in asm instead of C. Also makes it consistent with the way the regular iDCT code does it. Originally committed as revision 24668 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	Use word-writing instead of dword-writing (with two cached but otherwise	Ronald S. Bultje	2010-07-31
\| \| \| \| \| \| \| \| \| \|	unchanged bytes) in the horizontal simple loopfilter. This makes the filter quite a bit faster in itself (~30 cycles less on Core1), probably mostly because we don't need a complex 4x4 transpose, but only a simple byte interleave. Also allows using pextrw on SSE4, which speeds up even more (e.g. 25% faster on Core i7). Originally committed as revision 24638 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	Use pmaddubsw for the mbedge_filter (>=ssse3), 6-10 cycles faster.	Ronald S. Bultje	2010-07-26
\| \| \| \|	Originally committed as revision 24514 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	VP8: Much faster SSE2 MC	Jason Garrett-Glaser	2010-07-26
\| \| \| \| \| \| \|	5-10% faster or more on Phenom, Athlon 64, and some others. Helps some on pre-SSSE3 Intel chips as well, but not as much. Originally committed as revision 24513 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	Enable no-loop memory/register saving for ssse3/sse4 also.	Ronald S. Bultje	2010-07-26
\| \| \| \|	Originally committed as revision 24511 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	Save a register (or regsize of stackspace for x86-32) for the no-loop	Ronald S. Bultje	2010-07-26
\| \| \| \| \| \| \|	mbedge loopfilter functions, by re-using space that holds a variable that we no longer need. Originally committed as revision 24510 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	Use nested ifs instead of &&, which appears to not work with %ifidn (i.e. this	Ronald S. Bultje	2010-07-26
\| \| \| \| \| \|	construct was always enabled, even for <ssse3 versions). Originally committed as revision 24509 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	Split pextrw macro-spaghetti into several opt-specific macros, this will make	Ronald S. Bultje	2010-07-26
\| \| \| \| \| \| \| \|	future new optimizations (imagine a sse5) much easier. Also fix a bug where we used the direction (%2) rather than optimization (%1) to enable this, which means it wasn't ever actually used... Originally committed as revision 24507 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	Fix obvious bug in assignment. Somehow, the test vectors don't test this...	Ronald S. Bultje	2010-07-25
\| \| \| \|	Originally committed as revision 24489 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	Fix SPLATB_REG mess. Used to be a if/elseif/elseif/elseif spaghetti, so this	Ronald S. Bultje	2010-07-24
\| \| \| \| \| \| \| \|	splits it into small optimization-specific macros which are selected for each DSP function. The advantage of this approach is that the sse4 functions now use the ssse3 codepath also without needing an explicit sse4 codepath. Originally committed as revision 24487 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	VP8: optimize DC-only chroma case in the same way as luma.	Jason Garrett-Glaser	2010-07-23
\| \| \| \| \| \| \|	Add MMX idct_dc_add4uv function for this case. ~40% faster chroma idct. Originally committed as revision 24455 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	VP8 asm: cosmetics (spacing)	Jason Garrett-Glaser	2010-07-23
\| \| \| \|	Originally committed as revision 24453 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	VP8: 30% faster idct_mb	Jason Garrett-Glaser	2010-07-23
\| \| \| \| \| \| \| \| \| \|	Take shortcuts based on statistically common situations. Add 4-at-a-time idct_dc function (mmx and sse2) since rows of 4 DC-only DCT blocks are common. TODO: tie this more directly into the MB mode, since the DC-level transform is only used for non-splitmv blocks? Originally committed as revision 24452 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	VP8: clear DCT blocks in iDCT instead of using clear_blocks.	Jason Garrett-Glaser	2010-07-23
\| \| \| \| \| \|	~0.3% faster overall. Originally committed as revision 24448 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	Use pextrw for SSE4 mbedge filter result writing, speedup 5-10cycles on	Ronald S. Bultje	2010-07-22
\| \| \| \| \| \|	CPUs supporting it. Originally committed as revision 24437 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	Fix and enable horizontal >=SSE2 mbedge loopfilter.	Ronald S. Bultje	2010-07-22
\| \| \| \|	Originally committed as revision 24409 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	Eliminate one instruction in VP8 dc_add_sse4	Jason Garrett-Glaser	2010-07-21
\| \| \| \|	Originally committed as revision 24405 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	Various VP8 x86 deblocking speedups	Jason Garrett-Glaser	2010-07-21
\| \| \| \| \| \| \|	SSSE3 versions, improve SSE2 versions a bit. SSE2/SSSE3 mbedge h functions are currently broken, so explicitly disable them. Originally committed as revision 24403 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	Make mmx VP8 WHT faster	Jason Garrett-Glaser	2010-07-21
\| \| \| \| \| \| \|	Avoid pextrw, since it's slow on many older CPUs. Now it doesn't require mmxext either. Originally committed as revision 24397 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	VP8 MBedge loopfilter MMX/MMX2/SSE2 functions for both luma (width=16)	Ronald S. Bultje	2010-07-20
\| \| \| \| \| \|	and chroma (width=8). Originally committed as revision 24378 to svn://svn.ffmpeg.org/ffmpeg/trunk
*	Chroma (width=8) inner loopfilter MMX/MMX2/SSE2 for VP8 decoder.	Ronald S. Bultje	2010-07-20
\| \| \| \|	Originally committed as revision 24377 to svn://svn.ffmpeg.org/ffmpeg/trunk