libav.git - [no description]

	Commit message (Collapse)	Author	Age
*	swresample/x86/resample: Remove obsolete MMXEXT functions	Andreas Rheinhardt	2022-06-14
\| \| \| \| \| \| \| \| \| \| \|	x64 always has MMX, MMXEXT, SSE and SSE2 and this means that some functions for MMX, MMXEXT, SSE and 3dnow are always overridden by other functions (unless one e.g. explicitly disables SSE2). So given that the only systems which benefit from the MMXEXT resamplers (which are overridden by SSE2) are truely ancient 32bit x86s they are removed. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>
*	swresample/x86/rematrix: Remove obsolete MMX functions	Andreas Rheinhardt	2022-06-14
\| \| \| \| \| \| \| \| \| \| \|	x64 always has MMX, MMXEXT, SSE and SSE2 and this means that some functions for MMX, MMXEXT and 3dnow are always overridden by other functions (unless one e.g. explicitly disables SSE2) for x64. So given that the only systems that benefit from these functions are truely ancient 32bit x86s they are removed. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>
*	swresample/x86/audio_convert: Remove obsolete MMX functions	Andreas Rheinhardt	2022-06-14
\| \| \| \| \| \| \| \| \| \| \|	x64 always has MMX, MMXEXT, SSE and SSE2 and this means that some functions for MMX, MMXEXT and 3dnow are always overridden by other functions (unless one e.g. explicitly disables SSE2) for x64. So given that the only systems that benefit from these functions are truely ancient 32bit x86s they are removed. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>
*	Replace all occurences of av_mallocz_array() by av_calloc()	Andreas Rheinhardt	2021-09-20
\| \| \| \| \| \| \|	They do the same. Reviewed-by: Paul B Mahol <onemda@gmail.com> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>
*	Include attributes.h directly	Andreas Rheinhardt	2021-04-19
\| \| \| \| \| \| \| \|	Some files currently rely on libavutil/cpu.h to include it for them; yet said file won't use include it any more after the currently deprecated functions are removed, so include attributes.h directly. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>
*	swresample: Use channel count in rematrix initialization	Marcin Gorzel	2018-07-26
\| \| \| \| \| \| \| \|	Rematrixing supports up to 64 channels. However, there is only a limited number of channel layouts defined. Since the in/out channel count is currently obtained from the channel layout, for undefined layouts (e.g. for 9, 10, 11 channels etc.) the rematrixing fails. This patch changes rematrix init methods to use in (used) and out channel count directly instead of computing it from channel layout. Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
*	build: Generalize yasm/nasm-related variable names	Diego Biurrun	2017-06-21
\| \| \| \| \| \| \| \|	None of them are specific to the YASM assembler. (Cherry-picked from libav commit 39e208f4d4756367c7cd2d581847e0c1b8a429c1) Signed-off-by: James Almer <jamrial@gmail.com>
*	swresample/x86/resample: extend resample_double to support avx and fma3	Muhammad Faiz	2017-03-19
\| \| \| \| \| \| \| \| \|	benchmark: sse2 10.670s avx 8.763s fma3 8.380s Signed-off-by: Muhammad Faiz <mfcc64@gmail.com>
*	swresample/resample: optimize exact_rational=on:linear_interp=on case	Muhammad Faiz	2016-11-25
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	separate dsp.resample to dsp.resample_common and dsp.resample_linear and choose to call faster resample_common even when linear_interp=on when c->frac and c->dst_incr_mod are both zero speed up resampling when exact_rational and linear_interp are both enabled because exact_rational force c->frac and c->dst_incr_mod to be zero when soft compensation does not happen benchmark on exact_rational=on:linear_interp=on old new real 8.432s 5.097s user 7.679s 4.989s sys 0.125s 0.107s Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Muhammad Faiz <mfcc64@gmail.com>
*	swresample/x86: add support for exact_rational	Muhammad Faiz	2016-06-21
\| \| \| \| \| \| \|	phase_shift and phase_mask is removed generally exact_rational=on is faster than exact_rational=off Signed-off-by: Muhammad Faiz <mfcc64@gmail.com>
*	swresample: add exact_rational option	Muhammad Faiz	2016-06-13
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	give high quality resampling as good as with linear_interp=on as fast as without linear_interp=on tested visually with ffplay ffplay -f lavfi "aevalsrc='sin(10000tt)', aresample=osr=48000, showcqt=gamma=5" ffplay -f lavfi "aevalsrc='sin(10000tt)', aresample=osr=48000:linear_interp=on, showcqt=gamma=5" ffplay -f lavfi "aevalsrc='sin(10000tt)', aresample=osr=48000:exact_rational=on, showcqt=gamma=5" slightly speed improvement for fair comparison with -cpuflags 0 audio.wav is ~ 1 hour 44100 stereo 16bit wav file ffmpeg -i audio.wav -af aresample=osr=48000 -f null - old new real 13.498s 13.121s user 13.364s 12.987s sys 0.131s 0.129s linear_interp=on old new real 23.035s 23.050s user 22.907s 22.917s sys 0.119s 0.125s exact_rational=on real 12.418s user 12.298s sys 0.114s possibility to decrease memory usage if soft compensation is ignored Signed-off-by: Muhammad Faiz <mfcc64@gmail.com>
*	x86: use the new helper macros where useful	James Almer	2016-02-14
\| \| \| \| \|	Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: James Almer <jamrial@gmail.com>
*	x86/audio_convert: fix clobbering of xmm registers	James Almer	2015-10-01
\| \| \| \| \|	Reviewed-by: Michael Niedermayer <michaelni@gmx.at> Signed-off-by: James Almer <jamrial@gmail.com>
*	x86: move XOP emulation code back to x86inc	James Almer	2015-08-03
\| \| \| \| \| \| \| \| \| \|	Only two functions that use xop multiply-accumulate instructions where the first operand is the same as the fourth actually took advantage of the macros. This further reduces differences with x264's x86inc. Reviewed-by: Ronald S. Bultje <rsbultje@gmail.com> Signed-off-by: James Almer <jamrial@gmail.com>
*	swresample/x86: add missing colon to labels	James Almer	2015-07-26
\| \| \| \| \| \|	Silences warnings with Nasm Signed-off-by: James Almer <jamrial@gmail.com>
*	x86: check for AV_CPU_FLAG_AVXSLOW where useful	James Almer	2015-06-01
\| \| \| \| \|	Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	swresample: add av_cold to init functions	Michael Niedermayer	2015-02-21
\| \| \| \|	Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	x86/swr: make pack_8ch functions work with compilers without aligned stack	James Almer	2015-02-15
\| \| \| \|	Signed-off-by: James Almer <jamrial@gmail.com>
*	swresample/x86/rematrix_init: Check av_malloc* return codes, forward errors	Michael Niedermayer	2015-02-09
\| \| \| \|	Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	swresample/x86/rematrix_init: Use av_mallocz_array()	Michael Niedermayer	2015-02-09
\| \| \| \|	Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	x86/swr: add SSE/AVX unpack_6ch functions	James Almer	2015-01-12
\| \| \| \| \| \| \|	int32/float only Reviewed-by: Michael Niedermayer <michaelni@gmx.at> Signed-off-by: James Almer <jamrial@gmail.com>
*	x86/swr: load constants outside the loop in pack_6ch functions	James Almer	2015-01-11
\| \| \| \| \|	Reviewed-by: Michael Niedermayer <michaelni@gmx.at> Signed-off-by: James Almer <jamrial@gmail.com>
*	x86/swr: disable pack_8ch functions on msvc/icl x86_32	James Almer	2014-12-31
\| \| \| \| \| \|	Until a proper fix is committed. Signed-off-by: James Almer <jamrial@gmail.com>
*	x86/swr: add missing alignment check to pack_6ch functions	James Almer	2014-12-31
\| \| \| \| \|	Reviewed-by: Michael Niedermayer <michaelni@gmx.at> Signed-off-by: James Almer <jamrial@gmail.com>
*	x86/swr: add SSE2/AVX pack_8ch functions	James Almer	2014-12-30
\| \| \| \| \| \|	Reviewed-by: Michael Niedermayer <michaelni@gmx.at> Reviewed-by: Ronald S. Bultje <rsbultje@gmail.com> Signed-off-by: James Almer <jamrial@gmail.com>
*	x86/swr: add ff_float_to_int32_a_avx2	James Almer	2014-11-07
\| \| \| \| \| \| \| \|	13797 decicycles in ff_float_to_int32_a_sse2, 32768 runs, 0 skips 8603 decicycles in ff_float_to_int32_a_avx2, 32766 runs, 2 skips Reviewed-by: Christophe Gisquet <christophe.gisquet@gmail.com> Signed-off-by: James Almer <jamrial@gmail.com>
*	x86/swr: replace sse4 instructions in pack_6ch with sse ones	James Almer	2014-11-06
\| \| \| \| \| \| \| \| \|	There's no benefit from using blendps here except on CPUs with AVX, where it's faster than shufps according to Intel's documentation. As such, rename the sse4 functions to sse/sse2 and use shufps instead. Reviewed-by: Michael Niedermayer <michaelni@gmx.at> Signed-off-by: James Almer <jamrial@gmail.com>
*	x86/swr: use lavu helper macros to check CPU extensions	James Almer	2014-07-04
\| \| \| \| \|	Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	x86/swr: split audioconvert and rematrix DSP into separate files	James Almer	2014-07-04
\| \| \| \| \| \| \|	Also rename resample_x86_dsp.c to resample_init.c Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	swr: initialize only the necessary resample dsp functions	James Almer	2014-07-04
\| \| \| \| \|	Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	swr: rename swresample_dsp init functions to swri_resample_dsp	James Almer	2014-07-02
\| \| \| \| \| \| \|	The swresample_ prefix is not for internal functions Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	x86/swr: add ff_resample_{common, linear}_int16_xop	James Almer	2014-07-02
\| \| \| \| \|	Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	x86/swr: add ff_resample_{common, linear}_float_fma	James Almer	2014-07-02
\| \| \| \| \|	Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	x86/swr: convert resample_{common, linear}_double_sse2 to yasm	James Almer	2014-07-01
\| \| \| \| \| \| \| \|	Signed-off-by: James Almer <jamrial@gmail.com> 312531 -> 311528 dezicycles Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	swr: convert resample_common/linear_int16_mmx2/sse2 to yasm.	Ronald S. Bultje	2014-06-30
\| \| \| \|	Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	swr: rewrite resample_common/linear_float_sse/avx in yasm.	Ronald S. Bultje	2014-06-28
\| \| \| \| \| \| \| \| \| \| \|	Linear interpolation goes from 63 (llvm) or 58 (gcc) to 48 (yasm) cycles/sample on 64bit, or from 66 (llvm/gcc) to 52 (yasm) cycles/ sample on 32bit. Bon-linear goes from 43 (llvm) or 38 (gcc) to 32 (yasm) cycles/sample on 64bit, or from 46 (llvm) or 44 (gcc) to 38 (yasm) cycles/sample on 32bit (all testing on OSX 10.9.2, llvm 5.1 and gcc 4.8/9). Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	swr: compile mmx2 s16p functions only on x86-32.	Ronald S. Bultje	2014-06-15
\| \| \| \|	Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	swr: add prototypes for resample dsp functions	James Almer	2014-06-15
\| \| \| \| \| \| \| \|	Should fix compilation failures with MSVC and any other compiler without inline asm support. Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	swr: remove obsolete function prototypes.	Ronald S. Bultje	2014-06-15
\| \| \| \|	Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	swr: split out DSP functions.	Ronald S. Bultje	2014-06-14
\| \| \| \| \| \| \| \| \| \|	DSP bits of swri_resample go into their own mini-DSP functions; DSP init goes from a per-call branch in multiple_resample to a proper DSP init routine; x86 bits go into x86/; swri_resample() moves out of resample_template.c into resample.c because it's independent of DSP code or sample type; multiple_resample() is simplified. Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	swresample: add swri_resample_float_avx	James Almer	2014-05-16
\| \| \| \| \|	Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	inline asm: fix arrays as named constraints.	Matt Oliver	2014-05-07
\| \| \| \|	Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	swresample/resample: add missing xmm clobbers	James Almer	2014-05-07
\| \| \| \| \| \| \|	Might fix fate-swr on ICL Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	swresample: add swri_resample_double_sse2	James Almer	2014-04-25
\| \| \| \| \|	Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	swresample/resample: sse float linear interpolation	James Almer	2014-03-24
\| \| \| \| \| \| \|	About two times faster Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	swresample/resample: mmx2/sse2 int16 linear interpolation	James Almer	2014-03-24
\| \| \| \| \| \| \|	About three times faster Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	swresample: add swri_resample_float_sse	James Almer	2014-03-20
\| \| \| \| \| \| \|	At least two times faster than the C version. Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	Automatically change MANGLE() into named inline asm operands when direct ↵	Matt Oliver	2014-03-18
\| \| \| \| \| \| \| \|	symbol reference in inline asm are not supported. This is part of the patch-set for intel C inline asm on windows support Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	swresample: change COMMON_CORE_INT16 asm from SSSE3 to SSE2	James Almer	2014-03-18
\| \| \| \| \| \| \| \| \|	pshuf+paddd is slightly faster than phaddd. The real gain is in pre-ssse3 processors like AMD K8 and K10, which get a big boost in performance compared to the mmxext version Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
*	swresample: Add arm&x86 clobber tests	Martin Storsjö	2014-01-18
\| \| \| \|	Signed-off-by: Michael Niedermayer <michaelni@gmx.at>