path: root/libswresample/x86/audio_convert.asm
Commit message | Author | Age
* swresample/x86/audio_convert: Remove obsolete MMX functions | Andreas Rheinhardt | 2022-06-14
  x64 always has MMX, MMXEXT, SSE and SSE2, which means that some functions for MMX, MMXEXT and 3dnow are always overridden by other functions on x64 (unless one e.g. explicitly disables SSE2). Given that the only systems that benefit from these functions are truly ancient 32-bit x86s, they are removed.
  Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>
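
The override described in this commit message can be pictured as a dispatch routine that tests CPU feature flags in ascending order, so that a later match replaces an earlier one. The following is a minimal C sketch under that assumption, not the actual libswresample init code; the flag names, the `impl` enum and `select_converter` are all hypothetical.

```c
/* Hypothetical CPU feature bits, for illustration only. */
enum { CPU_MMX = 1, CPU_SSE = 2, CPU_SSE2 = 4 };

/* Hypothetical identifiers for the available implementations. */
enum impl { IMPL_C, IMPL_MMX, IMPL_SSE2 };

/* Later checks overwrite earlier choices, so on a CPU that reports
 * SSE2 (as every x64 CPU does) the MMX choice is never the final one. */
enum impl select_converter(unsigned cpu_flags)
{
    enum impl chosen = IMPL_C;      /* portable fallback */

    if (cpu_flags & CPU_MMX)
        chosen = IMPL_MMX;
    if (cpu_flags & CPU_SSE2)
        chosen = IMPL_SSE2;         /* overrides the MMX choice */

    return chosen;
}
```

With this ordering the MMX branch only matters when SSE2 is absent or disabled, which is the situation the commit message describes.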
* x86/audio_convert: fix clobbering of xmm registers | James Almer | 2015-10-01
  Reviewed-by: Michael Niedermayer <michaelni@gmx.at>
  Signed-off-by: James Almer <jamrial@gmail.com>
* swresample/x86: add missing colon to labels | James Almer | 2015-07-26
  Silences warnings with Nasm
  Signed-off-by: James Almer <jamrial@gmail.com>
* x86/swr: make pack_8ch functions work with compilers without aligned stack | James Almer | 2015-02-15
  Signed-off-by: James Almer <jamrial@gmail.com>
* x86/swr: add SSE/AVX unpack_6ch functions | James Almer | 2015-01-12
  int32/float only
  Reviewed-by: Michael Niedermayer <michaelni@gmx.at>
  Signed-off-by: James Almer <jamrial@gmail.com>
* x86/swr: load constants outside the loop in pack_6ch functions | James Almer | 2015-01-11
  Reviewed-by: Michael Niedermayer <michaelni@gmx.at>
  Signed-off-by: James Almer <jamrial@gmail.com>
* x86/swr: disable pack_8ch functions on msvc/icl x86_32 | James Almer | 2014-12-31
  Until a proper fix is committed.
  Signed-off-by: James Almer <jamrial@gmail.com>
* x86/swr: add missing alignment check to pack_6ch functions | James Almer | 2014-12-31
  Reviewed-by: Michael Niedermayer <michaelni@gmx.at>
  Signed-off-by: James Almer <jamrial@gmail.com>
* x86/swr: add SSE2/AVX pack_8ch functions | James Almer | 2014-12-30
  Reviewed-by: Michael Niedermayer <michaelni@gmx.at>
  Reviewed-by: Ronald S. Bultje <rsbultje@gmail.com>
  Signed-off-by: James Almer <jamrial@gmail.com>
* x86/swr: add ff_float_to_int32_a_avx2 | James Almer | 2014-11-07
  13797 decicycles in ff_float_to_int32_a_sse2, 32768 runs, 0 skips
  8603 decicycles in ff_float_to_int32_a_avx2, 32766 runs, 2 skips
  Reviewed-by: Christophe Gisquet <christophe.gisquet@gmail.com>
  Signed-off-by: James Almer <jamrial@gmail.com>
* x86/swr: replace sse4 instructions in pack_6ch with sse ones | James Almer | 2014-11-06
  There's no benefit from using blendps here except on CPUs with AVX, where it's faster than shufps according to Intel's documentation. As such, rename the sse4 functions to sse/sse2 and use shufps instead.
  Reviewed-by: Michael Niedermayer <michaelni@gmx.at>
  Signed-off-by: James Almer <jamrial@gmail.com>
* x86: Fix compilation with nasm on PPC & OS/2 | Ronald S. Bultje | 2013-10-08
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swresample/x86/audio_convert: add emms to CONV | Michael Niedermayer | 2013-06-18
  Might fix Ticket1874
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* Merge commit '6860b4081d046558c44b1b42f22022ea341a2a73' | Michael Niedermayer | 2012-10-31
  * commit '6860b4081d046558c44b1b42f22022ea341a2a73':
    x86: include x86inc.asm in x86util.asm
    cng: Reindent some incorrectly indented lines
    cngdec: Allow flushing the decoder
    cngdec: Make the dbov variable have the right unit
    cngdec: Fix the memset size to cover the full array
    cngdec: Update the LPC coefficients after averaging the reflection coefficients
    configure: fix print_config() with broke awks
  Conflicts:
    libavcodec/x86/ac3dsp.asm
    libavcodec/x86/dct32.asm
    libavcodec/x86/deinterlace.asm
    libavcodec/x86/dsputil.asm
    libavcodec/x86/dsputilenc.asm
    libavcodec/x86/fft.asm
    libavcodec/x86/fmtconvert.asm
    libavcodec/x86/h264_chromamc.asm
    libavcodec/x86/h264_deblock.asm
    libavcodec/x86/h264_deblock_10bit.asm
    libavcodec/x86/h264_idct.asm
    libavcodec/x86/h264_idct_10bit.asm
    libavcodec/x86/h264_intrapred.asm
    libavcodec/x86/h264_intrapred_10bit.asm
    libavcodec/x86/h264_weight.asm
    libavcodec/x86/vc1dsp.asm
    libavcodec/x86/vp3dsp.asm
    libavcodec/x86/vp56dsp.asm
    libavcodec/x86/vp8dsp.asm
  Merged-by: Michael Niedermayer <michaelni@gmx.at>
* Add some missing _EXTERNAL suffixes to yasm source files. | Carl Eugen Hoyos | 2012-08-31
* swr/x86: 10l, missed some SSE2 instructions in code marked as SSE. | Michael Niedermayer | 2012-07-05
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* libswresample-simd: Add ff_pack_6ch_float_to_int32_a_avx and ff_pack_6ch_float_to_int32_a_sse4 | Michael Niedermayer | 2012-05-13
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* libswresample-simd: add ff_pack_6ch_int32_to_float_a_avx and ff_pack_6ch_int32_to_float_a_sse4 | Michael Niedermayer | 2012-05-13
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* libswresample: make NOP_N macro less picky on its parameters | Michael Niedermayer | 2012-05-13
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* libswresample: Change FLOAT_TO_INT32_N to need 1 register less | Michael Niedermayer | 2012-05-13
  same speed on sandy bridge
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* libswresample-simd: rename 6ch pack to what it is | Michael Niedermayer | 2012-05-13
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* libswresample-simd: make the converter registers parameters | Michael Niedermayer | 2012-05-13
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* libswresample: cosmetics | Michael Niedermayer | 2012-05-13
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* libswresample: unaligned AVX/SSE4 float and int32 6ch pack | Michael Niedermayer | 2012-05-13
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* libswresample: Implement MMX, SSE4 and AVX 6ch float and int32 packing function. | Justin Ruggles | 2012-05-13
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr-x86-simd: add ff_unpack_2ch_int16_to_int16/int32/float_a_ssse3 | Michael Niedermayer | 2012-05-06
  more than 10% faster (tested on sandybridge)
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr-x86-simd: stereo unpack S16/S32/FLT-> S16/S32/FLT SSE/SSE2 (16 new SIMD functions) | Michael Niedermayer | 2012-05-06
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr-x86-SIMD: 3 instructions less for stereo planar->packed s32/flt->s16 | Michael Niedermayer | 2012-05-06
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: replace the remaining 2 audio convert SIMD macros by the new ones | Michael Niedermayer | 2012-05-05
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: fix internal asm labels | Michael Niedermayer | 2012-05-05
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: fix PACK_2CH register count | Michael Niedermayer | 2012-05-05
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: replace planar->planar/packed->packed FLT<->S16/S32 SIMD by new macros | Michael Niedermayer | 2012-05-05
  this simplifies the code
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: implement stereo S16/S32/FLT->S16/S32/FLT planar->packed in SSE/SSE2 | Michael Niedermayer | 2012-05-05
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: change simd len argument to be in samples instead of dst bytes. | Michael Niedermayer | 2012-05-05
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
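
To illustrate what a length in samples (instead of destination bytes) means for a caller, here is a small C sketch. It is not the assembly from this file: `convert_s16_to_s32` and `dst_bytes_for` are hypothetical names, and scaling by 65536 is an assumption about how a 16-bit sample is widened to 32 bits.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: destination size in bytes for `len` samples. */
size_t dst_bytes_for(size_t len, size_t dst_sample_size)
{
    return len * dst_sample_size;
}

/* Hypothetical converter whose `len` counts samples, so the same value
 * is valid for both the 2-byte source and the 4-byte destination. */
void convert_s16_to_s32(int32_t *dst, const int16_t *src, size_t len)
{
    for (size_t i = 0; i < len; i++)
        dst[i] = (int32_t)src[i] * 65536;   /* assumed scaling: place the value in the top 16 bits */
}
```

A caller that previously had to pass `len * sizeof(int32_t)` would pass `len` directly; the byte count can still be derived with `dst_bytes_for(len, sizeof(int32_t))` when it is needed.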
* swr: add ff_int32_to_float_a_avx | Michael Niedermayer | 2012-05-03
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: int32_to_int16_mmx/sse | Michael Niedermayer | 2012-04-29
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: float_to_int16_sse2() | Michael Niedermayer | 2012-04-29
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: float_to_int32_sse2() | Michael Niedermayer | 2012-04-29
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: add int16_to_float_sse2() | Michael Niedermayer | 2012-04-28
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: add int32_to_float_sse2 | Michael Niedermayer | 2012-04-28
  could be done for sse/3dnow too if someone wants
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: int16->int32: use the old index negate trick to avoid 2 adds | Michael Niedermayer | 2012-04-28
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: more correct cglobal parameters to int16->int32 | Michael Niedermayer | 2012-04-28
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
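
The "index negate trick" named in this commit subject is commonly understood as follows: point both pointers at the end of their buffers and run a single negative index up to zero, so the loop needs only one increment instead of separate pointer adds. The C sketch below shows that general idea under this assumption; it is not a transcription of the assembly, and `int16_to_int32_negidx` is a hypothetical name.

```c
#include <stddef.h>
#include <stdint.h>

/* Convert `len` samples using one negative index that counts up to zero. */
void int16_to_int32_negidx(int32_t *dst, const int16_t *src, ptrdiff_t len)
{
    const int16_t *src_end = src + len;   /* one past the last source sample */
    int32_t       *dst_end = dst + len;   /* one past the last destination sample */

    /* i runs from -len to -1; src_end[i] and dst_end[i] address element len + i. */
    for (ptrdiff_t i = -len; i < 0; i++)
        dst_end[i] = (int32_t)src_end[i] * 65536;   /* assumed scaling to 32 bits */
}
```

In assembly the same pattern lets the loop's increment double as the termination test, since the loop ends when the index reaches zero.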
* swr: separate functions for aligned & unaligned | Michael Niedermayer | 2012-04-28
  If someone has an idea on how to do this cleaner, it's welcome
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
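
One way a caller might choose between an aligned and an unaligned variant is a run-time check on the buffer addresses. The C sketch below assumes a 16-byte requirement (the width of an SSE register) and uses hypothetical names throughout; it does not reproduce how libswresample actually makes this choice.

```c
#include <stdint.h>

/* Hypothetical tags for the two function variants. */
enum variant { VARIANT_UNALIGNED, VARIANT_ALIGNED };

/* Nonzero when the address is a multiple of 16 (assumed SSE alignment). */
int is_aligned16(const void *p)
{
    return ((uintptr_t)p & 15) == 0;
}

/* The aligned variant is only safe when both buffers are aligned. */
enum variant pick_variant(const void *dst, const void *src)
{
    return (is_aligned16(dst) && is_aligned16(src)) ? VARIANT_ALIGNED
                                                    : VARIANT_UNALIGNED;
}
```

Keeping the check outside the conversion loop means the per-sample code in either variant does not need to branch on alignment.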
* swr: add int16_to_int32_mmx/sse | Michael Niedermayer | 2012-04-28
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>