path: root/libswresample/x86/audio_convert.asm
Commit message | Author | Age
* x86/audio_convert: fix clobbering of xmm registers | James Almer | 2015-10-01
  Reviewed-by: Michael Niedermayer <michaelni@gmx.at>
  Signed-off-by: James Almer <jamrial@gmail.com>
* swresample/x86: add missing colon to labels | James Almer | 2015-07-26
  Silences warnings with Nasm
  Signed-off-by: James Almer <jamrial@gmail.com>
* x86/swr: make pack_8ch functions work with compilers without aligned stack | James Almer | 2015-02-15
  Signed-off-by: James Almer <jamrial@gmail.com>
* x86/swr: add SSE/AVX unpack_6ch functions | James Almer | 2015-01-12
  int32/float only
  Reviewed-by: Michael Niedermayer <michaelni@gmx.at>
  Signed-off-by: James Almer <jamrial@gmail.com>
* x86/swr: load constants outside the loop in pack_6ch functions | James Almer | 2015-01-11
  Reviewed-by: Michael Niedermayer <michaelni@gmx.at>
  Signed-off-by: James Almer <jamrial@gmail.com>
* x86/swr: disable pack_8ch functions on msvc/icl x86_32 | James Almer | 2014-12-31
  Until a proper fix is committed.
  Signed-off-by: James Almer <jamrial@gmail.com>
* x86/swr: add missing alignment check to pack_6ch functions | James Almer | 2014-12-31
  Reviewed-by: Michael Niedermayer <michaelni@gmx.at>
  Signed-off-by: James Almer <jamrial@gmail.com>
* x86/swr: add SSE2/AVX pack_8ch functions | James Almer | 2014-12-30
  Reviewed-by: Michael Niedermayer <michaelni@gmx.at>
  Reviewed-by: Ronald S. Bultje <rsbultje@gmail.com>
  Signed-off-by: James Almer <jamrial@gmail.com>
* x86/swr: add ff_float_to_int32_a_avx2 | James Almer | 2014-11-07
  13797 decicycles in ff_float_to_int32_a_sse2, 32768 runs, 0 skips
  8603 decicycles in ff_float_to_int32_a_avx2, 32766 runs, 2 skips
  Reviewed-by: Christophe Gisquet <christophe.gisquet@gmail.com>
  Signed-off-by: James Almer <jamrial@gmail.com>
* x86/swr: replace sse4 instructions in pack_6ch with sse ones | James Almer | 2014-11-06
  There's no benefit from using blendps here except on CPUs with AVX, where it's faster than shufps according to Intel's documentation.
  As such, rename the sse4 functions to sse/sse2 and use shufps instead.
  Reviewed-by: Michael Niedermayer <michaelni@gmx.at>
  Signed-off-by: James Almer <jamrial@gmail.com>
* x86: Fix compilation with nasm on PPC & OS/2 | Ronald S. Bultje | 2013-10-08
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swresample/x86/audio_convert: add emms to CONV | Michael Niedermayer | 2013-06-18
  Might fix Ticket1874
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* Merge commit '6860b4081d046558c44b1b42f22022ea341a2a73' | Michael Niedermayer | 2012-10-31
  * commit '6860b4081d046558c44b1b42f22022ea341a2a73':
    x86: include x86inc.asm in x86util.asm
    cng: Reindent some incorrectly indented lines
    cngdec: Allow flushing the decoder
    cngdec: Make the dbov variable have the right unit
    cngdec: Fix the memset size to cover the full array
    cngdec: Update the LPC coefficients after averaging the reflection coefficients
    configure: fix print_config() with broke awks
  Conflicts:
    libavcodec/x86/ac3dsp.asm
    libavcodec/x86/dct32.asm
    libavcodec/x86/deinterlace.asm
    libavcodec/x86/dsputil.asm
    libavcodec/x86/dsputilenc.asm
    libavcodec/x86/fft.asm
    libavcodec/x86/fmtconvert.asm
    libavcodec/x86/h264_chromamc.asm
    libavcodec/x86/h264_deblock.asm
    libavcodec/x86/h264_deblock_10bit.asm
    libavcodec/x86/h264_idct.asm
    libavcodec/x86/h264_idct_10bit.asm
    libavcodec/x86/h264_intrapred.asm
    libavcodec/x86/h264_intrapred_10bit.asm
    libavcodec/x86/h264_weight.asm
    libavcodec/x86/vc1dsp.asm
    libavcodec/x86/vp3dsp.asm
    libavcodec/x86/vp56dsp.asm
    libavcodec/x86/vp8dsp.asm
  Merged-by: Michael Niedermayer <michaelni@gmx.at>
* Add some missing _EXTERNAL suffixes to yasm source files. | Carl Eugen Hoyos | 2012-08-31
* swr/x86: 10l, missed some SSE2 instructions in code marked as SSE. | Michael Niedermayer | 2012-07-05
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* libswresample-simd: Add ff_pack_6ch_float_to_int32_a_avx and ff_pack_6ch_float_to_int32_a_sse4 | Michael Niedermayer | 2012-05-13
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* libswresample-simd: add ff_pack_6ch_int32_to_float_a_avx and ff_pack_6ch_int32_to_float_a_sse4 | Michael Niedermayer | 2012-05-13
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* libswresample: make NOP_N macro less picky on its parameters | Michael Niedermayer | 2012-05-13
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* libswresample: Change FLOAT_TO_INT32_N to need 1 register less | Michael Niedermayer | 2012-05-13
  same speed on sandy bridge
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* libswresample-simd: rename 6ch pack to what it is | Michael Niedermayer | 2012-05-13
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* libswresample-simd: make the converter registers parameters | Michael Niedermayer | 2012-05-13
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* libswresample: cosmetics | Michael Niedermayer | 2012-05-13
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* libswresample: unaligned AVX/SSE4 float and int32 6ch pack | Michael Niedermayer | 2012-05-13
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* libswresample: Implement MMX, SSE4 and AVX 6ch float and int32 packing function. | Justin Ruggles | 2012-05-13
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr-x86-simd: add ff_unpack_2ch_int16_to_int16/int32/float_a_ssse3 | Michael Niedermayer | 2012-05-06
  more than 10% faster (tested on sandybridge)
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr-x86-simd: stereo unpack S16/S32/FLT-> S16/S32/FLT SSE/SSE2 (16 new SIMD functions) | Michael Niedermayer | 2012-05-06
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr-x86-SIMD: 3 instructions less for stereo planar->packed s32/flt->s16 | Michael Niedermayer | 2012-05-06
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: replace the remaining 2 audio convert SIMD macros by the new ones | Michael Niedermayer | 2012-05-05
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: fix internal asm labels | Michael Niedermayer | 2012-05-05
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: fix PACK_2CH register count | Michael Niedermayer | 2012-05-05
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: replace planar->planar/packed->packed FLT<->S16/S32 SIMD by new macros | Michael Niedermayer | 2012-05-05
  this simplifies the code
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: implement stereo S16/S32/FLT->S16/S32/FLT planar->packed in SSE/SSE2 | Michael Niedermayer | 2012-05-05
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: change simd len argument to be in samples instead of dst bytes. | Michael Niedermayer | 2012-05-05
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: add ff_int32_to_float_a_avx | Michael Niedermayer | 2012-05-03
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: int32_to_int16_mmx/sse | Michael Niedermayer | 2012-04-29
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: float_to_int16_sse2() | Michael Niedermayer | 2012-04-29
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: float_to_int32_sse2() | Michael Niedermayer | 2012-04-29
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: add int16_to_float_sse2() | Michael Niedermayer | 2012-04-28
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: add int32_to_float_sse2 | Michael Niedermayer | 2012-04-28
  could be done for sse/3dnow too if someone wants
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: int16->int32: use the old index negate trick to avoid 2 adds | Michael Niedermayer | 2012-04-28
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: more correct cglobal parameters to int16->int32 | Michael Niedermayer | 2012-04-28
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: seperate functions for aligned & unaligned | Michael Niedermayer | 2012-04-28
  If someone has an idea on how to do this cleaner, its welcome
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* swr: add int16_to_int32_mmx/sse | Michael Niedermayer | 2012-04-28
  Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
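
Note on the 2012-04-28 entry "swr: int16->int32: use the old index negate trick to avoid 2 adds": the log does not show the assembly itself, so the following is only a scalar C sketch of the usual form of that trick, not the code from audio_convert.asm. The function name int16_to_int32_negidx is hypothetical. The idea, as commonly applied, is to point both buffers at their ends and count a single negative index up to zero, so one increment both advances the position and provides the loop-termination condition, instead of advancing two pointers and a counter separately.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical scalar helper; an illustration only, not taken from
 * libswresample. Converts len int16 samples to int32 by scaling each
 * sample by 2^16 (the arithmetic equivalent of a 16-bit left shift). */
static void int16_to_int32_negidx(int32_t *dst, const int16_t *src, ptrdiff_t len)
{
    /* Move both pointers one past the last sample ... */
    const int16_t *src_end = src + len;
    int32_t *dst_end = dst + len;

    /* ... then run one negative index up towards zero. The same
     * variable addresses both buffers and ends the loop. */
    for (ptrdiff_t i = -len; i < 0; i++)
        dst_end[i] = (int32_t)src_end[i] * 65536;
}
```

In hand-written x86 assembly the gain presumably comes from the index increment setting the flags that the loop branch tests; an optimizing C compiler may or may not generate the same loop shape from this source.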