Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | Consistently use "cpu_flags" as variable/parameter name for CPU flags | Diego Biurrun | 2013-07-18 |
| | |||
* | cosmetics: Remove unnecessary extern keywords from function declarations | Diego Biurrun | 2013-03-27 |
| | |||
* | x86: lavr: add SSE2/AVX dither_int_to_float() | Justin Ruggles | 2013-01-08 |
| | |||
* | x86: lavr: add SSE2 quantize() for dithering | Justin Ruggles | 2013-01-08 |
| | |||
* | x86: lavr: use the x86inc.asm automatic stack alignment in mixing functions | Justin Ruggles | 2013-01-05 |
| | | | | CC:libav-stable@libav.org | ||
* | x86: lavr: fix stack allocation for 7 and 8 channel downmixing on x86-32 | Ronald S. Bultje | 2012-11-17 |
| | | | | Fixes crashes on Win32 and stack overruns on x86-32 in general. | ||
* | build: Drop AVX assembly ifdefs | Diego Biurrun | 2012-11-11 |
| | | | | An assembler able to cope with AVX instructions is now required. | ||
* | x86: PALIGNR: port to cpuflags | Diego Biurrun | 2012-11-09 |
| | |||
* | x86: avresample: Add missing colons to assembly labels | Diego Biurrun | 2012-11-06 |
| | | | | YASM accepts labels without colons, but NASM issues warnings. | ||
* | x86: yasm: Use complete source path for macro helper %includes | Diego Biurrun | 2012-10-31 |
| | | | | | This is more consistent with the way we handle C #includes and it simplifies the build system. | ||
* | x86: include x86inc.asm in x86util.asm | Diego Biurrun | 2012-10-31 |
| | | | | This is necessary to allow refactoring some x86util macros with cpuflags. | ||
* | lavr: handle clipping in the float to s32 conversion | Justin Ruggles | 2012-10-13 |
| | | | | | | | | | | | We cannot clip to INT_MAX because that value cannot be exactly represented by a float value and ends up overflowing during conversion anyway. We need to use a slightly smaller float value, which ends up with slightly inaccurate results for samples which clip or nearly clip, but it is close enough. Using doubles as intermediates in the conversion would be more accurate, but it takes about twice as much time. Signed-off-by: Luca Barbato <lu_zero@gentoo.org> | ||
* | x86: Replace checks for CPU extensions and flags by convenience macros | Diego Biurrun | 2012-09-08 |
| | | | | | This separates code relying on inline from that relying on external assembly and fixes instances where the coalesced check was incorrect. | ||
* | x86: Split inline and external assembly #ifdefs | Diego Biurrun | 2012-08-31 |
| | |||
* | x86: cosmetics: Comment some #endifs for better readability | Diego Biurrun | 2012-08-30 |
| | |||
* | lavr: x86: optimized 6-channel flt to fltp conversion | Justin Ruggles | 2012-08-23 |
| | |||
* | lavr: x86: optimized 2-channel flt to fltp conversion | Justin Ruggles | 2012-08-23 |
| | |||
* | lavr: x86: optimized 6-channel flt to s16p conversion | Justin Ruggles | 2012-08-23 |
| | |||
* | lavr: x86: optimized 2-channel flt to s16p conversion | Justin Ruggles | 2012-08-23 |
| | |||
* | lavr: x86: optimized 6-channel s16 to fltp conversion | Justin Ruggles | 2012-08-23 |
| | |||
* | lavr: x86: optimized 2-channel s16 to fltp conversion | Justin Ruggles | 2012-08-23 |
| | |||
* | lavr: x86: optimized 6-channel s16 to s16p conversion | Justin Ruggles | 2012-08-23 |
| | |||
* | lavr: x86: optimized 2-channel s16 to s16p conversion | Justin Ruggles | 2012-08-23 |
| | |||
* | lavr: x86: optimized 2-channel fltp to flt conversion | Justin Ruggles | 2012-08-23 |
| | |||
* | lavr: x86: optimized 6-channel fltp to s16 conversion | Justin Ruggles | 2012-08-23 |
| | |||
* | lavr: x86: optimized 2-channel fltp to s16 conversion | Justin Ruggles | 2012-08-23 |
| | |||
* | lavr: x86: optimized 6-channel s16p to flt conversion | Justin Ruggles | 2012-08-23 |
| | |||
* | lavr: x86: optimized 2-channel s16p to flt conversion | Justin Ruggles | 2012-08-23 |
| | |||
* | lavr: x86: optimized 6-channel s16p to s16 conversion | Justin Ruggles | 2012-08-23 |
| | |||
* | lavr: x86: optimized 2-channel s16p to s16 conversion | Justin Ruggles | 2012-08-23 |
| | |||
* | x86: add colons after labels | Mans Rullgard | 2012-08-07 |
| | | | | | | nasm prints a warning if the colon is missing. Signed-off-by: Mans Rullgard <mans@mansr.com> | ||
* | lavr: x86: improve non-SSE4 version of S16_TO_S32_SX macro | Justin Ruggles | 2012-07-27 |
| | | | | | Removes a false dependency on existing contents of the 2nd dst register, giving better performance for OOE. | ||
* | lavr: add x86-optimized mixing functions | Justin Ruggles | 2012-07-27 |
| | | | | | Adds optimized functions for mixing 3 through 8 input channels to 1 and 2 output channels in fltp or s16p format with flt coeffs. | ||
* | x86inc: automatically insert vzeroupper for YMM functions. | Ronald S. Bultje | 2012-07-26 |
| | |||
* | lavr: x86: add missing vzeroupper in ff_mix_1_to_2_fltp_flt() | Justin Ruggles | 2012-07-25 |
| | |||
* | lavr: x86: fix ff_conv_fltp_to_flt_6ch function prototypes | Justin Ruggles | 2012-06-26 |
| | | | | | | Changed to match the number of parameters in conv_func_interleave(), which is how they are called. The change isn't strictly necessary because the 4th parameter is not used, but the code is clearer if they match. | ||
* | lavr: x86: merge some branches | Justin Ruggles | 2012-06-25 |
| | |||
* | lavr: Add x86-optimized function for flt to s32 conversion | Justin Ruggles | 2012-06-18 |
| | |||
* | lavr: Add x86-optimized function for flt to s16 conversion | Justin Ruggles | 2012-06-18 |
| | |||
* | lavr: Add x86-optimized functions for s32 to flt conversion | Justin Ruggles | 2012-06-18 |
| | |||
* | lavr: Add x86-optimized functions for s32 to s16 conversion | Justin Ruggles | 2012-06-18 |
| | |||
* | lavr: Add x86-optimized functions for s16 to flt conversion | Justin Ruggles | 2012-06-18 |
| | |||
* | lavr: Add x86-optimized function for s16 to s32 conversion | Justin Ruggles | 2012-06-18 |
| | |||
* | lavr: add x86-optimized functions for mixing 1-to-2 s16p with flt coeffs | Justin Ruggles | 2012-06-18 |
| | |||
* | lavr: add x86-optimized functions for mixing 1-to-2 fltp with flt coeffs | Justin Ruggles | 2012-06-18 |
| | |||
* | lavr: add x86-optimized function for mixing 2 to 1 s16p with q8 coeffs | Justin Ruggles | 2012-05-29 |
| | |||
* | lavr: add x86-optimized functions for mixing 2 to 1 s16p with float coeffs | Justin Ruggles | 2012-05-29 |
| | |||
* | lavr: replace the SSE version of ff_conv_fltp_to_flt_6ch() with SSE4 and AVX | Justin Ruggles | 2012-05-09 |
| | | | | | The current SSE version is slower than the MMX version on Athlon64 and Sandy Bridge, but the SSE4 and AVX versions are faster on Sandy Bridge. | ||
* | build: cosmetics: Add missing end-of-line backslashes to item lists. | Diego Biurrun | 2012-05-07 |
| | |||
* | Fix compilation with YASM/NASM without AVX support. | Reimar Döffinger | 2012-05-01 |
| | | | | | Signed-off-by: Reimar Döffinger <Reimar.Doeffinger@gmx.de> Signed-off-by: Justin Ruggles <justin.ruggles@gmail.com> |