Commit message (Collapse) | Author | Age | |
---|---|---|---|
* | x86: float dsp: unroll SSE versions | Christophe Gisquet | 2014-02-20 |
| | | | | | | | | | | vector_fmul and vector_fmac_scalar are guaranteed that they can process in batch of 16 elements, but their SSE versions only does 8 at a time. Therefore, unroll them a bit. 299 to 261c for 256 elements in vector_fmac_scalar on Arrandale/Win64. Signed-off-by: Janne Grunau <janne-libav@jannau.net> | ||
* | x86: float dsp: butterflies_float SSE | Christophe Gisquet | 2013-05-03 |
| | | | | | 97c -> 49c Some codecs could benefit from more unrolling, but AAC doesn't. | ||
* | floatdsp: move scalarproduct_float from dsputil to avfloatdsp. | Ronald S. Bultje | 2013-01-22 |
| | | | | This makes the aac decoder and all voice codecs independent of dsputil. | ||
* | floatdsp: move vector_fmul_reverse from dsputil to avfloatdsp. | Ronald S. Bultje | 2013-01-22 |
| | | | | | | Now, nellymoserenc and aacenc no longer depends on dsputil. Independent of this patch, wmaprodec also does not depend on dsputil, so I removed it from there also. | ||
* | floatdsp: move vector_fmul_add from dsputil to avfloatdsp. | Ronald S. Bultje | 2013-01-22 |
| | |||
* | x86: float_dsp: fix loading of the len parameter on x86-32 | Justin Ruggles | 2012-12-07 |
| | |||
* | x86: float_dsp: fix compilation of ff_vector_dmul_scalar_avx() on x86-32 | Justin Ruggles | 2012-12-06 |
| | | | | Signed-off-by: Janne Grunau <janne-libav@jannau.net> | ||
* | float_dsp: add vector_dmul_scalar() to multiply a vector of doubles | Justin Ruggles | 2012-12-05 |
| | | | | Include x86-optimized versions for SSE2 and AVX. | ||
* | x86: float_dsp: add SSE version of vector_fmul_scalar() | Justin Ruggles | 2012-11-26 |
| | |||
* | build: Drop AVX assembly ifdefs | Diego Biurrun | 2012-11-11 |
| | | | | An assembler able to cope with AVX instructions is now required. | ||
* | x86: include x86inc.asm in x86util.asm | Diego Biurrun | 2012-10-31 |
| | | | | This is necessary to allow refactoring some x86util macros with cpuflags. | ||
* | x86: float_dsp: fix ff_vector_fmac_scalar_avx() on Win64 | Justin Ruggles | 2012-09-07 |
| | | | | | The SWAP macro does not work for explicit xmm/ymm usage, so instead just move the scalar value from xmm2 to xmm0. | ||
* | x86: Split inline and external assembly #ifdefs | Diego Biurrun | 2012-08-31 |
| | |||
* | x86: add colons after labels | Mans Rullgard | 2012-08-07 |
| | | | | | | nasm prints a warning if the colon is missing. Signed-off-by: Mans Rullgard <mans@mansr.com> | ||
* | x86inc: automatically insert vzeroupper for YMM functions. | Ronald S. Bultje | 2012-07-26 |
| | |||
* | float_dsp: add x86-optimized functions for vector_fmac_scalar() | Justin Ruggles | 2012-06-18 |
| | |||
* | Add a float DSP framework to libavutil | Justin Ruggles | 2012-06-08 |
Move vector_fmul() from DSPContext to AVFloatDSPContext. |