Commit message | Author | Age | ||
---|---|---|---|---|
... | ||||
* | dsputil: Split off quarterpel bits into their own context | Diego Biurrun | 2014-05-29 | |
| | ||||
* | dsputil: Move APE-specific bits into apedsp | Diego Biurrun | 2014-05-29 | |
| | ||||
* | dsputil: Move SVQ1 encoding specific bits into svq1enc | Diego Biurrun | 2014-05-29 | |
| | ||||
* | dsputil: Split off HuffYUV encoding bits into their own context | Diego Biurrun | 2014-05-27 | |
| | | | | Also shorten HuffYUV context member names to avoid clutter. | |||
* | dsputil: Split off HuffYUV decoding bits into their own context | Diego Biurrun | 2014-05-27 | |
| | | | | Also shorten HuffYUV context member names to avoid clutter. | |||
* | x86/synth_filter: remove the fma3 version ifdefs | James Almer | 2014-04-13 | |
| | | | | | | | This fixes compilation failures with --disable-fma3. Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Anton Khirnov <anton@khirnov.net> | |||
* | DNxHD: convert inline asm to yasm | Timothy Gu | 2014-04-11 | |
| | ||||
* | DNxHD: make get_pixel_8x4_sym accept ptrdiff_t as stride | Timothy Gu | 2014-04-11 | |
| | ||||
* | x86: dsputil: Move ff_apply_window_int16_* bits to ac3dsp, where they belong | Diego Biurrun | 2014-04-04 | |
| | ||||
* | x86: h264_qpel: Simplify an #if conditional | Diego Biurrun | 2014-04-04 | |
| | | | | The extra conditions are covered by previous #ifs and conditional compilation. | |||
* | x86: Drop some unnecessary YASM ifdefs | Diego Biurrun | 2014-04-04 | |
| | | | | Dead code elimination is enough to avoid undefined references in these cases. | |||
* | x86: dsputil: Eliminate some unnecessary dsputil_x86.h #includes | Diego Biurrun | 2014-04-04 | |
| | ||||
* | Remove a number of unnecessary dsputil.h #includes | Diego Biurrun | 2014-04-04 | |
| | ||||
* | x86/synth_filter: add synth_filter_fma3 | James Almer | 2014-04-04 | |
| | | | | | Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Anton Khirnov <anton@khirnov.net> | |||
* | x86/synth_filter: add synth_filter_avx | James Almer | 2014-04-04 | |
| | | | | | | | | | | | | Sandy Bridge Win64: 180 cycles in ff_synth_filter_inner_sse2, 150 cycles in ff_synth_filter_inner_avx. Also switch some instructions to a three operand format to avoid assembly errors with Yasm 1.1.0 or older. Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Anton Khirnov <anton@khirnov.net> | |||
* | x86/synth_filter: add synth_filter_sse | James Almer | 2014-04-04 | |
| | | | | | | | Build only on x86_32 targets. Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Anton Khirnov <anton@khirnov.net> | |||
* | On2 VP7 decoder | Peter Ross | 2014-04-04 | |
| | | | | | | | | | Further performance improvements and security fixes by Vittorio Giovara, Luca Barbato and Diego Biurrun. Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com> Signed-off-by: Luca Barbato <lu_zero@gentoo.org> Signed-off-by: Diego Biurrun <diego@biurrun.de> | |||
* | x86: hpeldsp: Keep all rnd_template instantiations in hpeldsp_init | Diego Biurrun | 2014-03-26 | |
| | | | | | There is no point in having a separate file just for the instantiation that provides the public functions. | |||
* | Add missing headers to make template files compile (more) standalone | Diego Biurrun | 2014-03-26 | |
| | ||||
* | x86: h264_qpel: Fix typo in CALL_2X_PIXELS macro invocation | Diego Biurrun | 2014-03-26 | |
| | | | | This fixes FATE with mmxext CPUFLAGS set. | |||
* | x86: dsputil: Move hpeldsp-related declarations to a separate header | Diego Biurrun | 2014-03-22 | |
| | ||||
* | x86: dsputil: Move fpel declarations to a separate header | Diego Biurrun | 2014-03-22 | |
| | ||||
* | dsputil: Refactor duplicated CALL_2X_PIXELS / PIXELS16 macros | Diego Biurrun | 2014-03-22 | |
| | ||||
* | imgconvert: Move ff_deinterlace_line_*_mmx declarations out of dsputil | Diego Biurrun | 2014-03-22 | |
| | ||||
* | x86: dsputil: Move inline assembly macros to a separate header | Diego Biurrun | 2014-03-22 | |
| | ||||
* | dsputil: Use correct type in me_cmp_func function pointer | Diego Biurrun | 2014-03-20 | |
| | ||||
* | build: Group general components separate from de/encoders in arch Makefiles | Diego Biurrun | 2014-03-20 | |
| | | | | This is in line with how the top-level libavcodec Makefile is structured. | |||
* | dsputil: Propagate bit depth information to all (sub)init functions | Diego Biurrun | 2014-03-20 | |
| | | | | This avoids recalculating the value over and over again. | |||
* | x86: dsputil_init: Drop some unnecessary parentheses | Diego Biurrun | 2014-03-13 | |
| | ||||
* | x86: dsputil_init: K&R formatting cosmetics | Diego Biurrun | 2014-03-13 | |
| | ||||
* | x86: dsputil_x86.h: K&R formatting cosmetics | Diego Biurrun | 2014-03-13 | |
| | ||||
* | x86: motion_est: K&R formatting cosmetics | Diego Biurrun | 2014-03-13 | |
| | ||||
* | dsputilenc_mmx: K&R formatting cosmetics | Diego Biurrun | 2014-03-13 | |
| | ||||
* | dsputil_mmx: K&R formatting cosmetics | Diego Biurrun | 2014-03-13 | |
| | ||||
* | dsputilenc_mmx: Merge two assignment blocks with identical conditions | Diego Biurrun | 2014-03-13 | |
| | ||||
* | x86: Make function prototype comments in assembly code consistent | Diego Biurrun | 2014-03-13 | |
| | | | | This helps grepping for functions, among other things. | |||
* | x86: h264_idct_10_bit: Use proper type in function prototype comments | Diego Biurrun | 2014-03-13 | |
| | ||||
* | Update dsputil- and SIMD-related comments to match reality more closely | Diego Biurrun | 2014-03-13 | |
| | ||||
* | x86: Add some more missing headers | Diego Biurrun | 2014-03-13 | |
| | ||||
* | x86: mpegvideoenc: Remove some remnants of the long-gone libmpeg2 IDCT | Diego Biurrun | 2014-03-13 | |
| | ||||
* | x86: dcadsp: Fix linking with yasm and optimizations disabled | Diego Biurrun | 2014-03-05 | |
| | | | | | Some optimized functions reference optimized symbols, so the functions must be explicitly disabled when those symbols are unavailable. | |||
* | x86: cabac: Use correct #includes to make header compile standalone | Diego Biurrun | 2014-03-05 | |
| | ||||
* | dcadec: simplify decoding of VQ high frequencies | Christophe Gisquet | 2014-02-28 | |
| | | | | | | | | | | | | | | | | | | | The vector dequantization has a test in a loop preventing effective SIMD implementation. By moving it out of the loop, this loop can be DSPized. Therefore, modify the current DSP implementation. In particular, the DSP implementation no longer has to handle null loop sizes. The decode_hf implementations have the following timings on x86 Arrandale (C / SSE / SSE2 / SSE4): win32: 260 / 162 / 119 / 104; win64: 242 / N/A / 89 / 72. The arm NEON optimizations follow in a later patch as external asm. The now unused check for the y modifier in arm inline asm is removed from configure. | |||
* | x86: synth filter float: implement SSE2 version | Christophe Gisquet | 2014-02-28 | |
| | | | | | | | | | | | | | | Timings for Arrandale (C / SSE): win32: 2108 / 334; win64: 1152 / 322. Factorizing the inner loop with a call/jmp is a >15 cycles cost, even with the jmp destination being aligned. Unrolling for ARCH_X86_64 is a 20 cycles gain. Signed-off-by: Janne Grunau <janne-libav@jannau.net> | |||
* | x86: dcadsp: implement SSE lfe_dir | Christophe Gisquet | 2014-02-28 | |
| | | | | | | | | Results for Arrandale/Windows: 32: 1670 -> 316; 64: 728 -> 298. Signed-off-by: Janne Grunau <janne-libav@jannau.net> | |||
* | prores: Use consistent names for DSP arch initialization functions | Diego Biurrun | 2014-02-28 | |
| | ||||
* | x86: dsputil: Use correct file name as multiple inclusion guard | Diego Biurrun | 2014-02-20 | |
| | ||||
* | x86: dca: Add missing multiple inclusion guards | Diego Biurrun | 2014-02-19 | |
| | ||||
* | dca: include dcadsp.h in {arm,x86}/dca.h for checkheaders | Janne Grunau | 2014-02-08 | |
| | ||||
* | x86: use the inline int8x8_fmul_int32 only if inline SSE2 is available | Janne Grunau | 2014-02-08 | |
| | | | | | Fixes compilation with MSVC. Also does not rely on an earlier config.h include but includes it directly. |