path: root/libavcodec/x86
Commit message (author, date)
* x86: hpel: Move {avg,put}_pixels16_sse2 to hpeldsp (Diego Biurrun, 2013-01-31)
* x86: Simplify some arch conditionals (Diego Biurrun, 2013-01-29)
* x86: hpeldsp: Fix a typo, use the right register (Michael Niedermayer, 2013-01-28)
    This makes the code actually work.
    Signed-off-by: Martin Storsjö <martin@martin.st>
* dsputil: Fix error by not using redzone and register name (Daniel Kang, 2013-01-28)
    Signed-off-by: Derek Buitenhuis <derek.buitenhuis@gmail.com>
    Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* dsputil: x86: Correct the number of registers used in put_no_rnd_pixels16_l2 (Daniel Kang, 2013-01-27)
    put_no_rnd_pixels16_l2 allocated 5 instead of 6 registers.
    Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* dsputil: add missing HAVE_YASM guard (Daniel Kang, 2013-01-27)
    Fix compile error under "--disable-optimizations --disable-yasm --disable-inline-asm"
    Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* dsputil: x86: Convert mpeg4 qpel and dsputil avg to yasm (Daniel Kang, 2013-01-27)
    Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* Move H264/QPEL specific asm from dsputil.asm to h264_qpel_*.asm. (Ronald S. Bultje, 2013-01-26)
* x86: h264qpel: Move stray comment to the right spot and clarify it (Diego Biurrun, 2013-01-26)
* x86: h264qpel: add cpu flag checks for init function (Janne Grunau, 2013-01-24)
    The code was copied from per cpu extension init function so the checks for supported extensions was overlooked.
* dsputil: Separate h264 qpel (Mans Rullgard, 2013-01-24)
    The sh4 optimizations are removed, because the code is 100% identical to the C code, so it is unlikely to provide any real practical benefit.
    Signed-off-by: Diego Biurrun <diego@biurrun.de>
    Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
    Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* dsputil: remove one array dimension from avg_no_rnd_pixels_tab. (Ronald S. Bultje, 2013-01-22)
* dsputil: remove avg_no_rnd_pixels8. (Ronald S. Bultje, 2013-01-22)
    This is never used.
* Drop DCTELEM typedef (Diego Biurrun, 2013-01-22)
    It does not help as an abstraction and adds dsputil dependencies.
    Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* vorbisdsp: convert x86 simd functions from inline asm to yasm. (Ronald S. Bultje, 2013-01-22)
* floatdsp: move scalarproduct_float from dsputil to avfloatdsp. (Ronald S. Bultje, 2013-01-22)
    This makes the aac decoder and all voice codecs independent of dsputil.
* floatdsp: move vector_fmul_reverse from dsputil to avfloatdsp. (Ronald S. Bultje, 2013-01-22)
    Now, nellymoserenc and aacenc no longer depends on dsputil. Independent of this patch, wmaprodec also does not depend on dsputil, so I removed it from there also.
* floatdsp: move vector_fmul_add from dsputil to avfloatdsp. (Ronald S. Bultje, 2013-01-22)
* x86: ac3: Fix HAVE_MMXEXT condition to only refer to external assembly (Diego Biurrun, 2013-01-21)
    CC: libav-stable@libav.org
* dsputilenc: x86: Convert pixel inline asm to yasm (Daniel Kang, 2013-01-21)
    Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* vorbisdsp: change block_size type from int to intptr_t. (Ronald S. Bultje, 2013-01-20)
    This saves one instruction in the x86-64 assembly.
* dsputil: remove butterflies_float_interleave. (Ronald S. Bultje, 2013-01-20)
    The function is unused.
* dsputil: drop non-compliant "fast" qpel mc functions (Mans Rullgard, 2013-01-20)
    Signed-off-by: Diego Biurrun <diego@biurrun.de>
* Move vorbis_inverse_coupling from dsputil to vorbisdspcontext. (Ronald S. Bultje, 2013-01-19)
    Conveniently (together with Justin's earlier patches), this makes our vorbis decoder entirely independent of dsputil.
* vp3: integrate clear_blocks with idct of previous block. (Ronald S. Bultje, 2013-01-19)
    This is identical to what e.g. vp8 does, and prevents the function call overhead (plus dependency on dsputil for this particular function).
    Arm asm updated by Janne Grunau <janne-libav@jannau.net>.
    Signed-off-by: Janne Grunau <janne-libav@jannau.net>
* x86: dsputil: Drop some unused macro definitions (Diego Biurrun, 2013-01-18)
* lavc: Move vector_fmul_window to AVFloatDSPContext (Justin Ruggles, 2013-01-16)
    Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* x86: Add PAVGB macro to abstract pavgb/pavgusb instruction via cpuflags (Diego Biurrun, 2013-01-15)
* x86: ABS2: port to cpuflags (Diego Biurrun, 2013-01-14)
* Drop Snow codec (Diego Biurrun, 2013-01-06)
    Snow is a toy codec with no real-world use and horrible code.
* x86: sbrdsp: Implement SSE qmf_post_shuffle (Christophe Gisquet, 2013-01-06)
    255 to 174 cycles on Arrandale / Win64. Unrolling yields no gain.
    Signed-off-by: Diego Biurrun <diego@biurrun.de>
* x86: sbrdsp: Implement SSE sum64x5 (Christophe Gisquet, 2013-01-06)
    698 to 174 cycles on Arrandale. Unrolling is a 6 cycles gain.
    Signed-off-by: Diego Biurrun <diego@biurrun.de>
* x86: ABS1: port to cpuflags (Diego Biurrun, 2013-01-06)
* lavc: introduce VideoDSPContext (Ronald S. Bultje, 2012-12-20)
    Move some functions from dsputil. The idea is that videodsp contains functions that are useful for a large and varied set of video decoders. Currently, it contains emulated_edge_mc() and prefetch().
    Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* x86inc: support stack mem allocation and re-alignment in PROLOGUE (Ronald S. Bultje, 2012-12-12)
    Use this in VP8/H264-8bit loopfilter functions so they can be used if there is no aligned stack (e.g. MSVC 32bit or ICC 10.x).
    Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* ac3dec: make downmix() take array of pointers to channel data (Mans Rullgard, 2012-12-09)
* SBR DSP x86: implement SSE sbr_hf_gen (Christophe Gisquet, 2012-12-07)
    Start and end index are multiple of 2, therefore guaranteeing aligned access. Also, this allows to generate 4 floats per loop, keeping the alignment all along.
    Timing:
    - 32 bits: 326c -> 172c
    - 64 bits: 323c -> 156c
    Signed-off-by: Diego Biurrun <diego@biurrun.de>
* x86: dsputilenc: port to cpuflags (Diego Biurrun, 2012-11-28)
* x86: h264dsp: Fix linking with yasm and optimizations disabled (Diego Biurrun, 2012-11-28)
    Some optimized functions reference optimized symbols, so the functions must be explicitly disabled when those symbols are unavailable.
* x86: h264_idct: port to cpuflags (Diego Biurrun, 2012-11-28)
* x86: h264_weight: port to cpuflags (Diego Biurrun, 2012-11-27)
* x86: fix build without inline asm (Diego Biurrun, 2012-11-26)
    The qpel functions referenced here are not related to h264 and should thus never have been under CONFIG_H264QPEL.
    Signed-off-by: Mans Rullgard <mans@mansr.com>
    Signed-off-by: Diego Biurrun <diego@biurrun.de>
* x86: h264 qpel: use the correct number of utilized xmm regs in cglobal (Justin Ruggles, 2012-11-25)
    Fixes xmm register clobbering on win64.
* x86: h264: Convert 8-bit QPEL inline assembly to YASM (Daniel Kang, 2012-11-25)
    Signed-off-by: Diego Biurrun <diego@biurrun.de>
* x86: h264: Remove 3dnow QPEL code (Daniel Kang, 2012-11-25)
    The only CPUs that have 3dnow and don't have mmxext are 12 years old. Moreover, AMD has dropped 3dnow extensions from newer CPUs.
    Signed-off-by: Diego Biurrun <diego@biurrun.de>
* x86: h264_chromamc: port to cpuflags (Diego Biurrun, 2012-11-25)
* x86: h264_intrapred: Fix C function names in comments (Diego Biurrun, 2012-11-18)
    Function names changed after switching to declaration with PRED4x4/8x8/8x8L/16x16 macros in the C code.
* x86: SPLATD: port to cpuflags (Diego Biurrun, 2012-11-18)
* x86: dsputil: port to cpuflags (Diego Biurrun, 2012-11-16)
* x86: mmx2 ---> mmxext in asm constructs (Diego Biurrun, 2012-11-14)