path: root/libavcodec/x86
Commit message (Author, Age)
...
* x86: dsputil: Move rv40-specific functions where they belong (Diego Biurrun, 2013-05-07)
* x86: dsputil hpeldsp: Move shared template functions into separate object (Diego Biurrun, 2013-05-07)
* x86: rnd_template: Eliminate pointless OP_AVG macro indirection (Diego Biurrun, 2013-05-07)
* x86: hpeldsp: Move avg_pixels8_x2_mmx() out of hpeldsp_rnd_template.c (Diego Biurrun, 2013-05-06)
  The function is only instantiated once, so there is no point in keeping it in a template file.
* x86: hpeldsp: Only compile MMX hpeldsp code if MMX is enabled (Diego Biurrun, 2013-05-06)
* x86: More specific ifdefs for dsputil/hpeldsp init functions (Diego Biurrun, 2013-05-06)
* avcodec: Add av_cold attributes to init functions missing them (Diego Biurrun, 2013-05-04)
* silly typo fixes (Diego Biurrun, 2013-05-03)
* x86: sbrdsp: Implement SSE2 qmf_deint_bfly (Christophe Gisquet, 2013-05-03)
  Sandybridge: 47 cycles
  Having a loop counter is a 7 cycle gain.
  Unrolling is another 7 cycle gain.
  Working in reverse scan is another 6 cycles.
  Signed-off-by: Diego Biurrun <diego@biurrun.de>
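For orientation, the scalar operation that an SSE2 qmf_deint_bfly would vectorize can be sketched in C as below. This is an assumption based on the function name and the usual shape of the SBR QMF deinterleaving butterfly, not code taken from this log; the name `qmf_deint_bfly_sketch` and the 64/128 element sizes are illustrative. It shows why scan direction can matter: one input is read forwards, the other backwards, and the output is filled from both ends.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical scalar sketch, not the committed code:
 * v receives 128 floats, src0 and src1 each provide 64. */
static void qmf_deint_bfly_sketch(float *v, const float *src0, const float *src1)
{
    assert(v != NULL && src0 != NULL && src1 != NULL);
    for (int i = 0; i < 64; i++) {
        /* src0 is scanned forwards, src1 backwards */
        const float a = src0[i];
        const float b = src1[63 - i];
        v[i]       = a - b; /* lower half, written front to back */
        v[127 - i] = a + b; /* upper half, written back to front */
    }
}
```

A vectorized version would process several values of `i` per iteration, which is where a loop counter, unrolling and the scan direction become tuning choices.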
* x86: dsputil: Move cavs and vc1-specific functions where they belong (Diego Biurrun, 2013-05-02)
* x86: dsputil: Move avg_pixels16_mmx() out of rnd_template.c (Diego Biurrun, 2013-05-02)
  The function does not do any rounding, so there is no point in keeping it in a round template file.
* x86: dsputil: Move avg_pixels8_mmx() out of rnd_template.c (Diego Biurrun, 2013-05-02)
  The function is only instantiated once, so there is no point in keeping it in a template file.
* x86: Move duplicated put_pixels{8|16}_mmx functions into their own file (Diego Biurrun, 2013-05-02)
* x86: Drop unnecessary ff_ name prefixes from static functions (Diego Biurrun, 2013-04-30)
* mpegaudiosp: More consistent names for ppc/x86 optimization files (Diego Biurrun, 2013-04-30)
* x86: dsputil: Remove a set of pointless #ifs around function declarations (Diego Biurrun, 2013-04-30)
* x86: dsputil: cosmetics: Group ff_{avg|put}_pixels16_mmxext() declarations (Diego Biurrun, 2013-04-30)
* x86: hpeldsp: Remove unused macro definitions (Diego Biurrun, 2013-04-29)
* x86: ac3dsp: Remove 3dnow version of ff_ac3_extract_exponents (Diego Biurrun, 2013-04-26)
  The function requires increasing the fuzz factor for the ac3/eac3 encode tests and even so makes fate fail. It only provides a slight encoding speedup for legacy CPUs that do not support SSE2. Thus its benefit is not worth the trouble it creates and fixing it would be a waste of time.
* x86: Rename dsputil_rnd_template.c to rnd_template.c (Martin Storsjö, 2013-04-25)
  This makes it less confusing when this template is shared both by dsputil and by hpeldsp.
  Signed-off-by: Martin Storsjö <martin@martin.st>
* x86: Get rid of duplication between *_rnd_template.c (Martin Storsjö, 2013-04-23)
  Signed-off-by: Martin Storsjö <martin@martin.st>
* x86: Factorize duplicated inline assembly snippets (Martin Storsjö, 2013-04-23)
  Signed-off-by: Diego Biurrun <diego@biurrun.de>
* x86: Move some conditional code around to avoid unused variable warnings (Diego Biurrun, 2013-04-22)
* x86: cavs: Refactor duplicate dspfunc macro (Diego Biurrun, 2013-04-22)
* x86: cavs: Put mmx-specific code into its own init function (Diego Biurrun, 2013-04-22)
  Before, this code was labeled as mmxext and enabled both for the 3dnow and the mmxext case.
* x86: Remove some duplicate function declarations (Diego Biurrun, 2013-04-22)
* x86: Remove unused inline asm instruction defines (Martin Storsjö, 2013-04-20)
  Signed-off-by: Martin Storsjö <martin@martin.st>
* x86: hpeldsp: Move half-pel assembly from dsputil to hpeldsp (Ronald S. Bultje, 2013-04-19)
  Signed-off-by: Martin Storsjö <martin@martin.st>
* vp3: Use full transpose for all IDCTs (Ronald S. Bultje, 2013-04-15)
  This way, the special IDCT permutations are no longer needed. This is similar to how H264 does it, and removes the dsputil dependency imposed by the scantable code.
  Also remove the unused type == 0 cases from the plain C version of the idct.
  Signed-off-by: Martin Storsjö <martin@martin.st>
* x86: Move constants to the only place where they are used (Ronald S. Bultje, 2013-04-15)
  Signed-off-by: Martin Storsjö <martin@martin.st>
* x86: dsputil: Move some ifdefs to avoid unused variable warnings (Diego Biurrun, 2013-04-12)
* x86: dsputil: cosmetics: Remove two pointless variable indirections (Diego Biurrun, 2013-04-12)
* x86: dsputil: Refactor some ff_{avg|put}_pixels function declarations (Diego Biurrun, 2013-04-12)
* x86: dsputil: ff_h263_*_loop_filter declarations to a more suitable place (Diego Biurrun, 2013-04-12)
* x86: h264qpel: int --> ptrdiff_t for some line_size parameters (Diego Biurrun, 2013-04-12)
* Move misplaced file author information where it belongs (Diego Biurrun, 2013-04-11)
* dsputil: Make dsputil selectable (Ronald S. Bultje, 2013-04-10)
  Signed-off-by: Martin Storsjö <martin@martin.st>
* h264: Integrate clear_blocks calls with IDCT (Ronald S. Bultje, 2013-04-10)
  The non-intra-pcm branch in hl_decode_mb (simple, 8bpp) goes from 700 to 672 cycles, and the complete loop of decode_mb_cabac and hl_decode_mb (in the decode_slice loop) goes from 1759 to 1733 cycles on the clip tested (cathedral), i.e. almost 30 cycles per mb faster.
  Signed-off-by: Martin Storsjö <martin@martin.st>
* x86: qpel: Move fullpel and l2 functions to a separate file (Ronald S. Bultje, 2013-04-08)
  This way, they can be shared between mpeg4qpel and h264qpel without requiring either one to be compiled unconditionally.
  Signed-off-by: Martin Storsjö <martin@martin.st>
* x86: sbrdsp: Implement SSE neg_odd_64 (Christophe Gisquet, 2013-04-05)
  Timing on Arrandale:
             C    SSE
    Win32:  57     44
    Win64:  47     38
  Unrolling and not storing mask both save some cycles.
  Signed-off-by: Diego Biurrun <diego@biurrun.de>
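As a rough illustration of what a neg_odd_64 routine does, the C sketch below flips the sign of every odd-indexed element in a 64-float array by XORing the sign bit. The function name `neg_odd_64_sketch` is hypothetical, the code assumes 32-bit IEEE 754 floats, and it is not the code from the commit; a SIMD version would apply the same sign mask to several floats per instruction.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical scalar sketch, not the committed code:
 * negate x[1], x[3], ..., x[63] in place. */
static void neg_odd_64_sketch(float *x)
{
    const uint32_t sign_mask = UINT32_C(1) << 31; /* IEEE 754 sign bit */

    assert(x != NULL);
    for (int i = 1; i < 64; i += 2) {
        uint32_t bits;
        /* memcpy avoids undefined behavior from pointer type punning */
        memcpy(&bits, &x[i], sizeof(bits));
        bits ^= sign_mask;
        memcpy(&x[i], &bits, sizeof(bits));
    }
}
```

Flipping the sign bit with XOR rather than multiplying by -1 keeps the operation a pure bitwise one, which maps directly onto a single SIMD XOR against a constant mask.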
* cosmetics: Remove unnecessary extern keywords from function declarations (Diego Biurrun, 2013-03-27)
* x86: vc1dsp: Fix indentation (Martin Storsjö, 2013-03-26)
  Signed-off-by: Martin Storsjö <martin@martin.st>
* x86: consistently use unaligned movs in the unaligned bswap (Janne Grunau, 2013-03-25)
  Fixes fate errors in asv1, ffvhuff and huffyuv on x86_32.
* x86: Change a missed occurrance of int to ptrdiff_t for strides (Martin Storsjö, 2013-03-24)
  Signed-off-by: Martin Storsjö <martin@martin.st>
* x86: Remove win64 xmm clobbering wrappers for the now removed avcodec_encode_video function (Martin Storsjö, 2013-03-23)
  Signed-off-by: Martin Storsjö <martin@martin.st>
* dsputil: convert remaining functions to use ptrdiff_t strides (Luca Barbato, 2013-03-12)
  Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
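The stride conversions in this log can be illustrated with a small C sketch. The helper name `put_pixels8_sketch` and its signature are hypothetical, chosen only to resemble the kind of block-copy function involved. The point it shows is that a stride is added to a pointer, so a pointer-width signed type such as ptrdiff_t fits naturally; on 64-bit targets a 32-bit int stride would first need sign extension before being used in address arithmetic, which hand-written assembly has to do explicitly.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical helper, not taken from the codebase:
 * copy an 8-pixel-wide block of h rows. line_size is the distance in
 * bytes between the starts of consecutive rows and may be negative. */
static void put_pixels8_sketch(uint8_t *dst, const uint8_t *src,
                               ptrdiff_t line_size, int h)
{
    assert(dst != NULL && src != NULL && h >= 0);
    for (int y = 0; y < h; y++) {
        /* y * line_size is evaluated in ptrdiff_t, so the offset is
         * pointer-width and signed */
        memcpy(dst + y * line_size, src + y * line_size, 8);
    }
}
```

Calling it with a negative `line_size` and pointers to the last row walks the block bottom-up, which is one reason the stride type has to be signed.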
* Revert "Move H264/QPEL specific asm from dsputil.asm to h264_qpel_*.asm." (Diego Biurrun, 2013-02-28)
  This reverts commit f90ff772e7e35b4923c2de429d1fab9f2569b568.
  The code should be put back in h264_qpel_8bit.asm, but unfortunately it is unconditionally used from dsputil_mmx.c since 71155d7.
* x86: dsputil: Drop some unused function #defines (Diego Biurrun, 2013-02-26)
* x86: dsputil: Drop aliasing of ff_put_pixels8_mmx to ff_put_pixels8_mmxext (Diego Biurrun, 2013-02-26)
  The external assembly function uses mmxext instructions and should not be masqueraded as an mmx-only function. Instead, use the mmx-only inline assembly function.
* x86: vc1dsp: Move ff_avg_vc1_mspel_mc00_mmxext out of dsputil_mmx.c (Diego Biurrun, 2013-02-26)