summaryrefslogtreecommitdiff
path: root/libavcodec/ppc
Commit message (Collapse)AuthorAge
...
* ppc: dsputil: comment formatting and wording/grammar improvementsDiego Biurrun2014-03-20
|
* ppc: Add some missing headersDiego Biurrun2014-03-13
|
* dsputil: Remove prototypes for nonexisting optimization functionsDiego Biurrun2014-03-13
|
* ppc: reduce overreads when loading 8 pixels in altivec dsp functionsJanne Grunau2014-02-14
| | | | | | | | | | | Altivec can only load naturally aligned vectors. To handle possibly unaligned data a second vector is loaded from an offset of the original location and the data is recovered through a vector permutation. Overreads are minimal if the offset for second load points to the last element of data. This is 7 for loading eight 8-bit pixels and overreads are reduced from 16 bytes to 8 bytes if the pixels are 64-bit aligned. For unaligned pixels the overread is reduced from 23 bytes to 15 bytes in the worst case.
* vp8: fix PPC assembly to work if src_stride != dst_strideRonald S. Bultje2014-02-09
| | | | | Signed-off-by: Anton Khirnov <anton@khirnov.net> Signed-off-by: Janne Grunau <janne-libav@jannau.net>
* h264: do not use 422 functions for monochromeAnton Khirnov2014-01-06
| | | | | | | Fixes invalid memory access. Found-by: Mateusz "j00ru" Jurczyk and Gynvael Coldwind CC:libav-stable@libav.org
* ppc: cosmetics: Consistently format CPU flag detection invocationsDiego Biurrun2013-08-29
|
* ppc: Add missing AltiVec cpuflag detection invocationsDiego Biurrun2013-08-29
|
* ppc: fdct: Remove vim editor settings commentDiego Biurrun2013-08-28
|
* ppc: Add and use convenience macro to check for AltiVec availabilityDiego Biurrun2013-08-28
|
* altivec: perform an explicit unaligned loadKostya Shishkov2013-08-16
| | | | | | | | Implicit vector loads on POWER7 hardware can use the VSX instruction set instead of classic Altivec/VMX. Let's force a VMX load in this case. Signed-off-by: Martin Storsjö <martin@martin.st>
* Consistently use "cpu_flags" as variable/parameter name for CPU flagsDiego Biurrun2013-07-18
|
* fmtconvert: Explicitly use int32_t instead of intChristophe Gisquet2013-07-17
| | | | Signed-off-by: Martin Storsjö <martin@martin.st>
* fix scalarproduct_and_madd_int16_altivec() for orders > 16Kostya Shishkov2013-05-26
| | | | the second and third sources were incremented only by half of the needed size
* ppc: Only compile AltiVec FFT assembly when AltiVec is enabledDiego Biurrun2013-05-02
|
* ppc: Drop unnecessary ff_ name prefixes from static functionsDiego Biurrun2013-04-30
|
* ppc: More consistent arch initializationDiego Biurrun2013-04-30
|
* ppc: Move AltiVec utility headers out of AltiVec ifdefsDiego Biurrun2013-04-30
| | | | | Now that the headers themselves have ifdef protection this is no longer necessary and more consistent with normal include handling.
* ppc: More consistent names for H.264 optimizations filesDiego Biurrun2013-04-30
|
* mpegaudiosp: More consistent names for ppc/x86 optimization filesDiego Biurrun2013-04-30
|
* ppc: hpeldsp: Include attributes.hMartin Storsjö2013-04-20
| | | | | | This fixes building in configurations where altivec is disabled. Signed-off-by: Martin Storsjö <martin@martin.st>
* ppc: hpeldsp: Move half-pel assembly from dsputil to hpeldspRonald S. Bultje2013-04-19
| | | | Signed-off-by: Martin Storsjö <martin@martin.st>
* vp3: Use full transpose for all IDCTsRonald S. Bultje2013-04-15
| | | | | | | | | | | This way, the special IDCT permutations are no longer needed. This is similar to how H264 does it, and removes the dsputil dependency imposed by the scantable code. Also remove the unused type == 0 cases from the plain C version of the idct. Signed-off-by: Martin Storsjö <martin@martin.st>
* h264: Integrate clear_blocks calls with IDCTRonald S. Bultje2013-04-10
| | | | | | | | | The non-intra-pcm branch in hl_decode_mb (simple, 8bpp) goes from 700 to 672 cycles, and the complete loop of decode_mb_cabac and hl_decode_mb (in the decode_slice loop) goes from 1759 to 1733 cycles on the clip tested (cathedral), i.e. almost 30 cycles per mb faster. Signed-off-by: Martin Storsjö <martin@martin.st>
* dsputil: convert remaining functions to use ptrdiff_t stridesLuca Barbato2013-03-12
| | | | Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* Remove unnecessary dsputil.h #includesDiego Biurrun2013-02-26
|
* dsputil: Move LOCAL_ALIGNED macros to libavutilDiego Biurrun2013-02-08
|
* dsputil: Separate h264chromaDiego Biurrun2013-02-06
|
* Add av_cold attributes to arch-specific init functionsDiego Biurrun2013-02-05
|
* Use ptrdiff_t instead of int for {avg, put}_pixels line_size parameter.Diego Biurrun2013-02-05
| | | | | This avoids SIMD-optimized functions having to sign-extend their line size argument manually to be able to do pointer arithmetic.
* ppc: fmtconvert: Drop two unused variables.Diego Biurrun2013-02-01
|
* dsputil: Separate h264 qpelMans Rullgard2013-01-24
| | | | | | | | | | The sh4 optimizations are removed, because the code is 100% identical to the C code, so it is unlikely to provide any real practical benefit. Signed-off-by: Diego Biurrun <diego@biurrun.de> Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com> Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* Drop DCTELEM typedefDiego Biurrun2013-01-22
| | | | | | It does not help as an abstraction and adds dsputil dependencies. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
* floatdsp: move vector_fmul_reverse from dsputil to avfloatdsp.Ronald S. Bultje2013-01-22
| | | | | | Now, nellymoserenc and aacenc no longer depends on dsputil. Independent of this patch, wmaprodec also does not depend on dsputil, so I removed it from there also.
* floatdsp: move vector_fmul_add from dsputil to avfloatdsp.Ronald S. Bultje2013-01-22
|
* vorbisdsp: change block_size type from int to intptr_t.Ronald S. Bultje2013-01-20
| | | | This saves one instruction in the x86-64 assembly.
* ppc: vorbisdsp: Drop some unnecessary #includesDiego Biurrun2013-01-20
| | | | Also fixes compilation with AltiVec disabled.
* ppc: Include string.h for memsetMartin Storsjö2013-01-20
| | | | | | | This fixes build failures on ppc machines with a compiler that supports -Werror=implicit-function-declaration. Signed-off-by: Martin Storsjö <martin@martin.st>
* Move vorbis_inverse_coupling from dsputil to vorbisdspcontext.Ronald S. Bultje2013-01-19
| | | | | Conveniently (together with Justin's earlier patches), this makes our vorbis decoder entirely independent of dsputil.
* vp3: integrate clear_blocks with idct of previous block.Ronald S. Bultje2013-01-19
| | | | | | | | | This is identical to what e.g. vp8 does, and prevents the function call overhead (plus dependency on dsputil for this particular function). Arm asm updated by Janne Grunau <janne-libav@jannau.net>. Signed-off-by: Janne Grunau <janne-libav@jannau.net>
* lavc: Move vector_fmul_window to AVFloatDSPContextJustin Ruggles2013-01-16
| | | | Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* lavc: introduce VideoDSPContextRonald S. Bultje2012-12-20
| | | | | | | | Move some functions from dsputil. The idea is that videodsp contains functions that are useful for a large and varied set of video decoders. Currently, it contains emulated_edge_mc() and prefetch(). Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* ppc: replace pointer casting with AV_COPY32Mans Rullgard2012-11-12
| | | | | | This removes warnings about strict aliasing violations. Signed-off-by: Mans Rullgard <mans@mansr.com>
* ppc: fix some unused variable warningsMans Rullgard2012-11-12
| | | | | | | The third argument of OP_U8_ALTIVEC is evaluated at most once so there is no need for a potentially unused temporary variable. Signed-off-by: Mans Rullgard <mans@mansr.com>
* build: non-x86: Only compile mpegvideo optimizations when necessaryDiego Biurrun2012-10-09
|
* ppc: fix Altivec build with old compilersMans Rullgard2012-10-08
| | | | | | | | | The vec_splat() intrinsic requires a constant argument for the element number, and the code relies on the compiler unrolling the loop to provide this. Manually unrolling the loop avoids this reliance and works with all compilers. Signed-off-by: Mans Rullgard <mans@mansr.com>
* ppc: fmtconvert: kill VLA in float_to_int16_interleave_altivec()Mans Rullgard2012-10-05
| | | | Signed-off-by: Mans Rullgard <mans@mansr.com>
* Add more missing includes after removing the implicit common.hMartin Storsjö2012-08-16
| | | | Signed-off-by: Martin Storsjö <martin@martin.st>
* Add some more missing includes after removing the implicit common.hMartin Storsjö2012-08-15
| | | | Signed-off-by: Martin Storsjö <martin@martin.st>
* Don't include common.h from avutil.hMartin Storsjö2012-08-15
| | | | Signed-off-by: Martin Storsjö <martin@martin.st>