path: root/libavcodec/x86
Commit message (Author, Age)
* x86/sbrdsp: remove an unnecessary mova in sbr_autocorrelate (James Almer, 2015-08-06)
|   Signed-off-by: James Almer <jamrial@gmail.com>
* x86inc: Drop SECTION_TEXT macro (Henrik Gramner, 2015-08-04)
|   The .text section is already 16-byte aligned by default on all supported
|   platforms so `SECTION_TEXT` isn't any different from `SECTION .text`.
* x86inc: Support arbitrary stack alignments (Henrik Gramner, 2015-08-04)
|   Change ALLOC_STACK to always align the stack before allocating stack space
|   for consistency. Previously alignment would occur either before or after
|   allocating stack space depending on whether manual alignment was required
|   or not.
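The ordering described in the ALLOC_STACK commit above (align first, then allocate) can be illustrated in C. This is a minimal sketch and not the macro's actual logic: `alloc_stack_sketch` and its parameters are hypothetical names, the stack pointer is modeled as a plain integer, and rounding the size up to a multiple of the alignment is an assumption made here so that the result stays aligned.

```c
#include <stdint.h>

/* Hypothetical model: returns the new stack pointer after reserving
 * `size` bytes with `align`-byte alignment (align must be a power of two). */
static uintptr_t alloc_stack_sketch(uintptr_t sp, uintptr_t size, uintptr_t align)
{
    uintptr_t mask    = align - 1;
    uintptr_t aligned = sp & ~mask;            /* step 1: align the stack (it grows down) */
    uintptr_t padded  = (size + mask) & ~mask; /* assumption: pad size to keep alignment */
    return aligned - padded;                   /* step 2: allocate below the aligned address */
}
```

Because the alignment step always runs first, the returned address is aligned regardless of the incoming stack pointer, which is the consistency the commit message refers to.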
* x86: move XOP emulation code back to x86inc (James Almer, 2015-08-03)
|   Only two functions that use xop multiply-accumulate instructions where the
|   first operand is the same as the fourth actually took advantage of the
|   macros.
|
|   This further reduces differences with x264's x86inc.
|
|   Reviewed-by: Ronald S. Bultje <rsbultje@gmail.com>
|   Signed-off-by: James Almer <jamrial@gmail.com>
* Merge commit 'ebaf571aca2dd6ce3caeeeec4210a3fccd47e7db' (Hendrik Leppkes, 2015-08-02)
|\
| |   * commit 'ebaf571aca2dd6ce3caeeeec4210a3fccd47e7db':
| |     x86: dct: Disable dct32_float_sse on x86-64
| |
| |   Conflicts:
| |     libavcodec/x86/dct32.asm
| |     libavcodec/x86/dct_init.c
| |
| |   Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>
| * x86: dct: Disable dct32_float_sse on x86-64 (Henrik Gramner, 2015-08-02)
| |   There is an SSE2 implementation so the SSE version is never used.
| |   The "SSE" version also happens to contain SSE2 instructions on x86-64.
| |
| |   Signed-off-by: Anton Khirnov <anton@khirnov.net>
* | x86/aacpsdsp: add SSE and SSE3 optimized functions (James Almer, 2015-07-30)
| |   Between 1.5 and 2.5 times faster
| |
| |   Reviewed-by: Michael Niedermayer <michael@niedermayer.cc>
| |   Signed-off-by: James Almer <jamrial@gmail.com>
* | Merge commit '059a934806d61f7af9ab3fd9f74994b838ea5eba' (Michael Niedermayer, 2015-07-27)
|\|
| |   * commit '059a934806d61f7af9ab3fd9f74994b838ea5eba':
| |     lavc: Consistently prefix input buffer defines
| |
| |   Conflicts:
| |     doc/examples/decoding_encoding.c
| |     libavcodec/4xm.c
| |     libavcodec/aac_adtstoasc_bsf.c
| |     libavcodec/aacdec.c
| |     libavcodec/aacenc.c
| |     libavcodec/ac3dec.h
| |     libavcodec/asvenc.c
| |     libavcodec/avcodec.h
| |     libavcodec/avpacket.c
| |     libavcodec/dvdec.c
| |     libavcodec/ffv1enc.c
| |     libavcodec/g2meet.c
| |     libavcodec/gif.c
| |     libavcodec/h264.c
| |     libavcodec/h264_mp4toannexb_bsf.c
| |     libavcodec/huffyuvdec.c
| |     libavcodec/huffyuvenc.c
| |     libavcodec/jpeglsenc.c
| |     libavcodec/libxvid.c
| |     libavcodec/mdec.c
| |     libavcodec/motionpixels.c
| |     libavcodec/mpeg4videodec.c
| |     libavcodec/mpegvideo.c
| |     libavcodec/noise_bsf.c
| |     libavcodec/nuv.c
| |     libavcodec/nvenc.c
| |     libavcodec/options.c
| |     libavcodec/parser.c
| |     libavcodec/pngenc.c
| |     libavcodec/proresenc_kostya.c
| |     libavcodec/qsvdec.c
| |     libavcodec/svq1enc.c
| |     libavcodec/tiffenc.c
| |     libavcodec/truemotion2.c
| |     libavcodec/utils.c
| |     libavcodec/utvideoenc.c
| |     libavcodec/vc1dec.c
| |     libavcodec/wmalosslessdec.c
| |     libavformat/adxdec.c
| |     libavformat/aiffdec.c
| |     libavformat/apc.c
| |     libavformat/apetag.c
| |     libavformat/avidec.c
| |     libavformat/bink.c
| |     libavformat/cafdec.c
| |     libavformat/flvdec.c
| |     libavformat/id3v2.c
| |     libavformat/isom.c
| |     libavformat/matroskadec.c
| |     libavformat/mov.c
| |     libavformat/mpc.c
| |     libavformat/mpc8.c
| |     libavformat/mpegts.c
| |     libavformat/mvi.c
| |     libavformat/mxfdec.c
| |     libavformat/mxg.c
| |     libavformat/nutdec.c
| |     libavformat/oggdec.c
| |     libavformat/oggparsecelt.c
| |     libavformat/oggparseflac.c
| |     libavformat/oggparseopus.c
| |     libavformat/oggparsespeex.c
| |     libavformat/omadec.c
| |     libavformat/rawdec.c
| |     libavformat/riffdec.c
| |     libavformat/rl2.c
| |     libavformat/rmdec.c
| |     libavformat/rtpdec_latm.c
| |     libavformat/rtpdec_mpeg4.c
| |     libavformat/rtpdec_qdm2.c
| |     libavformat/rtpdec_svq3.c
| |     libavformat/sierravmd.c
| |     libavformat/smacker.c
| |     libavformat/smush.c
| |     libavformat/spdifenc.c
| |     libavformat/takdec.c
| |     libavformat/tta.c
| |     libavformat/utils.c
| |     libavformat/vqf.c
| |     libavformat/westwood_vqa.c
| |     libavformat/xmv.c
| |     libavformat/xwma.c
| |     libavformat/yop.c
| |
| |   Merged-by: Michael Niedermayer <michael@niedermayer.cc>
* | Merge commit '7c6eb0a1b7bf1aac7f033a7ec6d8cacc3b5c2615' (Michael Niedermayer, 2015-07-27)
|\|
| |   * commit '7c6eb0a1b7bf1aac7f033a7ec6d8cacc3b5c2615':
| |     lavc: AV-prefix all codec flags
| |
| |   Conflicts:
| |     doc/examples/muxing.c
| |     ffmpeg.c
| |     ffmpeg_opt.c
| |     ffplay.c
| |     libavcodec/aacdec.c
| |     libavcodec/aacenc.c
| |     libavcodec/ac3dec.c
| |     libavcodec/ac3enc_float.c
| |     libavcodec/atrac1.c
| |     libavcodec/atrac3.c
| |     libavcodec/atrac3plusdec.c
| |     libavcodec/dcadec.c
| |     libavcodec/ffv1enc.c
| |     libavcodec/h264.c
| |     libavcodec/h264_loopfilter.c
| |     libavcodec/h264_mb.c
| |     libavcodec/imc.c
| |     libavcodec/libmp3lame.c
| |     libavcodec/libtheoraenc.c
| |     libavcodec/libtwolame.c
| |     libavcodec/libvpxenc.c
| |     libavcodec/libxavs.c
| |     libavcodec/libxvid.c
| |     libavcodec/mpeg12dec.c
| |     libavcodec/mpeg12enc.c
| |     libavcodec/mpegaudiodec_template.c
| |     libavcodec/mpegvideo.c
| |     libavcodec/mpegvideo_enc.c
| |     libavcodec/mpegvideo_motion.c
| |     libavcodec/nellymoserdec.c
| |     libavcodec/nellymoserenc.c
| |     libavcodec/nvenc.c
| |     libavcodec/on2avc.c
| |     libavcodec/options_table.h
| |     libavcodec/opus_celt.c
| |     libavcodec/pngenc.c
| |     libavcodec/ra288.c
| |     libavcodec/ratecontrol.c
| |     libavcodec/twinvq.c
| |     libavcodec/vc1_block.c
| |     libavcodec/vc1_loopfilter.c
| |     libavcodec/vc1_mc.c
| |     libavcodec/vc1dec.c
| |     libavcodec/vorbisdec.c
| |     libavcodec/vp3.c
| |     libavcodec/wma.c
| |     libavcodec/wmaprodec.c
| |     libavcodec/x86/hpeldsp_init.c
| |     libavcodec/x86/me_cmp_init.c
| |
| |   Merged-by: Michael Niedermayer <michael@niedermayer.cc>
| * lavc: AV-prefix all codec flags (Vittorio Giovara, 2015-07-27)
| |   Convert doxygen to multiline and express bitfields more simply.
| |
| |   Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>
* | avcodec/x86: add missing colon to labels (James Almer, 2015-07-26)
| |   Silences warnings with Nasm
| |
| |   Signed-off-by: James Almer <jamrial@gmail.com>
* | Merge commit 'a344e5d094ebcf9a23acf3a27c56cbbbc829db42' (Michael Niedermayer, 2015-07-17)
|\|
| |   * commit 'a344e5d094ebcf9a23acf3a27c56cbbbc829db42':
| |     x86: bswapdsp: Don't treat 32-bit integers as 64-bit
| |
| |   Conflicts:
| |     libavcodec/x86/bswapdsp.asm
| |
| |   Merged-by: Michael Niedermayer <michael@niedermayer.cc>
| * x86: bswapdsp: Don't treat 32-bit integers as 64-bit (Henrik Gramner, 2015-07-17)
| |   The upper halves are not guaranteed to be zero in x86-64.
| |
| |   Also use `test` instead of `and` when the result isn't used for anything
| |   other than as a branch condition, this allows some register moves to be
| |   eliminated.
| |
| |   Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
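The first point of the bswapdsp commit above can be modeled in C. This is only a sketch of the general hazard, not the assembly that was changed: `length_buggy` and `length_fixed` are hypothetical names, and the 64-bit integer stands in for a register whose low half carries a 32-bit argument while the upper half holds leftover bits.

```c
#include <stdint.h>

/* Hypothetical model of a 64-bit register holding a 32-bit argument:
 * only the low 32 bits are defined, the upper half may contain anything. */
static uint64_t length_buggy(uint64_t reg)
{
    return reg;            /* uses all 64 bits, including the undefined upper half */
}

static uint64_t length_fixed(uint64_t reg)
{
    return (uint32_t)reg;  /* keeps only the low 32 bits, zero-extended */
}
```

With a register value such as `0xDEADBEEF00000010`, the fixed variant yields 16 while the buggy one yields a huge count, which is why the 32-bit value must not be used through the full-width register.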
* | Merge commit 'd42191c78befc1983f23b1899b2dda513b72f1ed' (Michael Niedermayer, 2015-07-17)
|\|
| |   * commit 'd42191c78befc1983f23b1899b2dda513b72f1ed':
| |     configure: Factor out vp8dsp module
| |
| |   Conflicts:
| |     configure
| |     libavcodec/Makefile
| |     libavcodec/x86/Makefile
| |
| |   Merged-by: Michael Niedermayer <michael@niedermayer.cc>
| * configure: Factor out vp8dsp module (Vittorio Giovara, 2015-07-17)
| |
* | Merge commit '5cb4bdb2a03c3643f8f1e7d21d7094e61e0a4418' (Michael Niedermayer, 2015-07-17)
|\|
| |   * commit '5cb4bdb2a03c3643f8f1e7d21d7094e61e0a4418':
| |     configure: Factor out rv34dsp module
| |
| |   Conflicts:
| |     libavcodec/Makefile
| |     libavcodec/x86/Makefile
| |
| |   Merged-by: Michael Niedermayer <michael@niedermayer.cc>
| * configure: Factor out rv34dsp module (Vittorio Giovara, 2015-07-17)
| |
* | videodsp: assert that linesize is larger than width (Michael Niedermayer, 2015-07-08)
| |   Suggested-by: Andreas Cadhalpun <andreas.cadhalpun@googlemail.com>
| |   Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* | doc: avoid incorrect phrase 'allows to' (Andreas Cadhalpun, 2015-06-16)
| |   Also fix typo found by Lou Logan:
| |   Sacrifying -> Sacrificing
| |
| |   Reviewed-by: Lou Logan <lou@lrcd.com>
| |   Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>
* | avcodec/jpeg200dsp: add ff_rct_int_{sse2,avx2} (James Almer, 2015-06-13)
| |   Reviewed-by: Michael Niedermayer <michaelni@gmx.at>
| |   Signed-off-by: James Almer <jamrial@gmail.com>
* | avcodec/jpeg200dsp: add ff_ict_float_{sse,avx} (James Almer, 2015-06-13)
| |   Original intrinsics version by Nicolas Bertrand.
| |
| |   Reviewed-by: Michael Niedermayer <michaelni@gmx.at>
| |   Signed-off-by: James Almer <jamrial@gmail.com>
* | Merge commit 'b7a4127a45b780d76e6b09427a3d0197c4bc1cdb' (Michael Niedermayer, 2015-06-12)
|\|
| |   * commit 'b7a4127a45b780d76e6b09427a3d0197c4bc1cdb':
| |     h264_qpel: Use the correct header
| |
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * h264_qpel: Use the correct header (Vittorio Giovara, 2015-06-12)
| |
* | Merge commit '5e87080f2c73186066df0b9c43877b4af0beef3a' (Michael Niedermayer, 2015-06-12)
|\|
| |   * commit '5e87080f2c73186066df0b9c43877b4af0beef3a':
| |     h264_weight: Fix SSSE3 biweight code with weights of 128
| |
| |   Conflicts:
| |     libavcodec/x86/h264_weight.asm
| |
| |   See: e1009665759d4a3938dd2dd07b7e84d8bc9c5290
| |   See: fb2288834ba8a51e8934cfe8b82ac468144a34df
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * h264_weight: Fix SSSE3 biweight code with weights of 128 (Michael Niedermayer, 2015-06-12)
| |   CC: libav-stable@libav.org
| |   Sample-Id: test_bref.mp4
| |
| |   Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>
| * x86: check for AV_CPU_FLAG_AVXSLOW where useful (James Almer, 2015-05-31)
| |   Signed-off-by: James Almer <jamrial@gmail.com>
| |   Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* | avcodec/x86/h264_weight: handle weight1=128 (Michael Niedermayer, 2015-06-09)
| |   Fix ticket4596
| |
| |   Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* | x86: check for AV_CPU_FLAG_AVXSLOW where useful (James Almer, 2015-06-01)
| |   Signed-off-by: James Almer <jamrial@gmail.com>
| |   Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* | Merge commit 'e4610300de6869bd6b3b00e76cfeabb6d7653dcd' (Michael Niedermayer, 2015-05-28)
|\|
| |   * commit 'e4610300de6869bd6b3b00e76cfeabb6d7653dcd':
| |     x86: cavs: Remove an unneeded scratch buffer
| |
| |   Conflicts:
| |     libavcodec/x86/cavsdsp.c
| |
| |   See: d79f7bf0d63a81ee66026ee92a6946a7303d04bd
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * x86: cavs: Remove an unneeded scratch buffer (Michael Niedermayer, 2015-05-28)
| |   Simplifies the code and makes it build on certain compilers
| |   running out of registers on x86.
| |
| |   CC: libav-stable@libav.org
| |   Reported-By: mudler
* | Revert "Move struc FFTContext below SECTION_RODATA" (Timothy Gu, 2015-05-28)
| |   This reverts commit 599888a48040f4086c9086aae958b7231b37afcc.
| |
| |   The commit does not silence the warning on ELF-based systems, and will be
| |   fixed in the subsequent commit.
| |
| |   Conflicts:
| |     libavcodec/x86/fft_mmx.asm
| |
| |   Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* | Merge commit '848e86f74d3e6e87fa592ee8ba8c184cc5fd9a42' (Michael Niedermayer, 2015-05-22)
|\|
| |   * commit '848e86f74d3e6e87fa592ee8ba8c184cc5fd9a42':
| |     mpegvideo: Drop flags and flags2
| |
| |   Conflicts:
| |     libavcodec/mpeg12dec.c
| |     libavcodec/mpeg12enc.c
| |     libavcodec/mpegvideo.c
| |     libavcodec/mpegvideo_enc.c
| |     libavcodec/mpegvideo_motion.c
| |     libavcodec/ratecontrol.c
| |     libavcodec/vc1_block.c
| |     libavcodec/vc1_loopfilter.c
| |     libavcodec/vc1_mc.c
| |     libavcodec/vc1dec.c
| |
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * mpegvideo: Drop flags and flags2 (Vittorio Giovara, 2015-05-22)
| |   They are just duplicates of AVCodecContext members so use those instead.
* | Merge remote-tracking branch 'rbultje/vp9-bugfixes' (Michael Niedermayer, 2015-05-18)
|\ \
| | |   * rbultje/vp9-bugfixes:
| | |     vp9: match another find_ref_mvs() bug in libvpx.
| | |     vp9: fix scaled motion vector clipping for sub8x8 blocks.
| | |     vp9: improve signbias check.
| | |     vp9: don't allow compound references if error_resilience is enabled.
| | |     vp9: clamp segmented lflvl before applying ref/mode deltas.
| | |     vp9: reset loopfilter mode/ref deltas on keyframe.
| | |     vp9: fix crash when playing back 440/440 content with width%64<56.
| | |     vp9: extend loopfilter workaround for vp9 h/v mix-up to work for 422.
| | |     vp9: clip motion vectors in the same way as libvpx does.
| | |     vp9: set skip flag if the block had no coded coefficients.
| | |     vp9: apply mv scaling workaround only when subsampling is enabled.
| | |     vp9: read all 4x4 blocks in sub8x8 blocks individually with scalability.
| | |     vp9: fix segmentation map referencing upon framesize change.
| | |     vp9: disable more pmulhrsw optimizations in idct16/32.
| | |     vp9: disable all pmulhrsw in 8/16 iadst x86 optimizations.
| | |
| | |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * | vp9: disable more pmulhrsw optimizations in idct16/32. (Ronald S. Bultje, 2015-05-14)
| | |   For idct16, only when called from a adst16x16 variant, so impact is
| | |   minor. For idct32, for all, so relatively major impact.
| * | vp9: disable all pmulhrsw in 8/16 iadst x86 optimizations. (Ronald S. Bultje, 2015-05-14)
| | |   They all overflow in various samples that are considered valid input.
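The kind of overflow the two pmulhrsw commits above refer to can be sketched in C. This is an illustration under stated assumptions, not the vp9 assembly: `pmulhrsw_lane` models one 16-bit lane of the instruction, which computes `(a * b + 0x4000) >> 15`; the helper names, the constant 11585 and the input values are chosen here for illustration; and the code assumes an arithmetic right shift of negative values, as on common compilers.

```c
#include <stdint.h>

/* Wrap a value to the signed 16-bit range, as a 16-bit SIMD lane would. */
static int32_t wrap16(int32_t v)
{
    v &= 0xFFFF;
    return v >= 0x8000 ? v - 0x10000 : v;
}

/* One lane of pmulhrsw: rounded high half of a signed 16x16-bit product. */
static int32_t pmulhrsw_lane(int32_t a, int32_t b)
{
    return wrap16((a * b + 0x4000) >> 15);
}

/* Fast path (hypothetical): the sum is formed in 16 bits, then scaled. */
static int32_t sum_scale_16bit(int32_t x, int32_t y)
{
    return pmulhrsw_lane(wrap16(x + y), 2 * 11585);
}

/* Reference path (hypothetical): the sum and product stay in 32 bits. */
static int32_t sum_scale_32bit(int32_t x, int32_t y)
{
    return ((x + y) * 11585 + (1 << 13)) >> 14;
}
```

Both paths agree while `x + y` fits in 16 bits, but once the sum wraps the 16-bit path scales the wrapped value and the results diverge; keeping the intermediate in 32 bits costs extra instructions but avoids the wrap.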
* | | lavc/flac: Fix encoding and decoding with high lpc. (Carl Eugen Hoyos, 2015-05-17)
|/ /
| |   Based on an analysis by trac user lvqcl.
| |
| |   Fixes ticket #4421, reported by Chase Walker.
* | avcodec/x86/vp9dsp_init: Fix mix of declaration and statement (Michael Niedermayer, 2015-05-07)
| |   Reviewed-by: "Ronald S. Bultje" <rsbultje@gmail.com>
| |   Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* | vp9: add keyframe profile 2/3 support. (Ronald S. Bultje, 2015-05-06)
| |
* | avcodec/x86/deinterlace: use INIT_MMX like other asm code does too (Michael Niedermayer, 2015-05-05)
| |
* | avcodec/x86/dct-test: Use uint8_t for idct_simple_mmx_perm (Michael Niedermayer, 2015-05-02)
| |   The table contains no element outside the unsigned 8bit range
| |
| |   Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* | avcodec/x86/dct-test: Make static table const (Michael Niedermayer, 2015-05-02)
| |   Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* | vp9: remove another optimization branch in iadst16 which causes overflows. (Ronald S. Bultje, 2015-04-24)
| |   See sample vp90-2-14-resize-fp-tiles-16-8.webm from the vp9 test vector
| |   set to reproduce the issue.
* | vp9: remove one optimization branch in iadst16 which causes overflows. (Ronald S. Bultje, 2015-04-22)
| |   See sample vp90-2-14-resize-fp-tiles-16-8-4-2-1.webm from the vp9 test
| |   vector set which reproduces the issue. This probably costs a few cycles,
| |   but I don't think there's an easy way to workaround that.
| |
| |   Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* | avcodec/x86/hpeldsp_init: Put CONFIG_* first in if() (Michael Niedermayer, 2015-03-26)
| |   This is more consistent and may fix a build failure
| |
| |   Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* | x86/xvididct: add some yasm guards (James Almer, 2015-03-20)
| |   Should fix compilation on compilers with less-than-ideal dead code
| |   elimination
| |
| |   Signed-off-by: James Almer <jamrial@gmail.com>
* | x86/xvididct: remove obsolete function prototypes (James Almer, 2015-03-20)
| |   Signed-off-by: James Almer <jamrial@gmail.com>
* | Merge commit '48aef27f5232794e70ecef0d347b9f65e27a9bad' (Michael Niedermayer, 2015-03-17)
|\|
| |   * commit '48aef27f5232794e70ecef0d347b9f65e27a9bad':
| |     x86: Put COPY3_IF_LT under HAVE_6REGS
| |
| |   Conflicts:
| |     libavcodec/x86/mathops.h
| |
| |   See: b38910c9790253b362839042a17e13252c1d4b90
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * x86: Put COPY3_IF_LT under HAVE_6REGS (Luca Barbato, 2015-03-17)
| |   It uses 6 registers, unbreaks building on hardened x86 system.
| |
| |   Bug-Id: gentoo/541930
| |   CC: libav-stable@libav.org
* | avcodec/x86/cavsdsp: remove incorrect LOCAL_ALIGN tmp (Michael Niedermayer, 2015-03-16)
| |   This is faster and simpler as well
| |
| |   Signed-off-by: Michael Niedermayer <michaelni@gmx.at>