path: root/libavcodec/x86
Commit message  Author  Age
* x86inc: Add debug symbols indicating sizes of compiled functionsGeza Lore2016-01-23
| | | | | | | | | | | Some debuggers/profilers use this metadata to determine which function a given instruction is in; without it they can get confused by local labels (if you haven't stripped those). On the other hand, some tools are still confused even with this metadata, e.g. this fixes `gdb`, but not `perf`. Currently only implemented for ELF. Signed-off-by: Anton Khirnov <anton@khirnov.net>
* x86: build: Group all encoder objects togetherDiego Biurrun2016-01-18
|
* x86: ac3dsp: Drop forward declaration for nonexisting functionDiego Biurrun2016-01-18
|
* x86: use emms after ff_int32_to_float_fmul_scalar_sseJanne Grunau2015-12-30
| | | | | | | | | | | | | | | | | | | | | Intel's Instruction Set Reference (as of September 2015) clearly states that cvtpi2ps switches to MMX state. Actual CPUs do not switch if the source is a memory location. The Instruction Set Reference from 1999 (Order Number 243191) describes this behaviour, but all later versions I've seen make no distinction whether an MMX register or memory is used as the source. The documentation for the matching SSE2 instruction to convert to double (cvtpi2pd) was fixed (see the valgrind bug https://bugs.kde.org/show_bug.cgi?id=210264). It will take time to get a clarification and fixes in place. In the meantime it makes sense to change ff_int32_to_float_fmul_scalar_sse to be correct according to the documentation. The vast majority of users will have SSE2, so a change to the SSE version has little effect. Fixes fate-checkasm on x86 valgrind targets. Valgrind 'bug' reported as https://bugs.kde.org/show_bug.cgi?id=357059
* x86: zero extend the 32-bit length in int32_to_float_fmul_scalar implicitlyJanne Grunau2015-12-29
| | | | This reverts commit 5dfe4edad63971d669ae456b0bc40ef9364cca80.
* dca: remove unused decode_hf function and quant_d tablesAlexandra Hájková2015-12-24
| | | | | They were superseded by their integer equivalents. Rename integer decode_hf to decode_hf.
* x86_64: int32_to_float_fmul_scalar sign extend integer lengthJanne Grunau2015-12-14
|
* hevcdsp: use a macro for .rodata sectionDave Yeo2015-12-11
| | | | | | | fixes assembling on OS/2 Signed-off-by: Dave Yeo <dave.r.yeo@gmail.com> Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* hevcdsp: add x86 SIMD for MCAnton Khirnov2015-12-05
|
* mpegvideo: Make sure mpegutils.h is included where neededVittorio Giovara2015-09-13
|
* lavc: Drop deprecated deinterlace moduleVittorio Giovara2015-08-28
| | | | Deprecated in 03/2013.
* x86inc: Drop SECTION_TEXT macroHenrik Gramner2015-08-11
| | | | | | | The .text section is already 16-byte aligned by default on all supported platforms so `SECTION_TEXT` isn't any different from `SECTION .text`. Signed-off-by: Anton Khirnov <anton@khirnov.net>
* x86inc: Support arbitrary stack alignmentsHenrik Gramner2015-08-11
| | | | | | | | Change ALLOC_STACK to always align the stack before allocating stack space for consistency. Previously alignment would occur either before or after allocating stack space depending on whether manual alignment was required or not. Signed-off-by: Anton Khirnov <anton@khirnov.net>
* x86: dcadsp: Avoid SSE2 instructions in SSE functionsHenrik Gramner2015-08-11
| | | | Signed-off-by: Anton Khirnov <anton@khirnov.net>
* x86: dct: Disable dct32_float_sse on x86-64Henrik Gramner2015-08-02
| | | | | | | There is an SSE2 implementation so the SSE version is never used. The "SSE" version also happens to contain SSE2 instructions on x86-64. Signed-off-by: Anton Khirnov <anton@khirnov.net>
* lavc: AV-prefix all codec flagsVittorio Giovara2015-07-27
| | | | | | Convert doxygen to multiline and express bitfields more simply. Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>
* x86: bswapdsp: Don't treat 32-bit integers as 64-bitHenrik Gramner2015-07-17
| | | | | | | | | The upper halves are not guaranteed to be zero in x86-64. Also use `test` instead of `and` when the result isn't used for anything other than as a branch condition; this allows some register moves to be eliminated. Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* configure: Factor out vp8dsp moduleVittorio Giovara2015-07-17
|
* configure: Factor out rv34dsp moduleVittorio Giovara2015-07-17
|
* h264_qpel: Use the correct headerVittorio Giovara2015-06-12
|
* h264_weight: Fix SSSE3 biweight code with weights of 128Michael Niedermayer2015-06-12
| | | | | | | CC: libav-stable@libav.org Sample-Id: test_bref.mp4 Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>
* x86: check for AV_CPU_FLAG_AVXSLOW where usefulJames Almer2015-05-31
| | | | | Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* x86: cavs: Remove an unneeded scratch bufferMichael Niedermayer2015-05-28
| | | | | | | | Simplifies the code and makes it build on certain compilers running out of registers on x86. CC: libav-stable@libav.org Reported-By: mudler
* mpegvideo: Drop flags and flags2Vittorio Giovara2015-05-22
| | | | They are just duplicates of AVCodecContext members so use those instead.
* x86: Put COPY3_IF_LT under HAVE_6REGSLuca Barbato2015-03-17
| | | | | | | It uses 6 registers; this unbreaks building on hardened x86 systems. Bug-Id: gentoo/541930 CC: libav-stable@libav.org
* lavc: do not compile fmtconvert unconditionallyAnton Khirnov2015-02-28
| | | | Only ac3dec and dcadec use it.
* fmtconvert: drop unused functionsAnton Khirnov2015-02-28
|
* hevc_deblock: Fix compilation with nasmCarl Eugen Hoyos2015-02-22
| | | | | | CC: libav-stable@libav.org Bug-Id: 795 Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>
* v210enc: Add SIMD optimised 8-bit and 10-bit encodersKieran Kunhya2014-12-05
| | | | | Signed-off-by: Michael Niedermayer <michaelni@gmx.at> Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>
* motion_est: convert stride to ptrdiff_tVittorio Giovara2014-11-24
| | | | | CC: libav-stable@libav.org Bug-Id: CID 700556 / CID 700557 / CID 700558
* x86: h264_intrapred: Don't treat 32-bit integers as 64-bitHenrik Gramner2014-10-08
| | | | | | The upper halves are not guaranteed to be zero in x86-64. Signed-off-by: Anton Khirnov <anton@khirnov.net>
* xvid: Add C IDCTPascal Massimino2014-09-02
| | | | | | Thanks to Pascal Massimino and Michael Militzer for relicensing as LGPL. Signed-off-by: Diego Biurrun <diego@biurrun.de>
* idctdsp: Add global function pointers for {add|put}_pixels_clamped functionsDiego Biurrun2014-09-02
| | | | | | These function pointers already existed in the ARM code. Adding them globally allows calls to the function pointers to access arch-optimized versions of the functions transparently.
* x86: xvid: K&R formatting cosmeticsDiego Biurrun2014-08-27
|
* cosmetics: Make naming scheme of Xvid IDCT consistent with other IDCTsDiego Biurrun2014-08-27
|
* x86: xvid_idct: Drop unused definitionsDiego Biurrun2014-08-27
|
* build: Add explanatory comments to (optimization) blocks in the MakefilesDiego Biurrun2014-08-15
|
* mpegvideo: cosmetics: Lowercase ugly uppercase MPV_ function name prefixesDiego Biurrun2014-08-15
|
* avcodec: Rename xvidmmx IDCT to xvidDiego Biurrun2014-08-08
| | | | The Xvid IDCT is not MMX-specific.
* xvididct: Ensure that the scantable permutation is always set correctlyDiego Biurrun2014-08-08
| | | | | This fixes cases where the scantable permutation would get overwritten by the general idctdsp initialization.
* idct: Split off Xvid IDCTDiego Biurrun2014-08-01
| | | | | The Xvid IDCT is only required to decode some Xvid-encoded MPEG-4 files, so there is no point in having it as an unconditional part of idctdsp.
* x86: build: Restore ordering of OBJS linesDiego Biurrun2014-07-28
|
* hevc: SSE2 and SSSE3 loop filtersPierre Edouard Lepere2014-07-26
| | | | | | | | Additional contributions by James Almer <jamrial@gmail.com>, Carl Eugen Hoyos <cehoyos@ag.or.at>, Fiona Glaser <fiona@x264.com> and Anton Khirnov <anton@khirnov.net> Signed-off-by: Anton Khirnov <anton@khirnov.net>
* qpeldsp: Mark source pointer in qpel_mc_func function pointer constDiego Biurrun2014-07-25
|
* dct-test: Move arch-specific bits into arch-specific subdirectoriesDiego Biurrun2014-07-21
|
* simple_idct: Move x86-specific declarations to a header in the x86 directoryDiego Biurrun2014-07-19
|
* fdct: Move x86-specific declarations to a header in the x86 directoryDiego Biurrun2014-07-19
|
* x86: dnxhdenc: Eliminate some unnecessary ifdefsDiego Biurrun2014-07-18
|
* idctdsp: prettyprinting cosmeticsDiego Biurrun2014-07-18
|
* idct: Convert IDCT permutation #defines to an enumDiego Biurrun2014-07-18
| | | | Also rename the enum values to be consistent with other DCT permutations.