summaryrefslogtreecommitdiff
path: root/libavcodec/x86
Commit message (Collapse)AuthorAge
* tests: Move all test programs to a subdirectoryDiego Biurrun2016-05-13
|
* cosmetics: Fix spelling mistakesVittorio Giovara2016-05-04
| | | | Signed-off-by: Diego Biurrun <diego@biurrun.de>
* build: miscellaneous cosmeticsDiego Biurrun2016-04-07
| | | | | | Restore alphabetical order in lists, break overly long lines, do some prettyprinting, add some explanatory section comments, group parts together that belong together logically.
* fft: Split MDCT bits off from FFTDiego Biurrun2016-03-01
|
* fft: x86: cosmetics: Drop silly comments, add comment, whitespaceDiego Biurrun2016-02-26
|
* x86: hevc: Fix linking with both yasm and optimizations disabledDiego Biurrun2016-02-23
| | | | | Some optimized functions reference optimized symbols, so the functions must be explicitly disabled when those symbols are unavailable.
* build: Add vc1dsp component for more fine-grained dependenciesDiego Biurrun2016-02-19
|
* v210: Use separate sample_factorsLuca Barbato2016-02-01
| | | | | | | The 10bit and the 8bit functions can now be implemented to process a different amount of samples. And while at it simplify a little the code.
* v210: Add avx2 version of the 10-bit line encoderJames Darnley2016-02-01
| | | | | | Around 25% faster than the ssse3 version. Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* v210: Add avx2 version of the 8-bit line encoderJames Darnley2016-02-01
| | | | | | | Around 35% faster than the avx version. Signed-off-by: Henrik Gramner <henrik@gramner.com> Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* v210: x86: Add the correct guards around the asm codeLuca Barbato2016-01-26
| | | | Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* x86inc: Add debug symbols indicating sizes of compiled functionsGeza Lore2016-01-23
| | | | | | | | | | | Some debuggers/profilers use this metadata to determine which function a given instruction is in; without it they get can confused by local labels (if you haven't stripped those). On the other hand, some tools are still confused even with this metadata. e.g. this fixes `gdb`, but not `perf`. Currently only implemented for ELF. Signed-off-by: Anton Khirnov <anton@khirnov.net>
* x86: build: Group all encoder objects togetherDiego Biurrun2016-01-18
|
* x86: ac3dsp: Drop forward declaration for nonexisting functionDiego Biurrun2016-01-18
|
* x86: use emms after ff_int32_to_float_fmul_scalar_sseJanne Grunau2015-12-30
| | | | | | | | | | | | | | | | | | | | | Intel's Instruction Set Reference (as of September 2015) clearly states that cvtpi2ps switches to MMX state. Actual CPUs do not switch if the source is a memory location. The Instruction Set Reference from 1999 (Order Number 243191) describes this behaviour but all later versions I've seen have make no distinction whether MMX registers or memory is used as source. The documentation for the matching SSE2 instruction to convert to double (cvtpi2pd) was fixed (see the valgrind bug https://bugs.kde.org/show_bug.cgi?id=210264). It will take time to get a clarification and fixes in place. In the meantime it makes sense to change ff_int32_to_float_fmul_scalar_sse to be correct according to the documentation. The vast majority of users will have SSE2 so a change to the SSE version has little effect. Fixes fate-checkasm on x86 valgrind targets. Valgrind 'bug' reported as https://bugs.kde.org/show_bug.cgi?id=357059
* x86: zero extend the 32-bit length in int32_to_float_fmul_scalar implicitlyJanne Grunau2015-12-29
| | | | This reverts commit 5dfe4edad63971d669ae456b0bc40ef9364cca80.
* dca: remove unused decode_hf function and quant_d tablesAlexandra Hájková2015-12-24
| | | | | They were superseded with their integer equivalents. Rename integer decode_hf to decode_hf.
* x86_64: int32_to_float_fmul_scalar sign extend integer lengthJanne Grunau2015-12-14
|
* hevcdsp: use a macro for .rodata sectionDave Yeo2015-12-11
| | | | | | | fixes assembling on OS/2 Signed-off-by: Dave Yeo <dave.r.yeo@gmail.com> Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* hevcdsp: add x86 SIMD for MCAnton Khirnov2015-12-05
|
* mpegvideo: Make sure mpegutils.h is included where neededVittorio Giovara2015-09-13
|
* lavc: Drop deprecated deinterlace moduleVittorio Giovara2015-08-28
| | | | Deprecated in 03/2013.
* x86inc: Drop SECTION_TEXT macroHenrik Gramner2015-08-11
| | | | | | | The .text section is already 16-byte aligned by default on all supported platforms so `SECTION_TEXT` isn't any different from `SECTION .text`. Signed-off-by: Anton Khirnov <anton@khirnov.net>
* x86inc: Support arbitrary stack alignmentsHenrik Gramner2015-08-11
| | | | | | | | Change ALLOC_STACK to always align the stack before allocating stack space for consistency. Previously alignment would occur either before or after allocating stack space depending on whether manual alignment was required or not. Signed-off-by: Anton Khirnov <anton@khirnov.net>
* x86: dcadsp: Avoid SSE2 instructions in SSE functionsHenrik Gramner2015-08-11
| | | | Signed-off-by: Anton Khirnov <anton@khirnov.net>
* x86: dct: Disable dct32_float_sse on x86-64Henrik Gramner2015-08-02
| | | | | | | There is an SSE2 implementation so the SSE version is never used. The "SSE" version also happens to contain SSE2 instructions on x86-64. Signed-off-by: Anton Khirnov <anton@khirnov.net>
* lavc: AV-prefix all codec flagsVittorio Giovara2015-07-27
| | | | | | Convert doxygen to multiline and express bitfields more simply. Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>
* x86: bswapdsp: Don't treat 32-bit integers as 64-bitHenrik Gramner2015-07-17
| | | | | | | | | The upper halves are not guaranteed to be zero in x86-64. Also use `test` instead of `and` when the result isn't used for anything other than as a branch condition, this allows some register moves to be eliminated. Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* configure: Factor out vp8dsp moduleVittorio Giovara2015-07-17
|
* configure: Factor out rv34dsp moduleVittorio Giovara2015-07-17
|
* h264_qpel: Use the correct headerVittorio Giovara2015-06-12
|
* h264_weight: Fix SSSE3 biweight code with weights of 128Michael Niedermayer2015-06-12
| | | | | | | CC: libav-stable@libav.org Sample-Id: test_bref.mp4 Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>
* x86: check for AV_CPU_FLAG_AVXSLOW where usefulJames Almer2015-05-31
| | | | | Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
* x86: cavs: Remove an unneeded scratch bufferMichael Niedermayer2015-05-28
| | | | | | | | Simplifies the code and makes it build on certain compilers running out of registers on x86. CC: libav-stable@libav.org Reported-By: mudler
* mpegvideo: Drop flags and flags2Vittorio Giovara2015-05-22
| | | | They are just duplicates of AVCodecContext members so use those instead.
* x86: Put COPY3_IF_LT under HAVE_6REGSLuca Barbato2015-03-17
| | | | | | | It uses 6 registers, unbreaks building on hardened x86 system. Bug-Id: gentoo/541930 CC: libav-stable@libav.org
* lavc: do not compile fmtconvert unconditionallyAnton Khirnov2015-02-28
| | | | Only ac3dec and dcadec use it.
* fmtconvert: drop unused functionsAnton Khirnov2015-02-28
|
* hevc_deblock: Fix compilation with nasmCarl Eugen Hoyos2015-02-22
| | | | | | CC: libav-stable@libav.org Bug-Id: 795 Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>
* v210enc: Add SIMD optimised 8-bit and 10-bit encodersKieran Kunhya2014-12-05
| | | | | Signed-off-by: Michael Niedermayer <michaelni@gmx.at> Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>
* motion_est: convert stride to ptrdiff_tVittorio Giovara2014-11-24
| | | | | CC: libav-stable@libav.org Bug-Id: CID 700556 / CID 700557 / CID 700558
* x86: h264_intrapred: Don't treat 32-bit integers as 64-bitHenrik Gramner2014-10-08
| | | | | | The upper halves are not guaranteed to be zero in x86-64. Signed-off-by: Anton Khirnov <anton@khirnov.net>
* xvid: Add C IDCTPascal Massimino2014-09-02
| | | | | | Thanks to Pascal Massimino and Michael Militzer for relicensing as LGPL. Signed-off-by: Diego Biurrun <diego@biurrun.de>
* idctdsp: Add global function pointers for {add|put}_pixels_clamped functionsDiego Biurrun2014-09-02
| | | | | | These function pointers already existed in the ARM code. Adding them globally allows calls to the function pointers to access arch-optimized versions of the functions transparently.
* x86: xvid: K&R formatting cosmeticsDiego Biurrun2014-08-27
|
* cosmetics: Make naming scheme of Xvid IDCT consistent with other IDCTsDiego Biurrun2014-08-27
|
* x86: xvid_idct: Drop unused definitionsDiego Biurrun2014-08-27
|
* build: Add explanatory comments to (optimization) blocks in the MakefilesDiego Biurrun2014-08-15
|
* mpegvideo: cosmetics: Lowercase ugly uppercase MPV_ function name prefixesDiego Biurrun2014-08-15
|
* avcodec: Rename xvidmmx IDCT to xvidDiego Biurrun2014-08-08
| | | | The Xvid IDCT is not MMX-specific.