path: root/libavutil/arm
Commit message    Author    Age
* avutil/arm/intmath: return int for uint8 / uint16 clip    Michael Niedermayer    2015-07-20
|   The C functions return uint8/16_t, but that is effectively int, not unsigned int.
|   Fixes fate-filter-tblend.
|   We do not return uint8/16_t, as that would require the compiler to truncate the values, slowing it down.
|   Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
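The return-type point above comes down to C's integer promotions: a uint8_t value promotes to signed int in arithmetic, while an unsigned int keeps the arithmetic unsigned. The following is a minimal sketch of that effect only, not the ARM inline assembly touched by the commit; every function name in it is hypothetical.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical stand-in for a C clip helper: clips to 0..255, returns uint8_t. */
static uint8_t clip_uint8_ref(int a)
{
    if (a & ~0xFF)              /* any bit outside the low 8 set, or negative */
        return a < 0 ? 0 : 255;
    return (uint8_t)a;
}

/* Hypothetical variant that widens the result to unsigned int. */
static unsigned clip_uint8_as_unsigned(int a)
{
    return clip_uint8_ref(a);
}

/* uint8_t promotes to int, so the subtraction is signed: 10 - 20 == -10. */
static long long diff_via_uint8(int a, int b)
{
    return clip_uint8_ref(a) - b;
}

/* unsigned - int is computed as unsigned, so 10 - 20 wraps to a large
 * positive value instead of -10. */
static long long diff_via_unsigned(int a, int b)
{
    return clip_uint8_as_unsigned(a) - b;
}
```

A caller that subtracts from the clipped value and expects a negative result therefore only gets one when the helper's return type behaves like int.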
* arm: only enable setend on ARMv6    Andreas Cadhalpun    2015-06-05
|   Without this check it causes SIGILL crashes on ARMv5.
|   Reviewed-by: Michael Niedermayer <michaelni@gmx.at>
|   Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>
* Merge commit 'dcae2e32f7d8a1ca5fb8c1e4aa81313be854dd73'    Michael Niedermayer    2015-03-07
|\
| |   * commit 'dcae2e32f7d8a1ca5fb8c1e4aa81313be854dd73':
| |     arm: Suppress tags about used cpu arch and extensions
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * arm: Suppress tags about used cpu arch and extensions    Martin Storsjö    2015-03-07
| |   When all the codepaths using manually set .arch/.fpu code are behind runtime detection, the ELF attributes should be suppressed. This allows tools to know that the final built binary doesn't strictly require these extensions.
| |   Signed-off-by: Martin Storsjö <martin@martin.st>
* | Merge commit '76ce9bd8e26dcb3652240a1072840ff4011d7cdc'    Michael Niedermayer    2015-02-21
|\|
| |   * commit '76ce9bd8e26dcb3652240a1072840ff4011d7cdc':
| |     libavutil: Add ARM av_clip_intp2_arm
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * libavutil: Add ARM av_clip_intp2_arm    Peter Meerwald    2015-02-21
| |   Add ARM code implementing av_clip_intp2 using the ssat instruction.
| |   On Cortex-A8, av_clip_intp2_arm() is faster than av_clip_intp2_c() and the generic av_clip() (about -19%).
| |   Signed-off-by: Peter Meerwald <pmeerw@pmeerw.net>
| |   Signed-off-by: Luca Barbato <lu_zero@gentoo.org>
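For orientation, the clipping that av_clip_intp2 performs can be written as a portable reference: the value is limited to the signed range [-(2^p), 2^p - 1]. ARM's ssat instruction saturates to an n-bit signed range, so that range corresponds to an (p+1)-bit saturation. The sketch below is an illustrative reference under those assumptions, with a hypothetical function name; it is not the libavutil source.

```c
#include <assert.h>

/* Hypothetical reference: clip `a` to [-(2^p), 2^p - 1].
 * p is restricted to 0..30 so that (1 << p) stays within int. */
static int clip_intp2_ref(int a, int p)
{
    int max, min;

    assert(p >= 0 && p <= 30);
    max = (1 << p) - 1;   /* largest representable value  */
    min = -max - 1;       /* smallest representable value */

    if (a > max)
        return max;
    if (a < min)
        return min;
    return a;
}
```

With p = 8, for example, inputs are limited to -256..255, which is what a 9-bit signed saturation produces in a single instruction.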
* | Merge commit 'f963f80399deb1a2b44c1bac3af7123e8a0c9e46'    Michael Niedermayer    2014-12-09
|\|
| |   * commit 'f963f80399deb1a2b44c1bac3af7123e8a0c9e46':
| |     arm: Use .data.rel.ro for const data with relocations
| |   Conflicts:
| |     configure
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * arm: Use .data.rel.ro for const data with relocations    Martin Storsjö    2014-12-09
| |   Signed-off-by: Martin Storsjö <martin@martin.st>
* | avutil/arm/float_dsp_init_vfp: replace restrict by av_restrict    jessejiang    2014-11-20
| |   Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* | avutil: turn arm setend into a cpuflag    Michael Niedermayer    2014-08-13
| |   This allows disabling and enabling it.
| |   It also prevents crashes if vfpv3 and neon are disabled, which previously would have enabled the flag.
| |   And last but not least, one can enable setend on CPUs like cortex-a8, where it is fast but disabled by default.
| |   Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* | Merge commit '6869612f5c7d4d2f20f69a5658328a761deadb1c'    Michael Niedermayer    2014-07-22
|\|
| |   * commit '6869612f5c7d4d2f20f69a5658328a761deadb1c':
| |     arm: Macroize the test for 'setend' CPU instruction support
| |   Conflicts:
| |     libavcodec/arm/h264dsp_init_arm.c
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * arm: Macroize the test for 'setend' CPU instruction support    Ben Avison    2014-07-21
| |   Signed-off-by: Diego Biurrun <diego@biurrun.de>
| * armv6: Accelerate butterflies_float    Ben Avison    2014-07-18
| |   I benchmarked the result by measuring the number of gperftools samples that hit anywhere in the AAC decoder (starting from aac_decode_frame()) or specifically in butterflies_float_c() / ff_butterflies_float_vfp() for the same sample AAC stream:
| |
| |                        Before          After
| |                        Mean    StdDev  Mean    StdDev  Confidence  Change
| |     Audio decode       1542.8  43.7    1470.5  41.5    100.0%      +4.9%
| |     butterflies_float  130.0   11.9    70.2    12.1    100.0%      +85.2%
| |
| |   Signed-off-by: Martin Storsjö <martin@martin.st>
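The Change column in these benchmark tables appears to be the ratio of mean sample counts, before / after - 1, expressed as a percentage; that reading reproduces the quoted figures (1542.8 / 1470.5 gives about +4.9%, 130.0 / 70.2 about +85.2%). A small sketch of the arithmetic, assuming that formula and using a hypothetical helper name:

```c
#include <assert.h>

/* Hypothetical helper: relative speed-up in percent, assuming the
 * "Change" column means (before / after - 1) * 100. */
static double speedup_percent(double before, double after)
{
    assert(after > 0.0);
    return (before / after - 1.0) * 100.0;
}
```

Fewer profiler samples after the change means less time spent in the function, so a positive percentage here reads as a speed-up.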
| * armv6: Accelerate vector_fmul_window    Ben Avison    2014-07-18
| |   I benchmarked the result by measuring the number of gperftools samples that hit anywhere in the AAC decoder (starting from aac_decode_frame()) or specifically in vector_fmul_window_c() / ff_vector_fmul_window_vfp() for the same sample AAC stream:
| |
| |                         Before          After
| |                         Mean    StdDev  Mean    StdDev  Confidence  Change
| |     Audio decode        1598.2  47.4    1529.2  25.4    100.0%      +4.5%
| |     vector_fmul_window  244.0   22.1    188.9   22.3    100.0%      +29.2%
| |
| |   Signed-off-by: Martin Storsjö <martin@martin.st>
* | armv6: Accelerate butterflies_float    Ben Avison    2014-07-16
| |   I benchmarked the result by measuring the number of gperftools samples that hit anywhere in the AAC decoder (starting from aac_decode_frame()) or specifically in butterflies_float_c() / ff_butterflies_float_vfp() for the same sample AAC stream:
| |
| |                        Before          After
| |                        Mean    StdDev  Mean    StdDev  Confidence  Change
| |     Audio decode       1542.8  43.7    1470.5  41.5    100.0%      +4.9%
| |     butterflies_float  130.0   11.9    70.2    12.1    100.0%      +85.2%
| |
| |   Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* | armv6: Accelerate vector_fmul_window    Ben Avison    2014-07-16
| |   I benchmarked the result by measuring the number of gperftools samples that hit anywhere in the AAC decoder (starting from aac_decode_frame()) or specifically in vector_fmul_window_c() / ff_vector_fmul_window_vfp() for the same sample AAC stream:
| |
| |                         Before          After
| |                         Mean    StdDev  Mean    StdDev  Confidence  Change
| |     Audio decode        1598.2  47.4    1529.2  25.4    100.0%      +4.5%
| |     vector_fmul_window  244.0   22.1    188.9   22.3    100.0%      +29.2%
| |
| |   Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* | Merge commit '7b0c7c9163fe3dd0081696befde28617119d2590'    Michael Niedermayer    2014-06-28
|\|
| |   * commit '7b0c7c9163fe3dd0081696befde28617119d2590':
| |     arm: Detect 32 bit cpu features on ARMv8 when running on a 64 bit kernel
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * arm: Detect 32 bit cpu features on ARMv8 when running on a 64 bit kernel    Martin Storsjö    2014-06-28
| |   When running on a 64 bit kernel, /proc/cpuinfo lists different optional features than on 32 bit kernels (because some of them are mandatory in the 64 bit implementations).
| |
| |   The kernel does list the old features properly if they are queried via /proc/self/auxv; however, this file is not always readable (e.g. on most android systems). The getauxval function could also provide the same info as /proc/self/auxv even if this file isn't readable, but this function is not always available (and thus would need to be loaded with dlsym for compatibility with older android versions).
| |
| |   The android cpufeatures library does this slightly differently, by assuming that these are available if the "CPU architecture" line is >= 8, see [1] for details.
| |
| |   It has been suggested to include the old, non-optional features in /proc/cpuinfo as well, but that suggested patch never was merged. See [2] for the discussion around this suggestion.
| |
| |   [1] https://android-review.googlesource.com/91380
| |   [2] http://marc.info/?l=linux-arm-kernel&m=139087240101974
| |
| |   Signed-off-by: Martin Storsjö <martin@martin.st>
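The two /proc/cpuinfo strategies mentioned in this message can be sketched on an in-memory copy of the file: matching whole words on the "Features" line, and reading the number on the "CPU architecture" line as the android cpufeatures library is described as doing. This is a rough sketch, not the code from the commit. The function names are hypothetical, and the "asimd" token used for 64 bit kernels is an assumption about kernel output that the message does not spell out.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical: number on the first "CPU architecture" line, or -1 if the
 * line is missing. A non-numeric value yields 0. */
static int cpuinfo_arch_version(const char *cpuinfo)
{
    static const char key[] = "CPU architecture";
    const char *p;

    assert(cpuinfo != NULL);
    p = strstr(cpuinfo, key);
    if (!p)
        return -1;
    p += sizeof(key) - 1;
    while (*p == ' ' || *p == '\t')
        p++;
    if (*p != ':')
        return -1;
    return (int)strtol(p + 1, NULL, 10);
}

/* Hypothetical: 1 if `token` is a whole word on the "Features" line. */
static int cpuinfo_has_feature(const char *cpuinfo, const char *token)
{
    const char *p = strstr(cpuinfo, "Features");
    size_t len = strlen(token);

    assert(len > 0);
    if (!p || !(p = strchr(p, ':')))
        return 0;
    p++;
    while (*p && *p != '\n') {
        size_t n;
        while (*p == ' ' || *p == '\t')
            p++;
        n = strcspn(p, " \t\n");        /* length of the current word */
        if (n == len && !strncmp(p, token, len))
            return 1;
        p += n;
    }
    return 0;
}

/* Hypothetical policy: NEON is usable if listed under its 32 bit name,
 * under the assumed 64 bit name "asimd", or if the architecture is >= 8. */
static int cpuinfo_has_neon(const char *cpuinfo)
{
    return cpuinfo_has_feature(cpuinfo, "neon")  ||
           cpuinfo_has_feature(cpuinfo, "asimd") ||
           cpuinfo_arch_version(cpuinfo) >= 8;
}
```

Whole-word matching matters here because feature names overlap: "vfp" must not be reported just because "vfpv3" is present.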
* | Merge commit 'd5a55981986ac5d1a31aef3a8d16eaff8534a412'    Michael Niedermayer    2014-06-04
|\|
| |   * commit 'd5a55981986ac5d1a31aef3a8d16eaff8534a412':
| |     build: check if AS supports the '.func' directive
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * build: check if AS supports the '.func' directive    Janne Grunau    2014-06-03
| |   Not supported by Clang's integrated assembler. Since it just adds debug information, it can safely be omitted.
* | Merge commit '831a1180785a786272cdcefb71566a770bfb879e'    Michael Niedermayer    2014-03-13
|\|
| |   * commit '831a1180785a786272cdcefb71566a770bfb879e':
| |     Update dsputil- and SIMD-related comments to match reality more closely
| |   Conflicts:
| |     libavcodec/x86/hpeldsp.asm
| |     libavutil/arm/float_dsp_init_arm.c
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * Update dsputil- and SIMD-related comments to match reality more closely    Diego Biurrun    2014-03-13
| |
* | Merge remote-tracking branch 'qatar/master'    Michael Niedermayer    2014-03-05
|\|
| |   * qatar/master:
| |     arm: hpeldsp: prevent overreads in armv6 asm
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * arm: hpeldsp: prevent overreads in armv6 asm    Janne Grunau    2014-03-05
| |   Based on a patch by Russel King <rmk+libav@arm.linux.org.uk>
| |   Bug-Id: 646
| |   CC: libav-stable@libav.org
* | Merge commit '543156d7518f5e5d731123da066d86278f9fa492'    Michael Niedermayer    2014-02-19
|\|
| |   * commit '543156d7518f5e5d731123da066d86278f9fa492':
| |     arm: Mark the stack as non-executable
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * arm: Mark the stack as non-executable    Martin Storsjö    2014-02-19
| |   If linking in an object file without this attribute set, the linker will assume that an executable stack might be needed.
| |   Signed-off-by: Martin Storsjö <martin@martin.st>
* | Merge commit 'e3fec3f095ab5ea08ee662942d98526aaf5e3635'    Michael Niedermayer    2014-02-08
|\|
| |   * commit 'e3fec3f095ab5ea08ee662942d98526aaf5e3635':
| |     arm: Add EXTERN_ASM to the .func and .type declarations for exported symbols
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * arm: Add EXTERN_ASM to the .func and .type declarations for exported symbols    Martin Storsjö    2014-02-07
| |   This makes the generated assembly more internally consistent, avoiding declaring two labels for the same function (for cases where EXTERN_ASM is empty) and not declaring a separate unprefixed label in other cases.
| |
| |   This also makes sure the .func and .type declarations have the same prefix. They have previously not been used on the platforms that have prefixed symbols on arm (iOS), but gas-preprocessor has recently started using the .func declarations for adding .thumb_func declarations for such functions.
| |
| |   Signed-off-by: Martin Storsjö <martin@martin.st>
* | Merge remote-tracking branch 'qatar/master'    Michael Niedermayer    2014-01-11
|\|
| |   * qatar/master:
| |     arm: Add an option for making sure NEON registers aren't clobbered
| |   Conflicts:
| |     configure
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * arm: Add an option for making sure NEON registers aren't clobbered    Martin Storsjö    2014-01-11
| |   This is pretty much based on the same test for XMM registers.
| |   Signed-off-by: Martin Storsjö <martin@martin.st>
* | Merge commit '5dae4872357613a0b51120b54a4c5221e0ec3f69'    Michael Niedermayer    2014-01-08
|\|
| |   * commit '5dae4872357613a0b51120b54a4c5221e0ec3f69':
| |     arm: Allow overriding the alignment set in the function macro
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * arm: Allow overriding the alignment set in the function macro    Martin Storsjö    2014-01-07
| |   The function macro always sets .align 2 before declaring the function label (since 5c5e1ea3) and always sets the section to .text (since 278caa6a).
| |
| |   The .align 5 before certain functions, added in fc252eba, were added before .text and .align were added to the function macro and thus became useless/unused when the function macro got them.
| |
| |   This restores the original intention, to align the loop entry points.
| |
| |   Signed-off-by: Martin Storsjö <martin@martin.st>
* | Reinstate proper FFmpeg license for all files.    Thilo Borgmann    2013-08-30
| |
* | Merge commit '7ffda66fd5c81af4725bff7c2c4f207ba2aa0613'    Michael Niedermayer    2013-08-29
|\|
| |   * commit '7ffda66fd5c81af4725bff7c2c4f207ba2aa0613':
| |     arm: float_dsp: Propagate cpu_flags to vfp initialization function
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * arm: float_dsp: Propagate cpu_flags to vfp initialization function    Diego Biurrun    2013-08-29
| |
* | Merge commit '8410d6e93c2e074881f1c7b7e4cdefd2e497d52e'    Michael Niedermayer    2013-08-29
|\|
| |   * commit '8410d6e93c2e074881f1c7b7e4cdefd2e497d52e':
| |     avutil: Refactor CPU extension availability macros
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * avutil: Refactor CPU extension availability macros    Diego Biurrun    2013-08-28
| |
* | Merge commit 'b78b10c4b78b696927f2801cf2d9f193b4eff28b'    Michael Niedermayer    2013-08-29
|\|
| |   * commit 'b78b10c4b78b696927f2801cf2d9f193b4eff28b':
| |     avutil: Move internal CPU detection function declarations to private header
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * avutil: Move internal CPU detection function declarations to private header    Diego Biurrun    2013-08-28
| |
* | Merge commit '439902e0d68a0f0d800c21b5e6b598d5fa0c51da'    Michael Niedermayer    2013-07-19
|\|
| |   * commit '439902e0d68a0f0d800c21b5e6b598d5fa0c51da':
| |     Employ consistent LIBAV_COMPAT_ multiple inclusion guards in compat/
| |   Conflicts:
| |     compat/aix/math.h
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * Employ consistent LIBAV_COMPAT_ multiple inclusion guards in compat/    Diego Biurrun    2013-07-18
| |   Also fix a comment and an #endif comment.
* | Merge remote-tracking branch 'qatar/master'    Michael Niedermayer    2013-05-27
|\|
| |   * qatar/master:
| |     arm: Only output eabi attributes if building for ELF
| |     fix scalarproduct_and_madd_int16_altivec() for orders > 16
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * arm: Only output eabi attributes if building for ELF    Martin Storsjö    2013-05-27
| |   This matches the other eabi attribute in the same file.
| |   This is required in order to build for arm/hardfloat with other object file formats than ELF.
| |   Signed-off-by: Martin Storsjö <martin@martin.st>
* | Merge commit '1fda184a85178cfd7b98d9e308d18e1ded76a511'    Michael Niedermayer    2013-05-05
|\|
| |   * commit '1fda184a85178cfd7b98d9e308d18e1ded76a511':
| |     avutil: Add av_cold attributes to init functions missing them
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * avutil: Add av_cold attributes to init functions missing them    Diego Biurrun    2013-05-04
| |
* | Merge commit '375ef6528c9dd2db7f9881e232cb0ec3aa16970d'    Michael Niedermayer    2013-02-12
|\|
| |   * commit '375ef6528c9dd2db7f9881e232cb0ec3aa16970d':
| |     libfdk-aacenc: Actually check for upper bounds of cutoff
| |     arm: Fall back to runtime cpu feature detection via /proc/cpuinfo
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * arm: Fall back to runtime cpu feature detection via /proc/cpuinfo    Martin Storsjö    2013-02-11
| |   On recent android versions, /proc/self/auxv is unreadable (unless the process is running under the shell uid or in debuggable mode, which makes it hard to notice). See http://b.android.com/43055 and https://android-review.googlesource.com/51271 for more information about the issue.
| |
| |   This makes sure e.g. neon optimizations are enabled at runtime in android apps even when built in release mode, if configured to use the runtime detection.
| |
| |   CC: libav-stable@libav.org
| |   Signed-off-by: Martin Storsjö <martin@martin.st>
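The fallback order described in this message can be sketched as: look for the AT_HWCAP entry (type 16) among the (type, value) pairs read from /proc/self/auxv, and only if that is unavailable use a value derived from /proc/cpuinfo. The sketch below works on an in-memory array so that it stays self-contained. The struct and function names are hypothetical, and it does not reproduce the code from the commit.

```c
#include <assert.h>
#include <stddef.h>

#define AUX_TYPE_NULL   0   /* AT_NULL: terminates the vector   */
#define AUX_TYPE_HWCAP 16   /* AT_HWCAP: hardware capability bits */

/* Hypothetical in-memory form of one auxiliary vector entry. */
struct aux_entry {
    unsigned long type;
    unsigned long value;
};

/* Store the AT_HWCAP value in *hwcap and return 0, or return -1 if the
 * vector is unavailable (NULL, e.g. unreadable file) or has no such entry. */
static int auxv_find_hwcap(const struct aux_entry *auxv, size_t count,
                           unsigned long *hwcap)
{
    size_t i;

    assert(hwcap != NULL);
    if (!auxv)
        return -1;
    for (i = 0; i < count && auxv[i].type != AUX_TYPE_NULL; i++) {
        if (auxv[i].type == AUX_TYPE_HWCAP) {
            *hwcap = auxv[i].value;
            return 0;
        }
    }
    return -1;
}

/* Prefer the auxiliary vector; fall back to a value the caller derived
 * from /proc/cpuinfo when the vector could not be used. */
static unsigned long detect_hwcap(const struct aux_entry *auxv, size_t count,
                                  unsigned long cpuinfo_hwcap)
{
    unsigned long hwcap;

    if (auxv_find_hwcap(auxv, count, &hwcap) == 0)
        return hwcap;
    return cpuinfo_hwcap;
}
```

Passing NULL for the vector models the unreadable-file case, in which the /proc/cpuinfo result is the only source left.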
* | Merge commit '73b704ac609d83e0be124589f24efd9b94947cf9'    Michael Niedermayer    2013-01-23
|\|
| |   * commit '73b704ac609d83e0be124589f24efd9b94947cf9':
| |     arm: Add some missing header #includes
| |     floatdsp: move scalarproduct_float from dsputil to avfloatdsp.
| |   Conflicts:
| |     libavcodec/acelp_pitch_delay.c
| |     libavcodec/amrnbdec.c
| |     libavcodec/amrwbdec.c
| |     libavcodec/ra288.c
| |     libavcodec/x86/dsputil_mmx.c
| |     libavutil/x86/float_dsp.asm
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * floatdsp: move scalarproduct_float from dsputil to avfloatdsp.    Ronald S. Bultje    2013-01-22
| |   This makes the aac decoder and all voice codecs independent of dsputil.
* | Merge commit '5959bfaca396ecaf63a8123055f499688b79cae3'    Michael Niedermayer    2013-01-23
|\|
| |   * commit '5959bfaca396ecaf63a8123055f499688b79cae3':
| |     floatdsp: move butterflies_float from dsputil to avfloatdsp.
| |   Conflicts:
| |     libavcodec/dsputil.c
| |     libavcodec/dsputil.h
| |     libavcodec/imc.c
| |     libavcodec/mpegaudiodec.c
| |   Merged-by: Michael Niedermayer <michaelni@gmx.at>