summaryrefslogtreecommitdiff
path: root/libavcodec/x86/hevc_deblock.asm
Commit message (Collapse)AuthorAge
* x86: lavc: share more constant through definesChristophe Gisquet2015-02-07
| | | | Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* x86: vpx/h264/hevc/mpeg2: share constantsChristophe Gisquet2014-08-06
| | | | Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* hevc_deblock: change tc typeChristophe Gisquet2014-08-06
| | | | | | | The x86 asm expects int32_t so use that type. Reviewed-by: Mickaël Raulet <mraulet@insa-rennes.fr> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* x86: hevc_deblock: remove unnecessary maskingChristophe Gisquet2014-08-04
| | | | | | | | | | | | | | The unpacks/shuffles later on makes it unnecessary. Before: 1508 decicycles in h, 2096759 runs, 393 skips 2512 decicycles in v, 2095422 runs, 1730 skips After: 1477 decicycles in h, 2096745 runs, 407 skips 2484 decicycles in v, 2095297 runs, 1855 skips Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* x86/hevc_deblock: improve 8bit transpose store macrosJames Almer2014-08-03
| | | | | | | Up to four instructions less depending on function and instruction set. Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* x86/hevc_deblock: use existing x86util transpose macro in chroma_{10, 12}James Almer2014-07-31
| | | | | | | Cosmetic change. No measurable difference in speed. Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* x86/hevc_deblock: add add ff_hevc_[hv]_loop_filter_luma_{8, 10, 12}_avxJames Almer2014-07-29
| | | | | | | ~5% faster than SSSE3 Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* x86/hevc_deblock: improve luma functions register allocationJames Almer2014-07-29
| | | | | Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* x86/hevc_deblock: remove some unnecessary instructionsJames Almer2014-07-29
| | | | | Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* x86/hevc_deblock: use psignw instead of pmullw where possibleJames Almer2014-07-29
| | | | | | | It's slightly faster Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* x86/hevc_deblock: load less data in hevc_h_loop_filter_luma_8James Almer2014-07-28
| | | | | | | Reading 8 bytes is enough. Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* Merge commit '1a880b2fb8456ce68eefe5902bac95fea1e6a72d'Michael Niedermayer2014-07-27
|\ | | | | | | | | | | | | | | | | | | | | | | | | | | | | * commit '1a880b2fb8456ce68eefe5902bac95fea1e6a72d': hevc: SSE2 and SSSE3 loop filters Conflicts: libavcodec/hevcdsp.c libavcodec/hevcdsp.h libavcodec/x86/Makefile libavcodec/x86/hevc_deblock.asm libavcodec/x86/hevcdsp_init.c See: de7b89fd43f850d77cf24ad6ae50185dfe391e91 and several others Merged-by: Michael Niedermayer <michaelni@gmx.at>
| * hevc: SSE2 and SSSE3 loop filtersPierre Edouard Lepere2014-07-26
| | | | | | | | Additional contributions by James Almer <jamrial@gmail.com>, Carl Eugen Hoyos <cehoyos@ag.or.at>, Fiona Glaser <fiona@x264.com> and Anton Khirnov <anton@khirnov.net> Signed-off-by: Anton Khirnov <anton@khirnov.net>
* x86/hevc: remove unused constant in deblocking filterMickaël Raulet2014-07-26
| | | | | | cherry picked from commit a3f7282eaa6f1ab0524fb966c6eade50c3025f99 Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* x86/hevc: add 12bits support for deblocking filterMickaël Raulet2014-07-26
| | | | | | cherry picked from commit 97d46afe320c7d61d7b9525e5f5588355cde4bb0 Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* avcodec/x86/hevc_deblock: Fix compilation with nasm.Carl Eugen Hoyos2014-07-23
|
* hevcdsp: remove an unneeded variable in the loop filterAnton Khirnov2014-07-22
| | | | | | | | | beta0 and beta1 will always be the same within a CU Signed-off-by: Mickaël Raulet <mraulet@insa-rennes.fr> cherry picked from commit 4a23d824741a289c7d2d2f2871d1e2621b63fa1b Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* avcodec/x86/hevc_deblock: cosmeticsAnton Khirnov2014-07-22
| | | | | cherry picked from commit f7843356253459e6010320292dbbc1e888a5249b Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* hevc: cleanups in SSE2 and SSSE3 loop filters, use fewer instructionsAnton Khirnov2014-07-22
| | | | | cherry picked from commit f7843356253459e6010320292dbbc1e888a5249b Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* avcodec/x86/hevc_deblock: use test instead of cmp 0Anton Khirnov2014-07-22
| | | | | cherry picked from commit f7843356253459e6010320292dbbc1e888a5249b Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* avcodec/x86/hevc_deblock: use of paddw instead of psllwAnton Khirnov2014-07-22
| | | | | cherry picked from commit f7843356253459e6010320292dbbc1e888a5249b Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* avcodec/x86/hevc_deblock: add %ifs to avoid "do nothing instructions"Anton Khirnov2014-07-22
| | | | | cherry picked from commit f7843356253459e6010320292dbbc1e888a5249b Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* hevc: cleaning up SSE2 and SSSE3 deblocking filtersAnton Khirnov2014-07-22
| | | | | | | Signed-off-by: Mickaël Raulet <mraulet@insa-rennes.fr> cherry picked from commit b432041d7d1eca38831590f13b4e5baffff8186f Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* x86/hevc_deblock: add ff_hevc_[hv]_loop_filter_luma_{8, 10}_sse2James Almer2014-07-13
| | | | | | Signed-off-by: James Almer <jamrial@gmail.com> Reviewed-by: Kieran Kunhya <kierank@obe.tv> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* x86/hevc_deblock: improve chroma functions register allocationJames Almer2014-05-24
| | | | | Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* x86/hevc_deblock: use constants instead of generating values at runtimeJames Almer2014-05-19
| | | | | Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* x86/hevc_deblock: remove some duplicated instructionsJames Almer2014-05-18
| | | | | | | Also remove a couple unnecessary cmps Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* Fix compilation of libavcodec/x86/hevc_deblock.asm with nasm.Carl Eugen Hoyos2014-05-17
| | | | Suggested-by: Reimar
* x86/hevc_deblock: use movhps instead of shuffling valuesJames Almer2014-05-17
| | | | | Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* x86/hevc_deblock: fix label namesJames Almer2014-05-17
| | | | | | | Also remove some unnecessary jmps Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* x86/hevc_deblock: fix usage of ABS1James Almer2014-05-17
| | | | | | | The second argument is a temp register for non-SSSE3 cases Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* x86/hevc_deblock: merge movs with other instructionsJames Almer2014-05-17
| | | | | Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* avcodec/x86/hevc: updated to use x86util macrosplepere2014-05-16
| | | | | | Reviewed-by: James Almer <jamrial@gmail.com> Reviewed-by: Ronald S. Bultje Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
* avcodec/x86/hevc: added DBF assembly functionsplepere2014-05-16
Reviewed-by: James Almer <jamrial@gmail.com> Reviewed-by: Ronald S. Bultje Signed-off-by: Michael Niedermayer <michaelni@gmx.at>