summaryrefslogtreecommitdiff
path: root/libavcodec/x86/h264_deblock_10bit.asm
diff options
context:
space:
mode:
authorChristophe Gisquet <christophe.gisquet@gmail.com>2012-01-19 21:48:39 +0100
committerDiego Biurrun <diego@biurrun.de>2012-01-30 10:19:55 +0100
commit6b039003822a03add20c7ba91fc857dca52b0a03 (patch)
tree66ed7686c3377bce8accbed2fbc471c9a5931dbb /libavcodec/x86/h264_deblock_10bit.asm
parenta846202343af7c56bf444ec47d4bb26a5d2b83ce (diff)
x86 dsputil: provide SSE2/SSSE3 versions of bswap_buf
While pshufb allows emulating bswap on XMM registers for SSSE3, more shuffling is needed for SSE2. Alignment is critical, so specific codepaths are provided for this case. For the huffyuv sequence "angels_480-huffyuvcompress.avi": C (using bswap instruction): ~ 55k cycles SSE2: ~ 40k cycles SSSE3 using unaligned loads: ~ 35k cycles SSSE3 using aligned loads: ~ 30k cycles Signed-off-by: Diego Biurrun <diego@biurrun.de>
Diffstat (limited to 'libavcodec/x86/h264_deblock_10bit.asm')
0 files changed, 0 insertions, 0 deletions