summaryrefslogtreecommitdiff
path: root/libavcodec/g722dec.c
diff options
context:
space:
mode:
authorMartin Storsjö <martin@martin.st>2022-07-13 00:06:31 +0300
committerMartin Storsjö <martin@martin.st>2022-07-16 17:25:54 +0300
commit68a03f64240dcbe408c3fd43d1071a105508a588 (patch)
tree36b9b2958c50cc1d34a906276b2b12403847bf19 /libavcodec/g722dec.c
parentb46de9aba436dea0cff76f3ed0f7c98448367fd0 (diff)
aarch64: me_cmp: Switch from uabd to uabal in ff_pix_abs16_xy2_neon
Using absolute-difference-accumulate does use twice the amount of absolute-difference instructions, but avoids the need for the uaddl and add instructions, reducing the total number of instructions by 3. These can be interleaved in the rest of the calculation, to avoid tight dependencies at the end. Unfortunately, this is marginally slower on Cortex A53, but faster on A72 and A73. Before: Cortex A53 A72 A73 Graviton 3 pix_abs_0_3_neon: 175.7 109.2 92.0 41.2 After: pix_abs_0_3_neon: 179.7 96.7 87.5 41.2 Signed-off-by: Martin Storsjö <martin@martin.st>
Diffstat (limited to 'libavcodec/g722dec.c')
0 files changed, 0 insertions, 0 deletions