| Commit message | Author | Age |
|
Requires changing the residual calc functions to AVX2.
Also, supply the private prefix via the nasm -D option rather than by
modifying x86inc.asm.
|
Plus an offset parameter that signals the distance between different
coefficients. This avoids passing so many pointers around, which
reduces register pressure and simplifies writing SIMD. It also seems to
be a little faster.
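
A minimal sketch of the idea described above, not the code from the commit: all coefficient arrays are assumed to live in one buffer, so a function takes a single base pointer plus the distance between consecutive coefficient arrays instead of one pointer per coefficient. The names weighted_sum, coeff_stride, nb_coeffs and terms are hypothetical and chosen only for illustration.

```c
#include <assert.h>
#include <stddef.h>

/*
 * Hypothetical layout: nb_coeffs arrays stored in a single buffer, with
 * array k starting at coeffs + k * coeff_stride. Only one pointer and one
 * offset are passed, instead of nb_coeffs separate pointers.
 *
 * Returns sum over k of coeffs[k][idx] * terms[k] for one grid point idx.
 */
static double weighted_sum(const double *coeffs, ptrdiff_t coeff_stride,
                           size_t nb_coeffs, size_t idx,
                           const double *terms)
{
    double sum = 0.0;

    assert(nb_coeffs > 0);

    for (size_t k = 0; k < nb_coeffs; k++)
        sum += coeffs[(ptrdiff_t)k * coeff_stride + (ptrdiff_t)idx] * terms[k];

    return sum;
}
```

With this layout, adding another coefficient changes nb_coeffs but not the function signature, which is one way the number of live pointers (and thus registers) can stay constant.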
|
Make it explicit that it only applies to u, as other arrays are not
indexed beyond the current line.
|
Meson is an extra dependency and generally sucks.
|
Make parameter names clearer and more consistent, document them, and
implement the missing 1U boundary.
|
Also, merge the reflect boundary condition into residual calc+add.
Improves performance due to better locality.
|
Do not do it at every residual calc, which also allows us to get rid of
an extra parameter (and reduce the number of registers used in x86
SIMD).
|
Also, allocate all the diff coeffs together.
|