aboutsummaryrefslogtreecommitdiff
path: root/residual_calc.c
Commit message (Collapse)AuthorAge
* x86inc.asm: update to current master 04f14f43Anton Khirnov2024-04-16
| | | | | | | Requires changing residual calc functions to AVX2. Also, supply the private prefix via nasm -D option rather than modifying x86inc.asm.
* residual_calc: accept all diff coefficients in a single arrayAnton Khirnov2024-04-15
| | | | | | | Plus an offset parameter that signals the distance between different coefficients. This allows to avoid passing so many pointers around, which reduces register pressure and simplifies writing SIMD. Seems also to be a little faster.
* residual_calc: rename stride to u_strideAnton Khirnov2024-04-15
| | | | | Make it explicit that it only applies to u, as other arrays are not indexed beyond curent line.
* residual_calc: typedef computation kernelsAnton Khirnov2024-04-15
|
* Switch from meson to configure+make.Anton Khirnov2024-04-15
| | | | Meson is an extra dependency and generally sucks.
* rescalc: improve reflection boundary conditionsAnton Khirnov2019-06-14
| | | | | Make parameter names more clear/consistent, document them, implement missing 1U boundary.
* egs: merge residual calc and correct when possibleAnton Khirnov2019-04-24
| | | | | Also, merge the reflect boundary condition into residual calc+add. Improves performance due to better locality.
* egs: premultiply diff_coeffs with the denominator in initAnton Khirnov2019-04-19
| | | | | | Do not do it at every residual calc, which also allows us to get rid of an extra parameter (and reduce the number of registers used in x86 SIMD).
* egs: do not assume the same stride for all arraysAnton Khirnov2019-04-17
| | | | Also, allocate all the diff coeffs together.
* ell_grid_solve: split residual computation into its own fileAnton Khirnov2019-01-30