aboutsummaryrefslogtreecommitdiff
path: root/residual_calc.asm
Commit message (Collapse)AuthorAge
* egs: merge residual calc and correct when possibleAnton Khirnov2019-04-24
| | | | | Also, merge the reflect boundary condition into residual calc+add. Improves performance due to better locality.
* egs: premultiply diff_coeffs with the denominator in initAnton Khirnov2019-04-19
| | | | | | Do not do it at every residual calc, which also allows us to get rid of an extra parameter (and reduce the number of registers used in x86 SIMD).
* residual_calc.asm: use the correct coefficients for y derivativesAnton Khirnov2019-02-02
|
* ell_relax: compute the residual norm in residual_calc()Anton Khirnov2019-01-13
| | | | It is cheap and avoids an extra step in mg2d.
* residual_calc.asm: fix partial storesAnton Khirnov2019-01-13
| | | | .store1 and .store3 were switched
* residual_calc.asm: calculate x*=16 by x*=8; x+=xAnton Khirnov2019-01-13
| | | | Frees up one mm register for future use.
* residual_calc.asm: implement writing partial blocksAnton Khirnov2019-01-10
| | | | Avoid overwriting anything over the specified line size.
* residual_calc.asm: templatize the entire residual computationAnton Khirnov2019-01-10
|
* residual_calc.asm: templatize computing the mixed derivativeAnton Khirnov2019-01-10
|
* residual_calc.asm: templatize computing non-mixed derivativesAnton Khirnov2019-01-10
|
* residual_calc.asm: make mm register use more consistent between s1 and s2Anton Khirnov2019-01-10
|
* residual_calc.asm: make register use in s1 more similar to s2Anton Khirnov2019-01-10
|
* residual_calc.asm: reduce the use of magic constantsAnton Khirnov2019-01-10
|
* residual_calc.asm: reduce register use in the s1 variantAnton Khirnov2019-01-10
| | | | | Make it similar to the s2 version, which should make it easier to templatize the code in the future.
* ell_relax: add AVX SIMD for residual_calcAnton Khirnov2018-12-27