| Commit message (Collapse) | Author | Age |
| |
|
| |
|
| |
|
| |
|
| |
|
|
|
|
|
|
|
|
|
|
| |
Ignore-this: 5169757c7749834ae595d4d73b39220
Add a new, additional feature to LoopControl: different threads can
work on small regions that are likely to use the same cache entries as
other threads, trying to reduce cache pressure. This makes sense
mostly when the regions are still expensive although they are small,
e.g. for the BSSN RHS.
|
|
|
|
| |
Ignore-this: 309b4dd613f4af2b84aa5d6743fdb6b3
|
|\ |
|
| | |
|
|\| |
|
| |
| |
| |
| | |
Some systems have broken implementations, and it is not important.
|
| |
| |
| |
| |
| |
| |
| | |
When a loop has only zero grid points, ensure that the step size is still
positive.
Correct some comments.
|
| | |
|
| |
| |
| |
| | |
(make a difference between standalone and non-standalone)
|
| |
| |
| |
| | |
to ensure that things compile even without OpenMP present.
|
| |
| |
| |
| |
| |
| |
| | |
When a loop has only zero grid points, ensure that the step size is still
positive.
Correct some comments.
|
| | |
|
|/ |
|
|
|
|
|
| |
(scheduled at STARTUP) which can be triggered via LoopControl::run_demo = "yes"
and controled with LoopControl::n{xyz} and LoopControl:nsteps.
|
|
|
|
|
| |
Allow user-defined statements surrounding the innermost loop, e.g. for
vectorisation.
|
| |
|
| |
|
|
|
|
|
|
|
| |
Use #pragma instead of _Pragma for OpenMP directives. The PGI compilers do
not understand the _Pragma syntax.
Use int instead of ptrdiff_t for loop variables which are parallelised via
OpenMP. The PGI compilers cannot handle ptrdiff_t.
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Add parameter printstatus. Output statistics only if desired.
Add parameter legacy_init as new initial parallelisation choice.
Add parameter ignore_initial_overhead, which ignores the first time
the loop is encountered when optimising the parallelisation.
Correct serious inconsistency when choosing the tiling.
Output average loop time as well.
darcs-hash:20080204211904-dae7b-ef0175704efb2c8e0e20ee6ee04a468803b3a5ec.gz
|
|
|
|
| |
darcs-hash:20071010133923-dae7b-a3406485d61ab795191655c89a16e2ae2c487978.gz
|
|
|
|
|
|
|
|
| |
(Nice idea, but doesn't seem to work right. Maybe only the parameters
need to be chosen differently? But I rather think that a more
intelligent method is necessary.)
darcs-hash:20070826025505-dae7b-ed81bc28a4204d84776d28443be65a995c52699b.gz
|
|
|
|
| |
darcs-hash:20070826025425-dae7b-42f571cb9a9657e2f0143881145d1f97a2b8d5b6.gz
|
|
darcs-hash:20070821185122-dae7b-cde121a591484710e62b2b82453ef592d80786e4.gz
|