| Commit message (Collapse) | Author | Age |
|
|
|
| |
git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@40 105869f7-3296-0410-a4ea-f4349344b45a
|
|
|
|
| |
git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@39 105869f7-3296-0410-a4ea-f4349344b45a
|
|
|
|
|
|
|
|
|
|
|
|
| |
Remove kpos, because it is not used (it is a no-op, i.e. the
arithmetic + operator).
Add sin, cos, and tan.
Begin to implement (still commented out) integer vector operations.
git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@38 105869f7-3296-0410-a4ea-f4349344b45a
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Use the "andnot" instruction to reduce the number of different bit
masks that are required. Using fewer different bit masks may require
fewer registers to hold them, or fewer load instructions to access
them, thus potentially improving performance.
Do not scalarize ifpos when SSE 4.1 is not available; instead, use
logical operations to create a bit mask.
git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@31 105869f7-3296-0410-a4ea-f4349344b45a
|
|
|
|
|
|
|
| |
Indent vector architecture definitions.
git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@30 105869f7-3296-0410-a4ea-f4349344b45a
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Several C++ compilers cannot handle std::signbit; use a work-around
instead.
Correct a namespace problem when using the same identifier Vectors_SGN
for different precisions (real*4 and real*8).
Correct kifpos implementation incorrectly on several architectures.
git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@29 105869f7-3296-0410-a4ea-f4349344b45a
|
|
|
|
|
|
| |
This macro is only used internally anyway.
git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@28 105869f7-3296-0410-a4ea-f4349344b45a
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Add vectorisation test for vector creation, load, and store
statements.
Convert C to C++ since vectorisation requires C++.
Add test case.
Beautify vectorsation templates.
git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@26 105869f7-3296-0410-a4ea-f4349344b45a
|
|
|
|
| |
git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@24 105869f7-3296-0410-a4ea-f4349344b45a
|
|
|
|
| |
git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@22 105869f7-3296-0410-a4ea-f4349344b45a
|
|
|
|
|
|
| |
available.
git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@20 105869f7-3296-0410-a4ea-f4349344b45a
|
|
|
|
| |
git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@15 105869f7-3296-0410-a4ea-f4349344b45a
|
|
|
|
| |
git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@14 105869f7-3296-0410-a4ea-f4349344b45a
|
|
|
|
| |
git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@12 105869f7-3296-0410-a4ea-f4349344b45a
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
Introduce configuration-time options for vectorisation, including
options to allow architecture-specific choices that may influence
performance.
Introduce "middle" masked stores for large vector sizes and small
loops.
Clean up and simplify some of the implementation code.
git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@10 105869f7-3296-0410-a4ea-f4349344b45a
|
|
Add support for AVX (next-generation SSE)
Add support for Double Hummer (Blue Gene/P)
git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@7 105869f7-3296-0410-a4ea-f4349344b45a
|