Commit message | Author | Age
* Use NOTZERO (~0) instead of IMIN | eschnett | 2012-08-16
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@63 105869f7-3296-0410-a4ea-f4349344b45a
* Add ksgn function (vectorised version of Kranc's Sign) | eschnett | 2012-08-11
  All architectures: Add copysign and sgn functions. Remove pos function (which does nothing). Add support for Blue Gene/Q (QPX instructions). Correct errors in AVX instructions.
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@62 105869f7-3296-0410-a4ea-f4349344b45a
* Wrap isnan calls to avoid C++ namespace issues | eschnett | 2012-08-02
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@61 105869f7-3296-0410-a4ea-f4349344b45a
* Avoid nans during self-tests | eschnett | 2012-05-13
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@59 105869f7-3296-0410-a4ea-f4349344b45a
* Support partial stores for the Double Hummer architecture (Blue Gene/P) | eschnett | 2012-05-03
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@58 105869f7-3296-0410-a4ea-f4349344b45a
* make sure the test sees an expected nan as a success | knarf | 2012-04-17
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@57 105869f7-3296-0410-a4ea-f4349344b45a
* Correct comment | eschnett | 2012-04-02
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@56 105869f7-3296-0410-a4ea-f4349344b45a
* Implement asin, sinh, asinh, and friends | eschnett | 2012-04-02
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@55 105869f7-3296-0410-a4ea-f4349344b45a
* Add missing casts. | barry.wardell | 2012-03-02
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@54 105869f7-3296-0410-a4ea-f4349344b45a
* Add missing functionality for single precision unvectorised architecture. | barry.wardell | 2012-03-02
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@53 105869f7-3296-0410-a4ea-f4349344b45a
* Add tolerance to test cases | eschnett | 2012-02-06
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@52 105869f7-3296-0410-a4ea-f4349344b45a
* Implement missing functionality | eschnett | 2012-02-06
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@51 105869f7-3296-0410-a4ea-f4349344b45a
* Make macro arguments safe | eschnett | 2012-02-06
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@50 105869f7-3296-0410-a4ea-f4349344b45a
* Correct test case for partial vector stores | eschnett | 2012-02-05
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@49 105869f7-3296-0410-a4ea-f4349344b45a
* Correct partial vector stores | eschnett | 2012-02-05
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@48 105869f7-3296-0410-a4ea-f4349344b45a
* Various changes | eschnett | 2012-02-05
  1. Implement a simplified partial store interface
     Implement vec_store_nta_partial, which offers a simpler interface, similar to the one used in OpenCL.
  2. Add kifmsg function, and implement kifpos and kifneg in terms of this.
  3. Update (and make safer) Kranc-specific code
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@47 105869f7-3296-0410-a4ea-f4349344b45a
* Make building with the Vectors thorn quieter | eschnett | 2012-01-04
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@46 105869f7-3296-0410-a4ea-f4349344b45a
* Translate Kranc's Pi to M_PI | eschnett | 2012-01-02
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@45 105869f7-3296-0410-a4ea-f4349344b45a
* Make vectorisation macros safer | eschnett | 2011-12-22
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@44 105869f7-3296-0410-a4ea-f4349344b45a
* Output more information in test case failure error messages | eschnett | 2011-12-21
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@43 105869f7-3296-0410-a4ea-f4349344b45a
* Simplify setting architecture description strings | eschnett | 2011-12-21
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@42 105869f7-3296-0410-a4ea-f4349344b45a
* Implement sqrt | eschnett | 2011-12-21
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@41 105869f7-3296-0410-a4ea-f4349344b45a
* Don't use <x86intrin.h>; this does not exist everywhere | eschnett | 2011-12-15
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@40 105869f7-3296-0410-a4ea-f4349344b45a
* Support FMA4 instructions (AMD's fused multiply-add) | eschnett | 2011-12-14
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@39 105869f7-3296-0410-a4ea-f4349344b45a
* LSUThorns/Vectors: Remove pos, add sin/cos/tan functions | eschnett | 2011-12-02
  Remove kpos, because it is not used (it is a no-op, i.e. the arithmetic + operator). Add sin, cos, and tan. Begin to implement (still commented out) integer vector operations.
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@38 105869f7-3296-0410-a4ea-f4349344b45a
* Add vec_index function | eschnett | 2011-12-02
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@37 105869f7-3296-0410-a4ea-f4349344b45a
* Correct vector alignment test | eschnett | 2011-11-26
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@36 105869f7-3296-0410-a4ea-f4349344b45a
* Change default of VECTORISE_INLINE to "no" | hinder | 2011-10-22
  Most of the machines supported by SimFactory that use the Intel compiler have to set this to "no" to avoid the compiler running out of memory when compiling ML_BSSN. We choose "no" as the default now to avoid problems for users of machines not supported in SimFactory.
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@33 105869f7-3296-0410-a4ea-f4349344b45a
* Improve layout of README | eschnett | 2011-09-27
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@32 105869f7-3296-0410-a4ea-f4349344b45a
* Use "andnot" instruction when vectorising | eschnett | 2011-09-26
  Use the "andnot" instruction to reduce the number of different bit masks that are required. Using fewer different bit masks may require fewer registers to hold them, or fewer load instructions to access them, thus potentially improving performance.
  Do not scalarize ifpos when SSE 4.1 is not available; instead, use logical operations to create a bit mask.
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@31 105869f7-3296-0410-a4ea-f4349344b45a
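The bit-mask saving described in the "andnot" commit above can be illustrated with a minimal scalar sketch. It is not code from the Vectors thorn: the helper names (`to_bits`, `from_bits`, `andnot`, `sketch_fabs`, `sketch_copysign`) are hypothetical, and the 64-bit integer operations only emulate what a vector instruction would do per lane. The sketch assumes the operand order of the SSE2 `_mm_andnot_pd` intrinsic, which computes `(~a) & b`. With that operation, the single sign-bit constant serves both to clear and to extract the sign, so no separate "all bits except the sign" mask is needed.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical names throughout; scalar emulation for illustration only.

// The only mask constant needed: the IEEE 754 double-precision sign bit.
static const std::uint64_t SIGN_MASK = 0x8000000000000000ULL;

// Reinterpret a double as its bit pattern and back (memcpy avoids aliasing issues).
static std::uint64_t to_bits(double x) {
  std::uint64_t b;
  std::memcpy(&b, &x, sizeof b);
  return b;
}
static double from_bits(std::uint64_t b) {
  double x;
  std::memcpy(&x, &b, sizeof x);
  return x;
}

// Same operand order as SSE2 _mm_andnot_pd(a, b): (~a) & b.
static std::uint64_t andnot(std::uint64_t a, std::uint64_t b) {
  return ~a & b;
}

// Absolute value: clear the sign bit using andnot with the sign mask,
// instead of AND with a second "abs" mask (0x7fff...).
static double sketch_fabs(double x) {
  return from_bits(andnot(SIGN_MASK, to_bits(x)));
}

// copysign: magnitude of x (sign cleared via andnot) combined with the sign of y.
static double sketch_copysign(double x, double y) {
  return from_bits(andnot(SIGN_MASK, to_bits(x)) | (SIGN_MASK & to_bits(y)));
}

// Small self-check exercising the helpers.
static void self_check() {
  assert(sketch_fabs(-2.5) == 2.5);
  assert(sketch_copysign(3.0, -1.0) == -3.0);
}
```

Calling `self_check()` from a test driver exercises both helpers. In a real vectorised build the same pattern would apply to whole registers, with the sign mask loaded once.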
* Suggest asm statements to support SSE4a with Intel compilers. | eschnett | 2011-08-25
  Indent vector architecture definitions.
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@30 105869f7-3296-0410-a4ea-f4349344b45a
* Make more C++ compilers understand the signbit function | eschnett | 2011-08-20
  Several C++ compilers cannot handle std::signbit; use a work-around instead. Correct a namespace problem when using the same identifier Vectors_SGN for different precisions (real*4 and real*8). Correct the kifpos implementation, which was incorrect on several architectures.
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@29 105869f7-3296-0410-a4ea-f4349344b45a
* Use a macro name which is less likely to conflict with an existing macro. | svn_bwardell | 2011-08-08
  This macro is only used internally anyway.
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@28 105869f7-3296-0410-a4ea-f4349344b45a
* Remove TODO comment | eschnett | 2011-08-08
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@27 105869f7-3296-0410-a4ea-f4349344b45a
* Add more vectorisation tests. Add test case. | eschnett | 2011-08-08
  Add vectorisation test for vector creation, load, and store statements. Convert C to C++ since vectorisation requires C++. Add test case. Beautify vectorisation templates.
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@26 105869f7-3296-0410-a4ea-f4349344b45a
* Add information about tests to README. | svn_bwardell | 2011-08-07
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@25 105869f7-3296-0410-a4ea-f4349344b45a
* Rename kifthen to kifpos as it more accurately reflects what it actually does. | svn_bwardell | 2011-08-07
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@24 105869f7-3296-0410-a4ea-f4349344b45a
* Unconditionally run tests and abort if any fail. | svn_bwardell | 2011-08-07
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@23 105869f7-3296-0410-a4ea-f4349344b45a
* Make definition of vec_architecture for SSE and default more explicit. | svn_bwardell | 2011-08-07
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@22 105869f7-3296-0410-a4ea-f4349344b45a
* Fix reversed definition of vec_architecture. | svn_bwardell | 2011-08-07
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@21 105869f7-3296-0410-a4ea-f4349344b45a
* Fix definition of kifthen for architectures where blend instructions are not available. | svn_bwardell | 2011-08-07
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@20 105869f7-3296-0410-a4ea-f4349344b45a
* Also print architecture in startup message. | svn_bwardell | 2011-08-07
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@19 105869f7-3296-0410-a4ea-f4349344b45a
* Add code to test vectorisation macros. | svn_bwardell | 2011-08-07
  This is conditionally run dependent on the "tests" parameter which can take the values "report", "abort" or "skip". The default is "skip" which causes the tests not to run.
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@18 105869f7-3296-0410-a4ea-f4349344b45a
* apparently OPTIMIZE_MODE isn't arriving here, neither is CCTK_OPTIMIZE_MODE. So, for the moment treat an empty OPTIMIZE as 'yes', as Cactus itself also does | knarf | 2011-08-04
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@17 105869f7-3296-0410-a4ea-f4349344b45a
* don't rely on OPTIMISE or OPTIMIZE (both might not be defined), but use OPTIMISE_MODE, which is always provided by Cactus, and consistent with the other two | knarf | 2011-08-04
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@16 105869f7-3296-0410-a4ea-f4349344b45a
* Fix typo which caused k8ifthen to not compile if SSE4.1 was not available. | svn_bwardell | 2011-07-21
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@15 105869f7-3296-0410-a4ea-f4349344b45a
* Fix error in definition of k8abs_mask for SSE2 architectures. | svn_bwardell | 2011-07-21
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@14 105869f7-3296-0410-a4ea-f4349344b45a
* Allow US spelling OPTIMIZE for OPTIMISE. Correct error message. | eschnett | 2011-06-21
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@13 105869f7-3296-0410-a4ea-f4349344b45a
* Add new API elements "kifthen" and "vec_architecture" | eschnett | 2011-06-20
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@12 105869f7-3296-0410-a4ea-f4349344b45a
* Add startup message giving vector length | hinder | 2011-06-16
  This allows you to check that a simulation is using vectorisation.
  git-svn-id: https://svn.cct.lsu.edu/repos/numrel/LSUThorns/Vectors/trunk@11 105869f7-3296-0410-a4ea-f4349344b45a