- Aug 17, 2019
-
Jakub Klinkovský authored
Fixes #42
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
Benchmarks can be easily profiled even without this parameter, so it was just an unnecessary complication.
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
This is nicer because it more clearly separates data load, computation, and data store. Furthermore, it allows using instances of std::plus, std::logical_and, std::logical_or, etc. instead of custom lambda functions.
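To illustrate the separation described in the note above, here is a minimal sequential sketch. It is not the project's actual interface: `reduce_sketch`, `sum`, and `all_positive` are hypothetical names introduced only for this example, and the real code is presumably parallel. The point is that the data load (`fetch`), the computation (`reduce`), and the data store (the returned value) are independent pieces, so standard function objects can stand in for hand-written lambdas.

```cpp
#include <cstddef>
#include <functional>
#include <vector>

// Hypothetical helper (an assumption, not the library's API): a sequential
// reduction that keeps the data load (fetch) apart from the computation (reduce).
// The caller decides where to store the returned result.
template <typename Result, typename Index, typename Fetch, typename Reduce>
Result reduce_sketch(Index size, Fetch&& fetch, Reduce&& reduce, Result identity)
{
    Result result = identity;
    for (Index i = 0; i < size; ++i)
        result = reduce(result, fetch(i));  // load element i, then combine
    return result;
}

// Sum of all elements: std::plus replaces a custom lambda.
inline int sum(const std::vector<int>& v)
{
    return reduce_sketch<int>(v.size(),
                              [&](std::size_t i) { return v[i]; },
                              std::plus<>{}, 0);
}

// True if every element is positive: the fetch step maps each element to bool,
// and std::logical_and combines the partial results.
inline bool all_positive(const std::vector<int>& v)
{
    return reduce_sketch<bool>(v.size(),
                               [&](std::size_t i) { return v[i] > 0; },
                               std::logical_and<>{}, true);
}
```

Because the fetch step is separate, the same `reduce_sketch` serves both a numeric sum and a logical test without changing the reduction operation's signature.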
-
Jakub Klinkovský authored
It contained only methods for prefixSum and segmentedPrefixSum, which were identical for Host and Cuda, so they can be easily implemented directly in Vector and VectorView.
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
The performance seems to be identical to the code using volatile.
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
They are not implemented anyway...
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
This is necessary to be able to run tests in parallel.
-
Jakub Klinkovský authored
The build takes too long because of this and nobody uses it anyway. CUDA does not support long double in device code at all.
-
- Aug 16, 2019
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
- Aug 14, 2019
-
Jakub Klinkovský authored
NDArray. See merge request !18
-
Jakub Klinkovský authored
DistributedNDArray: added methods getLocalIndexer, getLocalView, getConstLocalView, getStorageIndex and getData
-
Jakub Klinkovský authored
This is necessary for local indexing of DistributedNDArray.
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
It works for any value, device and index types, but the permutations of both arrays must be the same and both arrays have to be contiguous.
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-