- Oct 14, 2021
-
-
Jakub Klinkovský authored
-
- Oct 01, 2021
-
-
Jakub Klinkovský authored
Fixes #86
-
- Sep 30, 2021
-
-
Jakub Klinkovský authored
This fixes the previous commit.
-
Jakub Klinkovský authored
These functions expect all index parameters to have the same type, so we need to cast 0 to the IndexType, otherwise the function lookup will fail when IndexType is not int.
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
Added __device__ flag to the Shared union constructors in CudaReductionKernel.h and CudaScanKernel.h to make clang happy
-
- Sep 28, 2021
-
-
Jakub Klinkovský authored
Addresses #86
-
- Sep 27, 2021
-
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
- Sep 24, 2021
-
-
Jakub Klinkovský authored
Also fixed the header in Preconditioner.h
-
- Sep 23, 2021
-
-
Jakub Klinkovský authored
-
- Sep 16, 2021
-
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
MPI refactoring See merge request !101
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
It does not make sense to print a distributed array like this, because it may contain ghost elements. Users should examine the local array view manually.
-
Jakub Klinkovský authored
Replaced send/receive for Array(,View) and mpiSend/mpiReceive for String with a general implementation in the MPI namespace Also added analogous functions: MPI::sendrecv, MPI::bcast.
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
- removed wrapper functions: AllGroup, NullGroup, NullRequest - added MPI_COMM_WORLD and other handles to MPI/DummyDefs.h - renamed getCommunicationGroup to getCommunicator in all data structures - improved naming to match the MPI terminology: communicator instead of group
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
CUDA kernels should not ever work with distributed data structures, they should always get the underlying *local* data structure.
-
Jakub Klinkovský authored
- The "NullGroup" should not be used even when built without MPI, otherwise the behaviour is very bug-prone because "NullGroup" usage is not caught and changing the build type leads to a different semantics. - "AllGroup" is not a good default value for the parameters, considering that the class attributes are initialized to "NullGroup".
-
- Sep 14, 2021
-
-
Jakub Klinkovský authored
Buffering with a small value is very slow when profiling anything in nvpp.
-
- Sep 03, 2021
-
-
Jakub Klinkovský authored
-
- Sep 02, 2021
-
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
- Sep 01, 2021
-
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
- Aug 31, 2021
-
-
Jakub Klinkovský authored
-
- Aug 27, 2021
-
-
Jakub Klinkovský authored
Amends e5fc6a96
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
- Aug 11, 2021
-
-
Jakub Klinkovský authored
Scan refactoring Closes #87 See merge request !100
-
- Aug 08, 2021
-
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
Jakub Klinkovský authored
-
- Aug 06, 2021
-
-
Jakub Klinkovský authored
- structs from HorizontalOperations.h reimplemented as function objects in Functional.h - repetitive function definitions generated using macros - added new operators: % (modulus) and ^ (xor)
-