Commits · 57db358c278127f124e5f5f1a281054cca588b64 · TNL / tnl-dev

Oct 25, 2019

Split ArrayOperations into MemoryOperations and MultiDeviceMemoryOperations · 57db358c
Jakub Klinkovský authored 5 years ago

This will be necessary to avoid code bloat with more than 2 devices
(execution types).

57db358c
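
The split described in 57db358c can be pictured with a minimal sketch. It assumes two CPU-side device tags so that it runs anywhere; the tag types, the member names `set` and `copy`, and the generic fallback are illustrative assumptions, not the actual TNL implementation. The idea is that single-device operations need one specialization per device, while only the cross-device operations depend on a (destination, source) pair.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>

// Hypothetical device tags, for illustration only.
struct Sequential {};
struct Host {};

// Single-device operations: one specialization per device.
template< typename Device >
struct MemoryOperations;

template<>
struct MemoryOperations< Sequential >
{
   template< typename T >
   static void set( T* data, const T& value, std::size_t size )
   {
      std::fill( data, data + size, value );
   }
};

// In this sketch the Host variant simply reuses the sequential code.
template<>
struct MemoryOperations< Host > : MemoryOperations< Sequential > {};

// Cross-device operations: parametrized by a (destination, source) pair.
// A generic fallback covers pairs that share an address space; pairs that
// need a real transfer (e.g. host <-> GPU) would get their own specializations.
template< typename DestinationDevice, typename SourceDevice >
struct MultiDeviceMemoryOperations
{
   template< typename D, typename S >
   static void copy( D* destination, const S* source, std::size_t size )
   {
      std::copy( source, source + size, destination );
   }
};

inline void exampleUsage()
{
   int a[ 3 ] = { 0, 0, 0 };
   long b[ 3 ] = { 0, 0, 0 };
   MemoryOperations< Host >::set( a, 7, 3 );
   MultiDeviceMemoryOperations< Host, Sequential >::copy( b, a, 3 );
   assert( b[ 0 ] == 7 && b[ 2 ] == 7 );
}
```

With this layout, adding a new device means adding one `MemoryOperations` specialization plus only those cross-device pairs that cannot use the generic fallback.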

ArrayOperations: using more parallel algorithms and suitable sequential fallbacks · 986e25fc

- cudaMemcpy is slower than our ParallelFor kernel for CUDA
- use std::copy and std::equal instead of memcpy and memcmp, but only as
  sequential fallbacks
- use parallel algorithms for containsValue and containsOnlyValue (again
  with sequential fallbacks)

986e25fc
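
As a rough illustration of the sequential fallbacks mentioned in 986e25fc, the sketch below wraps `std::copy`, `std::equal` and `std::find`. The function names are hypothetical and the real TNL code may differ. It shows only the sequential path, not the parallel algorithms.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>

// Hypothetical helper names for illustration; they are not TNL's API.

// Sequential copy. Unlike memcpy, std::copy assigns element by element,
// so the source and destination element types may differ (e.g. int -> double).
template< typename Destination, typename Source >
void copySequential( Destination* destination, const Source* source, std::size_t size )
{
   std::copy( source, source + size, destination );
}

// Sequential comparison. Unlike memcmp, std::equal compares elements with
// operator== rather than comparing raw bytes.
template< typename T1, typename T2 >
bool compareSequential( const T1* a, const T2* b, std::size_t size )
{
   return std::equal( a, a + size, b );
}

// Sequential search for a single value.
template< typename T >
bool containsValueSequential( const T* data, std::size_t size, const T& value )
{
   return std::find( data, data + size, value ) != data + size;
}

inline void exampleUsage()
{
   const int source[ 4 ] = { 1, 2, 3, 4 };
   double destination[ 4 ] = { 0, 0, 0, 0 };
   copySequential( destination, source, 4 );
   assert( compareSequential( destination, source, 4 ) );
   assert( containsValueSequential( source, 4, 3 ) );
}
```

Because the standard algorithms work element by element, the same fallback can serve arrays whose element types differ, which a byte-wise `memcpy` or `memcmp` cannot do correctly.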

ArrayOperations: added missing methods for the static/sequential specialization · f8c8673d
Jakub Klinkovský authored 5 years ago

f8c8673d
Benchmarks: added benchmarks for array copy and compare using memcpy and memcmp · 7a5840de
Jakub Klinkovský authored 5 years ago

7a5840de
Moved SystemInfo class out of the Devices namespace · dacc1711
Jakub Klinkovský authored 5 years ago

It has nothing to do with devices.

dacc1711
Cleaned up Devices::Cuda · e2ac7194
Jakub Klinkovský authored 5 years ago

e2ac7194

Removed duplicate TransferBufferSize constants · a1a054bf

Jakub Klinkovský authored 5 years ago

Also set the buffer size to 1 MiB, because larger buffer size slows down
memory copies significantly (e.g. MeshTest would take about 10x longer).

Addresses #26

a1a054bf

Moved atomicAdd function from Devices/Cuda.h into Atomic.h · 15b5e2c4
Jakub Klinkovský authored 5 years ago

15b5e2c4

Moved synchronization of smart pointers from Devices::Cuda into TNL::Pointers namespace as free functions · 1743358a

Jakub Klinkovský authored 5 years ago

synchronizeDevice() was renamed to synchronizeSmartPointersOnDevice()
for clarity - there are many similarly named functions in CUDA (e.g.
cudaDeviceSynchronize()).

1743358a

Moved (most of) static methods from TNL::Devices::Cuda as free functions into separate namespace TNL::Cuda · 2d5176fb

Jakub Klinkovský authored 5 years ago

The class TNL::Devices::Cuda was too bloated, breaking the Single
Responsibility Principle. It should be used only for template
specializations and other things common to all devices.

The functions in MemoryHelpers.h are deprecated, smart pointers should
be used instead.

The functions in LaunchHelpers.h are temporary, more refactoring is
needed with respect to execution policies and custom launch parameters.

2d5176fb

Oct 24, 2019
- Added default stream synchronizations after kernel launches in CudaReductionKernel.h · fed5d45c
  Jakub Klinkovský authored 5 years ago
  
  fed5d45c
- Fixed parseCommandLine after refactoring the getType function · 39dadccb
  Jakub Klinkovský authored 5 years ago
  
  39dadccb
- Reimplemented getType() function using typeid operator and removed useless getType() methods · 5910a5e8
  Jakub Klinkovský authored 5 years ago
  
  Fixes #46
  5910a5e8
- Removed custom implementation of std::make_unique which is available in STL since C++14 · 203ee514
  Jakub Klinkovský authored 5 years ago
  
  203ee514
- Removed useless operator<< for TNL::String · 826332e4
  Jakub Klinkovský authored 5 years ago
  
  The implementation for std::string (which is a base class of TNL::String) is perfectly sufficient.
  826332e4
- Refactoring VectorFieldVTKWriter · 6d17baa3
  Jakub Klinkovský authored 5 years ago
  
  Fixes #11
  6d17baa3
- Devices: replaced getDeviceType() with getType() · 4675fbdf
  Jakub Klinkovský authored 5 years ago
  
  4675fbdf
- Removed MIC support · e7880461
  Jakub Klinkovský authored 5 years ago
  
  e7880461
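
For the `getType()` reimplementation listed above (5910a5e8), a minimal sketch of a `typeid`-based type-name function might look as follows. The helper name `demangle`, the exact signature of `getType`, and the demangling step are assumptions. `typeid(T).name()` returns an implementation-defined string, so the sketch demangles it with `abi::__cxa_demangle` where that is available (GCC and Clang) and otherwise returns the raw name.

```cpp
#include <cassert>
#include <string>
#include <typeinfo>

#if defined( __GNUG__ )
   #include <cstdlib>
   #include <memory>
   #include <cxxabi.h>
#endif

// Hypothetical helper: turns an implementation-defined type name into a
// readable one where the Itanium C++ ABI demangler is available.
inline std::string demangle( const char* name )
{
#if defined( __GNUG__ )
   int status = 0;
   // __cxa_demangle returns a malloc-allocated buffer, hence std::free.
   std::unique_ptr< char, void ( * )( void* ) > result(
      abi::__cxa_demangle( name, nullptr, nullptr, &status ), std::free );
   return ( status == 0 && result ) ? result.get() : name;
#else
   return name;
#endif
}

// One generic function replaces hand-written getType() methods in each class.
template< typename T >
std::string getType()
{
   return demangle( typeid( T ).name() );
}

inline void exampleUsage()
{
   // typeid ignores top-level cv-qualifiers, so both calls give the same name.
   assert( getType< const int >() == getType< int >() );
   assert( getType< int >() != getType< double >() );
}
```

The exact strings are compiler-dependent, so code that parses them (such as command-line handling) should not rely on a particular spelling.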
Oct 05, 2019
- 2D MPI GPU method adjusted. · e162a57a
  Matouš Fencl authored 5 years ago
  
  e162a57a
Sep 28, 2019
- Fixed passing of Arrays by ArrayView. · ac305460
  Tomáš Oberhuber authored 5 years ago
  
  ac305460
Sep 25, 2019
- Fix 2D GPU neighbours. Version with Chess method and OpenMP FSM methods. · 008601ad
  Matouš Fencl authored 5 years ago and Tomáš Oberhuber committed 5 years ago
  
  008601ad
- Fixed saving with exceptions. · f2dc4517
  Tomáš Oberhuber authored 6 years ago
  
  f2dc4517
- deleting tnlDirectEikonalMethodsBase_impl.h · b42fa59a
  Matouš Fencl authored 6 years ago and Tomáš Oberhuber committed 5 years ago
  
  b42fa59a
- Refactoring · d4412d31
  Matouš Fencl authored 6 years ago and Tomáš Oberhuber committed 5 years ago
  
  d4412d31
- DeepCopy removed from CUDA · 7af752a5
  Matouš Fencl authored 6 years ago and Tomáš Oberhuber committed 5 years ago
  
  7af752a5
- 2D and 3D solvers extended with MPI (3D has issue on biggest mesh) · 204f7f1d
  Matouš Fencl authored 6 years ago and Tomáš Oberhuber committed 5 years ago
  
  204f7f1d
- First try to repair the installation error in 2D · df7abcac
  Matouš Fencl authored 6 years ago and Tomáš Oberhuber committed 5 years ago
  
  df7abcac
- 2D MPI cuda repaired · be284ee4
  Matouš Fencl authored 6 years ago and Tomáš Oberhuber committed 5 years ago
  
  be284ee4
- MPI implemented for CPU and GPU in 2D but meshFunction.template synchronize< Communicator >(); doesn't copy overlaps. · 5c15d04c
  Matouš Fencl authored 6 years ago and Tomáš Oberhuber committed 5 years ago
  
  5c15d04c
- MPI ready in tnlDirectEikonal* · 933cc22b
  Matouš Fencl authored 6 years ago and Tomáš Oberhuber committed 5 years ago
  
  933cc22b
- Changed int to IndexType · 97ee2879
  Matouš Fencl authored 6 years ago and Tomáš Oberhuber committed 5 years ago
  
  97ee2879
- Change of int to IndexType and preparations for OpenMPI. · 9f44ea1d
  Matouš Fencl authored 6 years ago and Tomáš Oberhuber committed 5 years ago
  
  9f44ea1d
Sep 20, 2019
- Fixed distributed scan without OpenMP · 552e90c4
  Jakub Klinkovský authored 5 years ago
  
  552e90c4
Sep 03, 2019
- Style changes in StaticArray · bb04a590
  Jakub Klinkovský authored 5 years ago
  
  bb04a590
- Fixed forwarding of arguments passed to StaticFor · db254ee8
  Jakub Klinkovský authored 5 years ago
  
  StaticArrayAssignment expects the arguments passed as reference.
  db254ee8
- Fixed StaticArrayTest · 7a9a3087
  Jakub Klinkovský authored 5 years ago
  
  7a9a3087
- Simplified functors in StaticArrayAssignment.h · ccf1aa07
  Jakub Klinkovský authored 5 years ago
  
  ccf1aa07
- Cleanup · 78d15fb0
  Jakub Klinkovský authored 5 years ago
  
  78d15fb0
- Fixed const-qualification of dataFetcher in CUDA reduction · 1c07e0af
  Jakub Klinkovský authored 5 years ago
  
  1c07e0af
Sep 02, 2019
- Editing documentation of static vector plus small changes in StaticArray. · c6b8fc06
  Tomáš Oberhuber authored 5 years ago
  
  c6b8fc06