Commits · 258f4d760ed9c155bbcd750e1792fe1f7f72ea9e · TNL / tnl-dev

Mar 02, 2020
- Added SlicedEllpack segments unit tests. · 258f4d76
  Tomáš Oberhuber authored 5 years ago and Tomáš Oberhuber committed 5 years ago
  
  258f4d76
- Added test for allReduction in Segments. · c8c5fc09
  Tomáš Oberhuber authored 5 years ago and Tomáš Oberhuber committed 5 years ago
  
  c8c5fc09
- Implementing unit tests for Segments. · 8bdceeae
  Tomáš Oberhuber authored 5 years ago
  
  8bdceeae
- Added Segments unit tests. · 47d413b3
  Tomáš Oberhuber authored 5 years ago
  
  47d413b3
Feb 29, 2020
- Preparation of tests for Vector-of-StaticVectors and StaticVector-of-StaticVectors · 80c57392
  Jakub Klinkovský authored 5 years ago
  
  80c57392
- Prepare tests for Vector of StaticVectors · 435c8ddf
  Jakub Klinkovský authored 5 years ago
  
  Tests don't pass yet...
  435c8ddf
Nov 08, 2019
- Renamed prefixSum methods to scan · afba52d9
  Jakub Klinkovský authored 5 years ago
  
  Closes #49
  afba52d9
- Removed HostType and CudaType aliases in containers, matrices and grids · d070cc39
  Jakub Klinkovský authored 5 years ago
  
  They are not suitable for more than 2 devices/execution types; their design breaks the Open-Closed Principle. Instead, a type template "Self" was created, which allows to change any template parameter.
  d070cc39
- Removed Containers::List because it has no benefits over std::list · 1b7361a9
  Jakub Klinkovský authored 5 years ago
  
  1b7361a9
Oct 25, 2019

Moved algorithms from TNL/Containers/Algorithms/ to just TNL/Algorithms/ · 399f9627

The usage of algorithms such as MemoryOperations or Reduction is not
bound to a particular container. On the other hand, ArrayIO,
ArrayAssignment, VectorAssignment and StaticArrayAssignment are just
implementation details for the containers - moved into
TNL/Containers/detail/

Also moved ParallelFor, StaticFor, StaticVectorFor, TemplateStaticFor
into TNL/Algorithms/

399f9627

Split ArrayOperations into MemoryOperations and MultiDeviceMemoryOperations · 57db358c
Jakub Klinkovský authored 5 years ago
```
This will be necessary to avoid code bloat with more than 2 devices
(execution types).
```
57db358c

Moved (most of) static methods from TNL::Devices::Cuda as free functions into... · 2d5176fb

Jakub Klinkovský authored 5 years ago

Moved (most of) static methods from TNL::Devices::Cuda as free functions into separate namespace TNL::Cuda

The class TNL::Devices::Cuda was too bloated, breaking the Single
Responsibility Principle. It should be used only for template
specializations and other things common to all devices.

The functions in MemoryHelpers.h are deprecated, smart pointers should
be used instead.

The functions in LaunchHelpers.h are temporary, more refactoring is
needed with respect to execution policies and custom launch parameters.

2d5176fb

Oct 24, 2019
- Reimplemented getType() function using typeid operator and removed useless getType() methods · 5910a5e8
  Jakub Klinkovský authored 5 years ago
  
  Fixes #46
  5910a5e8
- Removed MIC support · e7880461
  Jakub Klinkovský authored 5 years ago
  
  e7880461
Sep 03, 2019
- Style changes in StaticArray · bb04a590
  Jakub Klinkovský authored 5 years ago
  
  bb04a590
- Fixed StaticArrayTest · 7a9a3087
  Jakub Klinkovský authored 5 years ago
  
  7a9a3087
- Cleanup · 78d15fb0
  Jakub Klinkovský authored 5 years ago
  
  78d15fb0
Sep 02, 2019
- Implemented StaticArray::operator= accepting both arrays and single... · b91b41d8
  Tomáš Oberhuber authored 5 years ago
  
  Implemented StaticArray::operator= accepting both arrays and single StaticArray:::ValueType compatible type.
  b91b41d8
- Added test for StaticArray serialization/deserialization. · 02f481ee
  Tomáš Oberhuber authored 5 years ago
  
  02f481ee
- Renaming PrefixSum to Scan. · 92dc4a47
  Tomáš Oberhuber authored 5 years ago
  
  92dc4a47
Aug 27, 2019
- Avoiding compiler warnings for builds without CUDA · 7390a03b
  Jakub Klinkovský authored 5 years ago
  
  7390a03b
Aug 17, 2019
- Implemented distributed prefix-sum · d13a2d18
  Jakub Klinkovský authored 5 years ago
  
  Fixes #43
  d13a2d18
- Replaced static member variables in CudaPrefixSumKernelLauncher with static getters · 1fe62640
  Jakub Klinkovský authored 5 years ago
  
  1fe62640
- Removed volatile reduction from PrefixSum and updated the normal reduction operation · 8d0d2638
  Jakub Klinkovský authored 5 years ago
  
  Same changes as for the regular Reduction operation...
  8d0d2638
- Replaced custom lambda functions with instances of STL types where possible · 0a57393f
  Jakub Klinkovský authored 5 years ago
  
  0a57393f
- Changed reduction operation to use functions with `return a + b` instead of `a += b` · e20a0930
  Jakub Klinkovský authored 5 years ago
  
  This is nicer because it more clearly separates data load, computation and data store. Furthermore, it allows to use instances of std::plus, std::logical_and, std::logical_or, etc. instead of custom lambda functions.
  e20a0930
- Removed VectorOperations class which is now useless · d0fc1bb7
  Jakub Klinkovský authored 5 years ago
  
  It contained only methods for prefixSum and segmentedPrefixSum, which were identical for Host and Cuda, so they can be easily implemented directly in Vector and VectorView.
  d0fc1bb7
- Removed volatile reduction completely · 13b89a71
  Jakub Klinkovský authored 5 years ago
  
  13b89a71
- Rewritten multireduction with lambda functions · b74a24d2
  Jakub Klinkovský authored 5 years ago
  
  b74a24d2
- Tests: forced using a unique file name in each test · a3ba2469
  Jakub Klinkovský authored 5 years ago
  
  This is necessary to be able to run tests in parallel.
  a3ba2469
Aug 14, 2019
- Removed conditional per-device Permutation and SliceInfo setting from NDArray and SlicedNDArray · 55ded6ad
  Jakub Klinkovský authored 5 years ago
  
  55ded6ad
- Added templated assignment operators for NDArray and NDArrayView · fd4d8429
  Jakub Klinkovský authored 5 years ago
  
  It works for any value, device and index types, but the permutations of both arrays must be the same and both arrays have to be contiguous.
  fd4d8429
- DistributedNDArray: added tests for semi-1D distribution · 298dd421
  Jakub Klinkovský authored 5 years ago
  
  298dd421
- Added DistributedNDArraySynchronizer · 3934beaf
  Jakub Klinkovský authored 5 years ago
  
  3934beaf
- DistributedNDArray: added forOverlaps method · 0dd215e8
  Jakub Klinkovský authored 5 years ago
  
  0dd215e8
- DistributedNDArray: added forBoundary and forLocalBoundary methods · 82e0c538
  Jakub Klinkovský authored 5 years ago
  
  82e0c538
- DistributedNDArray: added forInternal and forLocalInternal methods · f9853a86
  Jakub Klinkovský authored 6 years ago
  
  f9853a86
- DistributedNDArray: added forAll method · 8b91dfcc
  Jakub Klinkovský authored 6 years ago
  
  8b91dfcc
- Basic implementation of the distributed NDArray · 07d933dc
  Jakub Klinkovský authored 6 years ago
  
  07d933dc
- NDArray: added forBoundary method · 6c8c608e
  Jakub Klinkovský authored 6 years ago
  
  6c8c608e