- Nov 26, 2019
  - Lukas Cejka authored
  - Lukas Cejka authored: Fixed the implementation of error reporting. All formats are now compared against cuSPARSE (each format and cuSPARSE are compared to the CPU result, not to each other, which would require changes to Benchmarks.h). Added the name of the .mtx file being tested to MetaDataColumns. Code reformatting.
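    A minimal sketch of the comparison scheme this describes (the function and variable names are illustrative, not taken from the benchmark sources): every GPU result, whether from a TNL format or from cuSPARSE, is checked against the same CPU reference, so the formats never need to be compared pairwise.

    ```cpp
    #include <algorithm>
    #include <cmath>
    #include <cstddef>
    #include <vector>

    // Largest absolute difference between a device result and the CPU
    // reference; each format is validated against the reference alone.
    double maxDifference( const std::vector< double >& result,
                          const std::vector< double >& reference )
    {
       double maxDiff = 0.0;
       for( std::size_t i = 0; i < result.size(); i++ )
          maxDiff = std::max( maxDiff, std::abs( result[ i ] - reference[ i ] ) );
       return maxDiff;
    }
    ```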
  - Lukas Cejka authored: Found a potential mistake in SpMV/spmv.h, where MatrixReader does not need to be called twice. Committing to show in the meeting.
  - Lukas Cejka authored: Implemented a rough version of result comparison. Implemented a benchmark comparing TNL CSR and cuSPARSE on the GPU. Edited the log file formatting.
  - Lukas Cejka authored
  - Lukas Cejka authored: Made the benchmark write the output of MatrixReader into the log file. BUG: every other error message added into the Benchmark lacks the '!' prefix in the log file.
  - Lukas Cejka authored
  - Lukas Cejka authored
  - Lukáš Matthew Čejka authored
  - Lukas Cejka authored
  - Lukas Cejka authored
  - Lukas Cejka authored
  - Lukas Cejka authored
  - Lukas Cejka authored: Copied the tnl-benchmark-spmv files and spmv.h from BLAS to SpMV. Deleted min/max size and stepFactor. Not working yet; committed for backup purposes.
- Nov 08, 2019
  - Jakub Klinkovský authored
  - Jakub Klinkovský authored: Closes #49
  - Jakub Klinkovský authored: They are not suitable for more than two devices/execution types, and their design breaks the Open-Closed Principle. Instead, a type template "Self" was created, which allows changing any template parameter.
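    A minimal sketch of the "Self" pattern, assuming a hypothetical container with a simplified parameter list (the real TNL templates have more parameters): a member alias template with defaulted arguments lets callers rebind any subset of the template parameters, e.g. to obtain the host analogue of a CUDA type without maintaining a fixed HostType/CudaType alias pair per device.

    ```cpp
    #include <cstddef>
    #include <type_traits>

    // Illustrative device tags, standing in for TNL's device types.
    struct Host {};
    struct Cuda {};

    // Illustrative container, not the actual TNL declaration.
    template< typename Value, typename Device, typename Index = std::size_t >
    struct Array
    {
       // Rebinding alias: the same class template with any subset of the
       // template parameters replaced; the defaults keep the current ones.
       template< typename Value_ = Value,
                 typename Device_ = Device,
                 typename Index_ = Index >
       using Self = Array< Value_, Device_, Index_ >;
    };

    // Rebind only the device: supporting a new device type requires no
    // change to Array itself, which is the Open-Closed point above.
    using CudaArray = Array< double, Cuda >;
    using HostArray = CudaArray::Self< double, Host >;
    static_assert( std::is_same< HostArray, Array< double, Host > >::value, "" );
    ```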
  - Jakub Klinkovský authored
- Oct 25, 2019
  - Jakub Klinkovský authored: The usage of algorithms such as MemoryOperations or Reduction is not bound to a particular container. On the other hand, ArrayIO, ArrayAssignment, VectorAssignment and StaticArrayAssignment are just implementation details of the containers, so they were moved into TNL/Containers/detail/. Also moved ParallelFor, StaticFor, StaticVectorFor and TemplateStaticFor into TNL/Algorithms/.
  - Jakub Klinkovský authored
  - Jakub Klinkovský authored: It has nothing to do with devices.
  - Jakub Klinkovský authored: Moved the synchronization of smart pointers from Devices::Cuda into the TNL::Pointers namespace as free functions. synchronizeDevice() was renamed to synchronizeSmartPointersOnDevice() for clarity, since there are many similarly named functions in CUDA (e.g. cudaDeviceSynchronize()).
  - Jakub Klinkovský authored: Moved (most of) the static methods from TNL::Devices::Cuda into the separate namespace TNL::Cuda as free functions. The class TNL::Devices::Cuda was too bloated, breaking the Single Responsibility Principle; it should be used only for template specializations and other things common to all devices. The functions in MemoryHelpers.h are deprecated; smart pointers should be used instead. The functions in LaunchHelpers.h are temporary; more refactoring is needed with respect to execution policies and custom launch parameters.
- Oct 24, 2019
  - Jakub Klinkovský authored: Fixes #46
  - Jakub Klinkovský authored
- Sep 03, 2019
  - Jakub Klinkovský authored
- Sep 02, 2019
  - Tomáš Oberhuber authored
- Aug 27, 2019
  - Jakub Klinkovský authored
- Aug 24, 2019
  - Jakub Klinkovský authored
- Aug 17, 2019
  - Jakub Klinkovský authored
  - Jakub Klinkovský authored
  - Jakub Klinkovský authored: Benchmarks can be easily profiled even without this parameter, so it was just an unnecessary complication.
  - Jakub Klinkovský authored
  - Jakub Klinkovský authored
  - Jakub Klinkovský authored
  - Jakub Klinkovský authored: This is nicer because it more clearly separates the data load, the computation, and the data store. Furthermore, it allows using instances of std::plus, std::logical_and, std::logical_or, etc. instead of custom lambda functions.
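    The split can be illustrated with a plain sequential sketch (this is not the actual TNL signature, just the fetch/reduce separation described above): the fetch functor loads and transforms one element, the reduce functor only combines partial results, so ordinary functors like std::plus<> drop in directly.

    ```cpp
    #include <cstddef>
    #include <functional>
    #include <iostream>
    #include <vector>

    // Generic reduction skeleton: 'fetch' handles the data load,
    // 'reduce' the computation, and the caller the final store.
    template< typename Index, typename Fetch, typename Reduce, typename Result >
    Result reduceSequential( Index begin, Index end,
                             Fetch&& fetch, Reduce&& reduce, Result identity )
    {
       Result result = identity;
       for( Index i = begin; i < end; i++ )
          result = reduce( result, fetch( i ) );
       return result;
    }

    int main()
    {
       std::vector< double > v{ 1, 2, 3, 4 };
       // Sum of squares: the lambda only fetches/transforms the data and
       // std::plus<> does the combining, so no custom reduction lambda.
       const double sum = reduceSequential( std::size_t( 0 ), v.size(),
                                            [ & ]( std::size_t i ) { return v[ i ] * v[ i ]; },
                                            std::plus<>{}, 0.0 );
       std::cout << sum << std::endl;   // prints 30
    }
    ```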
  - Jakub Klinkovský authored: It contained only methods for prefixSum and segmentedPrefixSum, which were identical for Host and Cuda, so they could easily be implemented directly in Vector and VectorView.
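    Usage therefore stays on the vector itself; a hedged example (the method name comes from the commit message, while the constructor and setter calls are assumptions about the contemporary TNL API):

    ```cpp
    #include <TNL/Containers/Vector.h>

    using namespace TNL;

    int main()
    {
       Containers::Vector< double, Devices::Host > v( 5 );
       v.setValue( 1.0 );   // assumed setter: fill with ones
       v.prefixSum();       // in-place prefix sum: 1 2 3 4 5
    }
    ```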
  - Jakub Klinkovský authored
- Aug 14, 2019
  - Jakub Klinkovský authored
  - Jakub Klinkovský authored
  - Jakub Klinkovský authored