ArrayOperations: using more parallel algorithms and suitable sequential fallbacks (986e25fc) · Commits · TNL / tnl-dev

Commit 986e25fc authored Aug 22, 2019 by

Jakub Klinkovský

ArrayOperations: using more parallel algorithms and suitable sequential fallbacks

- cudaMemcpy is slower than our ParallelFor kernel for CUDA
- use std::copy and std::equal instead of memcpy and memcmp, but only as
  sequential fallbacks
- use parallel algorithms for containsValue and containsOnlyValue (again
  with sequential fallbacks)

parent f8c8673d

Hide whitespace changes

Inline Side-by-side

Please register or to comment