There was an error fetching the commit references. Please try again later.
Replaced explicit algorithms for host in VectorOperations with general implementation
According to benchmarks, there is practically no difference in performance. Only explicit unrolling is helpful, but that has been implemented for the general algorithm in Reduction::reduce as well.
Showing
- CMakeLists.txt 0 additions, 4 deletionsCMakeLists.txt
- build 0 additions, 3 deletionsbuild
- src/Benchmarks/BLAS/vector-operations.h 0 additions, 82 deletionssrc/Benchmarks/BLAS/vector-operations.h
- src/TNL/Containers/Algorithms/VectorOperationsHost_impl.h 94 additions, 244 deletionssrc/TNL/Containers/Algorithms/VectorOperationsHost_impl.h
Loading
Please register or sign in to comment