Skip to content
Commit cbc2fff9 authored by Jakub Klinkovský's avatar Jakub Klinkovský
Browse files

Found a way to avoid using volatile in CUDA reduction: __syncwarp()

The performance seems to be identical to the code using volatile.
parent b74a24d2
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please register or to comment