Skip to content
Commit cbc2fff9 authored by Jakub Klinkovský's avatar Jakub Klinkovský
Browse files

Found a way to avoid using volatile in CUDA reduction: __syncwarp()

The performance seems to be identical to the code using volatile.
parent b74a24d2
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment