This project is archived. Its data is
read-only
.
Commit
cbc2fff9
authored
Aug 11, 2019
by
Jakub Klinkovský
Browse files
Found a way to avoid using volatile in CUDA reduction: __syncwarp()
The performance seems to be identical to the code using volatile.
parent
b74a24d2
Loading
Loading
Loading
Changes
4
Expand all
Show whitespace changes
Inline
Side-by-side
Loading
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
sign in
to comment