Commit a4e15b08 authored by Jakub Klinkovský

Refactored parallel OpenMP scan

The first phase performs only per-block reduction, not scan. The output
array elements are written only in the second phase, so overall we
perform only `n` instead of `2n` write operations.
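
The two phases described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code from this commit: the function name `inclusive_scan_two_phase`, the `num_blocks` parameter, the `long` element type, and the choice of an inclusive sum scan are all hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not taken from the commit): inclusive prefix sum
// computed in two parallel phases over `num_blocks` contiguous blocks.
std::vector<long> inclusive_scan_two_phase(const std::vector<long>& in, int num_blocks)
{
    assert(num_blocks > 0);
    const std::ptrdiff_t n = static_cast<std::ptrdiff_t>(in.size());
    std::vector<long> out(in.size());
    if (n == 0)
        return out;

    const std::ptrdiff_t block_size = (n + num_blocks - 1) / num_blocks;

    // block_offsets[b] will hold the sum of all elements before block b.
    std::vector<long> block_offsets(num_blocks + 1, 0);

    // Phase 1: per-block reduction only. Nothing is written to `out` here,
    // each block stores just a single partial sum.
    #pragma omp parallel for
    for (int b = 0; b < num_blocks; b++) {
        const std::ptrdiff_t begin = std::min(n, b * block_size);
        const std::ptrdiff_t end = std::min(n, begin + block_size);
        long sum = 0;
        for (std::ptrdiff_t i = begin; i < end; i++)
            sum += in[i];
        block_offsets[b + 1] = sum;
    }

    // Sequential step: scan the (few) block sums to get each block's offset.
    for (int b = 1; b <= num_blocks; b++)
        block_offsets[b] += block_offsets[b - 1];

    // Phase 2: each block scans its elements starting from its offset.
    // This is the only place where the n output elements are written.
    #pragma omp parallel for
    for (int b = 0; b < num_blocks; b++) {
        const std::ptrdiff_t begin = std::min(n, b * block_size);
        const std::ptrdiff_t end = std::min(n, begin + block_size);
        long running = block_offsets[b];
        for (std::ptrdiff_t i = begin; i < end; i++) {
            running += in[i];
            out[i] = running;
        }
    }
    return out;
}
```

In this sketch the input is read twice, but `out` is written exactly once per element. A variant that scans each block in the first phase and adds the block offsets in the second would write every output element twice. Without OpenMP enabled, the pragmas are ignored and the function runs sequentially with the same result.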
parent 63d567e4