I vaguely remember that not every reduce or scan algorithm uses ::cuda::std::__accumulator_t to determine the accumulator type to use. We should consolidate this behavior.
I had another look and realized that the C++ standard determines the accumulator type to be either the iterator's value type or the initial value's type. So the divergence between CUB and Thrust seems fine.
This is also relevant to SIMD reduction: using cuda::std::plus<> vs. cuda::std::plus<T> can affect performance. E.g., cuda::std::plus<> applied to int16_t operands induces implicit integer promotion to int, which disables SIMD.