You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I assume that we checked SASS only on default set of architectures from our preset "60;70;80".
It may be even worse. I cannot find what exact cmake invocation I used, but it could have been just 86. I will revisit your table with all architectures touched by each PR.
bernhardmgruber
changed the title
[BUG]: Validate tuning refactoring on SM90
[BUG]: Validate tuning refactoring on all affected architectures
Jan 7, 2025
bernhardmgruber
changed the title
[BUG]: Validate tuning refactoring on all affected architectures
[BUG]: Validate tuning refactoring on all affected SM versions
Jan 7, 2025
Is this a duplicate?
Type of Bug
Performance
Component
CUB
Describe the bug
#3138 introduced 120% performance regression in
cub::DeviceScan::Sum
on Hopper.Looking at PR description:
I assume that we checked SASS only on default set of architectures from our preset "60;70;80".
Mentioned refactoring also touches SM90 policies, which results in the following diff:
We should verify SM90 SASS differences on other algorithms we refactored lately.
How to Reproduce
Expected behavior
No SASS difference on any architecture after tuning refactoring
Reproduction link
No response
Operating System
No response
nvidia-smi output
No response
NVCC version
No response
The text was updated successfully, but these errors were encountered: