Skip to content

Pull requests: NVIDIA/cutlass

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

correct BLayout stride in SM80 m16n8k32 int4 MMA traits
#3140 opened Apr 1, 2026 by zfmmmm Loading…
Fix Hopper FMHA performance regression on CUDA < 13.1
#3137 opened Mar 31, 2026 by arvin-chou Loading…
5 of 6 tasks
Fix elementwise_apply.py
#3129 opened Mar 25, 2026 by HydraQYH Loading…
Enable strict C++ compiler warnings with -Werror
#3123 opened Mar 22, 2026 by maxwbuckley Loading…
3 of 4 tasks
fix: Handle zero-stride TMA basis in fill_tma_gmem_shape_stride
#3121 opened Mar 20, 2026 by RobTand Loading…
2 tasks
[bugfix] use acquire to prevent reordering.
#3118 opened Mar 20, 2026 by shubaoyu2 Loading…
Fix typo in elementwise_add.py
#3116 opened Mar 20, 2026 by HydraQYH Loading…
Add FlashMoE Publication
#3115 opened Mar 20, 2026 by osayamenja Loading…
[docs] Fix same typo
#3098 opened Mar 9, 2026 by lhtin Loading…
WIP: OSS CI Testing for v4.4
#3093 opened Mar 7, 2026 by zekunf-nv Loading…
Add dlopen-based dynamic kernel loading for profiler
#3088 opened Mar 3, 2026 by Wazrrr Loading…
ProTip! Filter pull requests by the default branch with base:main.