Skip to content

Pull requests: NVIDIA/cutlass

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[bugfix] use acquire to prevent reordering.
#3118 opened Mar 20, 2026 by shubaoyu2 Loading…
Fix typo in elementwise_add.py
#3116 opened Mar 20, 2026 by HydraQYH Loading…
Add FlashMoE Publication
#3115 opened Mar 20, 2026 by osayamenja Loading…
Add CuTe DSL JAX demo
#3103 opened Mar 13, 2026 by katjasrz Loading…
[docs] Fix same typo
#3098 opened Mar 9, 2026 by lhtin Loading…
WIP: OSS CI Testing for v4.4
#3093 opened Mar 7, 2026 by zekunf-nv Loading…
Add dlopen-based dynamic kernel loading for profiler
#3088 opened Mar 3, 2026 by Wazrrr Loading…
[CuTeDSL] Fix: remove redundant Float8E4M3.
#3067 opened Feb 25, 2026 by Peter9606 Loading…
[CuTeDSL] Add BF16 grouped GEMM example for Hopper SM90
#3060 opened Feb 23, 2026 by vruga Loading…
Use unrounded inputs for the profiler by default
#3053 opened Feb 22, 2026 by saagarjha Loading…
docs: Fix IDE setup guide for VSCode and clangd
#3052 opened Feb 22, 2026 by bledden Loading…
2 tasks
ProTip! Mix and match filters to narrow down what you’re looking for.