Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

Commit 005a4b9

Browse filesBrowse files
drisspgpytorchmergebot
authored andcommitted
[Submodule] Bump Cutlass to 3.5.1 OSS PR (#144000)
## Summary Follow up PR to #143515. That PR added a bunch of macro switches to ensure both 3.4 and 3.5.1 built succesfully. This PR actual bumps the cutlass pin to 3.5.1. I am going to do a stack on top to add an conditional gates for 3.6 hijacking the 3.4 switches. We will leap frog our way to the top :) Pull Request resolved: #144000 Approved by: https://github.com/Skylion007, https://github.com/eqy, https://github.com/malfet
1 parent 93633d0 commit 005a4b9
Copy full SHA for 005a4b9

File tree

Expand file treeCollapse file tree

2 files changed

+8
-2
lines changed
Filter options
Expand file treeCollapse file tree

2 files changed

+8
-2
lines changed

‎aten/src/ATen/native/cuda/RowwiseScaledMM.cu

Copy file name to clipboardExpand all lines: aten/src/ATen/native/cuda/RowwiseScaledMM.cu
+7-1Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,9 +3,15 @@
33
#include <ATen/core/Tensor.h>
44
#include <ATen/cuda/CUDAContext.h>
55
#include <ATen/cuda/nvrtc_stub/ATenNVRTC.h>
6+
#include <c10/macros/Macros.h>
7+
8+
// Two warninngs in Cutlass included header files
9+
C10_DIAGNOSTIC_PUSH_AND_IGNORED_IF_DEFINED("-Wset-but-not-used")
10+
C10_DIAGNOSTIC_PUSH_AND_IGNORED_IF_DEFINED("-Wunused-but-set-parameter")
611

712
// Determine if the architecture supports rowwise scaled mm
8-
// Currenlty failing on windows with: https://github.com/NVIDIA/cutlass/issues/1571
13+
// Currently failing on windows with:
14+
// https://github.com/NVIDIA/cutlass/issues/1571
915
#if !defined(USE_ROCM) && !defined(_WIN32) && defined(CUDA_VERSION) && CUDA_VERSION >= 12000
1016

1117
#define BUILD_ROWWISE_FP8_KERNEL

‎third_party/cutlass

Copy file name to clipboard
Submodule cutlass updated 696 files

0 commit comments

Comments
0 (0)
Morty Proxy This is a proxified and sanitized view of the page, visit original site.