Fix performance regression in cutlass backward pass since 0.0.25 (#1177)
Fixes #1176
danthe3rd authored Dec 17, 2024
1 parent 3d947a6 commit b17d794
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion xformers/ops/fmha/cutlass.py
@@ -436,7 +436,7 @@ def apply(cls, ctx: Context, inp: Inputs, grad: torch.Tensor) -> Gradients:
philox_offset=rng_offset,
custom_mask_type=_custom_mask_type(inp.attn_bias),
scale=inp.scale,
- num_splits_key=-1, # Let C++ determine it
+ num_splits_key=None, # Let C++ determine it
window_size=(
inp.attn_bias._window_size
if isinstance(
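To illustrate why replacing a `-1` sentinel with `None` can change behaviour, here is a minimal sketch of the general pattern. It is not the xformers C++ code: the helper `resolve_num_splits`, its parameters `num_keys` and `block_size`, the clamping rule, and the heuristic are all hypothetical and chosen only for illustration.

```python
from typing import Optional


def resolve_num_splits(
    num_splits_key: Optional[int], num_keys: int, block_size: int
) -> int:
    """Hypothetical helper (NOT xformers code): pick a key-split count."""
    # Upper bound: one split per key block (ceiling division).
    max_splits = -(-num_keys // block_size)
    if num_splits_key is None:
        # No explicit request: the callee applies its own heuristic.
        # Assumption: the heuristic here is simply "use every block".
        return max(1, max_splits)
    # An explicit integer is taken at face value and clamped to [1, max_splits].
    # Under this assumed rule, -1 collapses to 1, i.e. no splitting at all.
    return max(1, min(num_splits_key, max_splits))


print(resolve_num_splits(None, num_keys=1024, block_size=128))
print(resolve_num_splits(-1, num_keys=1024, block_size=128))
```

Under these assumptions, only `None` reaches the heuristic branch, while `-1` is handled as an explicit (and very small) request. Whether the real kernel treats `-1` this way is not stated in this commit.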
