You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Could anyone provide a good example of using key_padding_mask for flash attention v2?
key_padding_mask is used to mask out the keys (shape [B, 1, 1, N])
Is it possible to use key_padding_mask and causal mask together?
The text was updated successfully, but these errors were encountered:
Could anyone provide a good example of using key_padding_mask for flash attention v2?
key_padding_mask is used to mask out the keys (shape [B, 1, 1, N])
Is it possible to use key_padding_mask and causal mask together?
The text was updated successfully, but these errors were encountered: