-
Notifications
You must be signed in to change notification settings - Fork 144
Pull requests: facebookresearch/generative-recommenders
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
update block size for standalone_cint_v4
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
#145
opened Nov 22, 2024 by
zhaozhul
Loading…
standalone_cint_v4
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
#144
opened Nov 22, 2024 by
zhaozhul
Loading…
loop unroll for hstu attn bwd
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
#143
opened Nov 21, 2024 by
LinjianMa
Loading…
Convert directory fbcode/hammer to use the Ruff Formatter
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
#142
opened Nov 20, 2024 by
tpolasek
Loading…
Prepare for "Fix type-safety of This label is managed by the Meta Open Source bot.
fb-exported
torch.nn.Module
instances": fbcode/h*
CLA Signed
#141
opened Nov 20, 2024 by
ezyang
Loading…
Add complete_cumsum cpu and meta ops
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
#140
opened Nov 20, 2024 by
jiyuanzFB
Loading…
add pytorch implementations for jagged operations
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
#137
opened Nov 19, 2024 by
zhaozhul
Loading…
Replace Triton addmm withThis label is managed by the Meta Open Source bot.
fb-exported
torch.addmm
for AMD to achieve better training performance
CLA Signed
#133
opened Nov 19, 2024 by
yoyoyocmu
Loading…
Change autotune key for ragged_hstu_attention to support dynamic batch size
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
#132
opened Nov 18, 2024 by
AlbertDachiChen
Loading…
Bug fix: detecting contextual length in triton hstu attn code
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
#130
opened Nov 18, 2024 by
lic225
Loading…
Fix type-safety of This label is managed by the Meta Open Source bot.
fb-exported
torch.nn.Module
instances
CLA Signed
#129
opened Nov 18, 2024 by
ezyang
Loading…
move sorted_kv_pairs from hammer/ops/cuda/ to hammer/ops/cpp/
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
#128
opened Nov 18, 2024 by
jiyuanzFB
Loading…
copy complete_cumsum kernel to ops/cpp/
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
#127
opened Nov 15, 2024 by
jiyuanzFB
Loading…
Redefine FBGEMM targets with gpu_cpp_library (Re-land attempt of D64863809)
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
#126
opened Nov 15, 2024 by
q10
Loading…
num_stages=0 becomes num_stages=2
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
#125
opened Nov 14, 2024 by
nmacchioni
Loading…
Fix max_attn_len numerical
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
#124
opened Nov 14, 2024 by
hanli0612
Loading…
: Add permute_4d_jagged_2013 cufa kernel
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
#113
opened Nov 6, 2024 by
xing-liu
Loading…
Enable HSTU SMEM for TW and PW with autotuning for both SMEM preload and maxnregs
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
#92
opened Oct 7, 2024 by
plotfi
Loading…
[Triton SMEM] Add not-yet-landed usage of Triton SMEM feature with autotuning
CLA Signed
This label is managed by the Meta Open Source bot.
[WIP] TMA Version of HSTU (Autotuned)
CLA Signed
This label is managed by the Meta Open Source bot.
#71
opened Aug 21, 2024 by
plotfi
Loading…
TMA version of hstu
CLA Signed
This label is managed by the Meta Open Source bot.
#57
opened Jul 24, 2024 by
manman-ren
•
Draft
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.