jt-zhang

Follow

Jintao Zhang jt-zhang

Follow

DBGroup, Tsinghua University

41 followers · 40 following

@thu-ml, Tsinghua University
Beijing, China
https://jt-zhang.github.io/

Achievements

Achievements

Highlights

Pro

Organizations

Pinned Loading

thu-ml/SageAttention thu-ml/SageAttention Public

Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossing end-to-end metrics across various models.

Cuda 525 24