Hi @mrwyattii @jeffra, I believe DeepSpeed-MII could be sped up significantly further by implementing speculative decoding on top of the great framework presented here. Speculative decoding is widely reported to give a ~2x inference speedup without changing the model's output distribution, based on research from Google and DeepMind.
Paper: https://arxiv.org/abs/2211.17192
Code (not original): https://github.com/jaymody/speculative-sampling
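
For anyone unfamiliar with the idea, here is a minimal sketch of one speculative step as I understand it from the paper: a cheap draft model proposes `k` tokens, the target model scores them, and each proposal is accepted with probability min(1, p/q) or replaced by a sample from the residual distribution. Everything below is hypothetical and for illustration only: `toy_target`, `toy_draft`, and `speculative_step` are made-up names, the "models" are hand-written probability tables, and none of this is DeepSpeed-MII API.

```python
import random

VOCAB = 4  # toy vocabulary size (assumption for this sketch)


def toy_target(tokens):
    """Hypothetical 'large' model: strongly prefers (last + 1) % VOCAB."""
    last = tokens[-1] if tokens else 0
    probs = [0.1] * VOCAB
    probs[(last + 1) % VOCAB] = 0.7
    return probs


def toy_draft(tokens):
    """Hypothetical 'small' model: same preference, but less confident."""
    last = tokens[-1] if tokens else 0
    probs = [0.2] * VOCAB
    probs[(last + 1) % VOCAB] = 0.4
    return probs


def sample(weights, rng):
    """Draw one token index; weights do not need to be normalized."""
    return rng.choices(range(len(weights)), weights=weights, k=1)[0]


def speculative_step(prefix, target_probs, draft_probs, k, rng):
    """Return the tokens to append to prefix (between 1 and k + 1 of them)."""
    # 1. The draft model proposes k tokens autoregressively.
    drafted, q_dists = [], []
    for _ in range(k):
        q = draft_probs(prefix + drafted)
        drafted.append(sample(q, rng))
        q_dists.append(q)

    # 2. The target model scores all k + 1 positions. A real system would do
    #    this in a single batched forward pass, which is where the speedup
    #    comes from; here it is just a loop over the toy function.
    p_dists = [target_probs(prefix + drafted[:i]) for i in range(k + 1)]

    # 3. Accept each drafted token with probability min(1, p / q).
    accepted = []
    for i, tok in enumerate(drafted):
        p, q = p_dists[i], q_dists[i]
        if rng.random() < min(1.0, p[tok] / q[tok]):
            accepted.append(tok)
            continue
        # Rejected: resample from the residual max(0, p - q) and stop.
        residual = [max(0.0, pi - qi) for pi, qi in zip(p, q)]
        accepted.append(sample(residual, rng))
        return accepted

    # 4. Every draft was accepted: take one extra token from the target.
    accepted.append(sample(p_dists[k], rng))
    return accepted


rng = random.Random(0)
tokens = [0]
while len(tokens) < 20:
    tokens += speculative_step(tokens, toy_target, toy_draft, k=4, rng=rng)
print(tokens)
```

The accept/reject rule is what keeps the output distributed as if it had been sampled from the target model alone, so the speedup depends only on how often the draft model's guesses are accepted.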