Skip to content
View kssteven418's full-sized avatar

Highlights

  • Pro

Block or report kssteven418

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. SqueezeAILab/LLMCompiler SqueezeAILab/LLMCompiler Public

    [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

    Python 1.5k 109

  2. SqueezeAILab/SqueezeLLM SqueezeAILab/SqueezeLLM Public

    [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

    Python 642 43

  3. Squeezeformer Squeezeformer Public

    [NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

    Python 245 19

  4. I-BERT I-BERT Public

    [ICML'21 Oral] I-BERT: Integer-only BERT Quantization

    Python 226 32

  5. LTP LTP Public

    [KDD'22] Learned Token Pruning for Transformers

    Python 93 17

  6. BigLittleDecoder BigLittleDecoder Public

    [NeurIPS'23] Speculative Decoding with Big Little Decoder

    Python 85 10