EECS Ph.D. @ UC Berkeley
-
UC Berkeley
- Berkeley, CA, US
-
03:41
(UTC -08:00) - https://happierpig.github.io/
- in/yilong-zhao-162151279
Highlights
- Pro
Pinned Loading
-
efeslab/Atom
efeslab/Atom Public[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
-
efeslab/Nanoflow
efeslab/Nanoflow PublicA throughput-oriented high-performance serving framework for LLMs
-
mit-han-lab/Quest
mit-han-lab/Quest Public[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.