Medusa: Accelerating Serverless LLM Inference with Materialization

This repository contains the source code of the modified PyTorch used by Medusa (ASPLOS'25).