    Repositories list

    • Python
      0221 Updated Dec 11, 2024
    • nyuntam

      Public
      Python
      GNU Affero General Public License v3.0
      1271372 Updated Dec 2, 2024
    • Python
      GNU Affero General Public License v3.0
      1731 Updated Oct 28, 2024
    • Python
      GNU Affero General Public License v3.0
      01001 Updated Oct 25, 2024
    • lmquant

      Public
      Python
      Apache License 2.0
      0201 Updated Oct 25, 2024
    • Python
      GNU Affero General Public License v3.0
      0800 Updated Oct 25, 2024
    • PatchGD

      Public
      Python
      0300 Updated Sep 5, 2024
    • This is the official documentation for nyuntam
      Python
      0300 Updated Sep 4, 2024
    • C++
      Apache License 2.0
      0000 Updated Aug 22, 2024
    • qserve

      Public
      QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
      Python
      Apache License 2.0
      26000 Updated Aug 2, 2024
    • FLAP

      Public
      Patch for Grouped Query Attention
      Python
      Apache License 2.0
      10001 Updated Aug 2, 2024
    • AQLM

      Public
      Official PyTorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf
      Python
      Apache License 2.0
      179000 Updated Aug 1, 2024
    • Python
      0000 Updated Jul 1, 2024
    • SFSD-LLM

      Public
      Python
      1500 Updated May 31, 2024
    • PruneGPT

      Public
      Python
      35300 Updated May 31, 2024
    • Python
      74030 Updated Apr 23, 2024