Skip to content
@Zyphra

Zyphra

Popular repositories Loading

  1. BlackMamba BlackMamba Public

    Code repository for Black Mamba

    Python 232 18

  2. Zamba2 Zamba2 Public

    PyTorch implementation of models from the Zamba2 series.

    Python 164 17

  3. tree_attention tree_attention Public

    Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

    Python 108 5

  4. transformers_zamba2 transformers_zamba2 Public

    Python 41 1

  5. Zyda_processing Zyda_processing Public

    Python 28 1

  6. zcookbook zcookbook Public

    Training hybrid models for dummies.

    Python 15 1

Repositories

Showing 10 of 22 repositories
  • transformers_zamba Public Forked from huggingface/transformers

    🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

    Zyphra/transformers_zamba’s past year of commit activity
    Python 3 Apache-2.0 27,668 0 0 Updated Dec 7, 2024
  • tree_attention Public

    Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters

    Zyphra/tree_attention’s past year of commit activity
    Python 108 5 1 0 Updated Dec 3, 2024
  • Zyphra/transformers_zamba2’s past year of commit activity
    Python 41 Apache-2.0 1 6 0 Updated Nov 28, 2024
  • Zamba2 Public

    PyTorch implementation of models from the Zamba2 series.

    Zyphra/Zamba2’s past year of commit activity
    Python 164 Apache-2.0 17 1 1 Updated Nov 26, 2024
  • FastChat Public Forked from lm-sys/FastChat

    An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

    Zyphra/FastChat’s past year of commit activity
    Python 0 Apache-2.0 4,719 0 0 Updated Nov 6, 2024
  • zcookbook Public

    Training hybrid models for dummies.

    Zyphra/zcookbook’s past year of commit activity
    Python 15 Apache-2.0 1 0 0 Updated Oct 29, 2024
  • Megatron-LM Public Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    Zyphra/Megatron-LM’s past year of commit activity
    Python 0 2,478 10 4 Updated Aug 20, 2024
  • Megatron-DeepSpeed Public Forked from microsoft/Megatron-DeepSpeed

    Ongoing research training transformer language models at scale, including: BERT & GPT-2

    Zyphra/Megatron-DeepSpeed’s past year of commit activity
    Python 0 2,478 0 2 Updated Aug 19, 2024
  • flash-attention Public Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    Zyphra/flash-attention’s past year of commit activity
    Python 0 BSD-3-Clause 1,376 0 0 Updated Jul 8, 2024
  • Zamba-torch Public
    Zyphra/Zamba-torch’s past year of commit activity
    Python 6 Apache-2.0 1 0 0 Updated Jul 1, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…