Skip to content
View Poeroz's full-sized avatar
🪴
hard working
🪴
hard working

Highlights

  • Pro

Block or report Poeroz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. ictnlp/LLaMA-Omni ictnlp/LLaMA-Omni Public

    LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

    Python 2.6k 174

  2. ictnlp/DASpeech ictnlp/DASpeech Public

    Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".

    Python 60 5

  3. ictnlp/STEMM ictnlp/STEMM Public

    Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".

    Python 36 7

  4. ictnlp/ComSpeech ictnlp/ComSpeech Public

    Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".

    Python 23 6

  5. ictnlp/CRESS ictnlp/CRESS Public

    Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".

    Python 16 2

  6. ga642381/speech-trident ga642381/speech-trident Public

    Awesome speech/audio LLMs, representation learning, and codec models

    709 37