Skip to content
View shuo-ouyang's full-sized avatar
🎯
Focusing
🎯
Focusing
  • Xiaomi
  • Beijing

Highlights

  • Pro

Block or report shuo-ouyang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
shuo-ouyang/README.md

Focus on machine learning system and high-performance computing.


  • Languages: C++11/14, Python
  • Frameworks: TensorRT, PyTorch
  • Libraries: CUDA, CUB, thrust, cuBLAS, cuDLA

Pinned Loading

  1. apache/mxnet apache/mxnet Public archive

    Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

    C++ 20.8k 6.8k

  2. horovod/horovod horovod/horovod Public

    Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

    Python 14.3k 2.2k

  3. ps-lite ps-lite Public

    Forked from dmlc/ps-lite

    A lightweight parameter server interface

    C++

  4. OpenPPL/ppl.nn OpenPPL/ppl.nn Public

    A primitive library for neural network

    C++ 1.3k 217

  5. trt-hackathon-2022 trt-hackathon-2022 Public

    TensorRT Hackathon 2022 Final Competition

    Python 7 1