Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AITemplate - 심층 신경망을 고성능 CUDA/HIP C++ 코드로 변환하는 파이썬 프레임워크 #5

Open
YeonwooSung opened this issue Jul 10, 2023 · 0 comments

Comments

@YeonwooSung
Copy link
Contributor

AITemplate - 심층 신경망을 고성능 CUDA/HIP C++ 코드로 변환하는 파이썬 프레임워크

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

  • 빠른 추론을 위해 심층 신경망을 CUDA(NVIDIA GPU) / HIP(AMD GPU) C++ 코드로 전환
  • ResNet, BERT, VisionTransformer, Stable Diffusion 등 메어지 모델에서 루프라인 fp16 TensorCode/MatrixCore 성능에 근접
  • Unified, Open, Flexible 한 오픈소스
  • 훌륭한 하위호환성(써드파티 라이브러리/런타임 의존 없음). 각 모델이 포터블 바이너리로 컴파일
  • Horizontal Fusion / Vertical Fusion / Memory Fusion
  • Working w/wo PyTorch
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant