TensorRT対応 #26

sanamiy · 2024-09-12T01:40:40Z

NVIDIA GPUでの推論をより早く軽くしたい

TensorRT 10.4 supports operators in the inclusive range of opset 9 to opset 20
https://github.com/onnx/onnx-tensorrt/blob/10.4-GA/docs/operators.md

tuna2134 · 2024-09-12T02:42:55Z

Googlefanさんが昨日話していたのですが、対応が難しいのとCUDAがあるので十分早いのではないかっていう話があります。ただ挑戦する価値はあると思います。

Googlefan256 · 2024-09-12T02:55:06Z

参考程度に4060tiでは1秒で25回短いテキストを推論できます
ただもっと早くするロマンはありますよね...!

Googlefan256 · 2024-09-12T03:40:57Z

bertのほうはtrt対応できました。
本体はほかの人に託します...

tuna2134 added enhancement New feature or request 優先度　低 labels Sep 12, 2024

Googlefan256 mentioned this issue Sep 12, 2024

tensorrt partial support #28

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TensorRT対応 #26

TensorRT対応 #26

sanamiy commented Sep 12, 2024 •

edited

Loading

tuna2134 commented Sep 12, 2024 •

edited

Loading

Googlefan256 commented Sep 12, 2024

Googlefan256 commented Sep 12, 2024

TensorRT対応 #26

TensorRT対応 #26

Comments

sanamiy commented Sep 12, 2024 • edited Loading

tuna2134 commented Sep 12, 2024 • edited Loading

Googlefan256 commented Sep 12, 2024

Googlefan256 commented Sep 12, 2024

sanamiy commented Sep 12, 2024 •

edited

Loading

tuna2134 commented Sep 12, 2024 •

edited

Loading