-
I think you should be able to with some slight modifications to TorchServe. Basically you just need to load the TRTorch runtime in your handler and also ensure that the input size to your model is in the range specified at compile time. I tried creating a .mar file from a TRTorch-compiled TorchScript file with the torch-model-archiver tool and that seemed to work. Let us know if you try it and whether you are successful or hit any issues, we'd love to hear about it.
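For reference, here is a minimal sketch of what such a handler could look like. The class name ModelHandler and the file name model_trt.ts are placeholders, and trtorch is imported purely so the TensorRT runtime ops are registered before the TorchScript module is deserialized (use torch_tensorrt in newer releases):

```python
import os

import torch
import trtorch  # noqa: F401  # loads the TRTorch/TensorRT runtime ops (use `torch_tensorrt` in newer releases)

from ts.torch_handler.base_handler import BaseHandler


class ModelHandler(BaseHandler):
    """Serves a TRTorch-compiled TorchScript module (placeholder names)."""

    def initialize(self, context):
        properties = context.system_properties
        model_dir = properties.get("model_dir")
        self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
        # The compiled program is plain TorchScript, so torch.jit.load works
        # once the runtime extension above has been imported.
        self.model = torch.jit.load(
            os.path.join(model_dir, "model_trt.ts"), map_location=self.device
        )
        self.model.eval()
        self.initialized = True

    def inference(self, data, *args, **kwargs):
        # Input shapes must stay inside the range given at compile time.
        with torch.no_grad():
            return self.model(data.to(self.device))
```

The compiled .ts file and this handler can then be packaged with torch-model-archiver the same way as any other TorchScript model.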
-
Thanks for the heads up. I'll close this question for now and will let you know the results.
-
Hi! I'm experiencing a weird problem with Torch-TensorRT. Within a local script environment, running inference with my model and its corresponding TensorRT version gives a 3x speedup in inference time, which is awesome. The problem is that I can't replicate this speedup within TorchServe, even though I load the runtime in my handler. Is anyone else experiencing the same problem?
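In case it helps with debugging, here is a rough sketch (the file name model_trt.ts and the 1x3x224x224 input shape are placeholders) for timing the raw forward pass in isolation, so model latency can be separated from TorchServe's request-handling overhead:

```python
import time

import torch
import trtorch  # noqa: F401  # registers the TensorRT runtime ops (or `torch_tensorrt`)

model = torch.jit.load("model_trt.ts").cuda().eval()
example = torch.randn(1, 3, 224, 224, device="cuda")  # placeholder input shape

with torch.no_grad():
    for _ in range(10):            # warm-up so lazy initialization isn't timed
        model(example)
    torch.cuda.synchronize()
    start = time.perf_counter()
    for _ in range(100):
        model(example)
    torch.cuda.synchronize()       # wait for queued GPU work before stopping the clock
    elapsed = time.perf_counter() - start

print(f"mean latency: {elapsed / 100 * 1000:.2f} ms")
```

If this loop reports the same latency when run inside the handler as it does locally, the remaining gap probably comes from request handling (preprocessing, serialization, batching) rather than from the model itself.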
-
❓ Question
Is it possible to use TRTorch with TorchServe?
If not, what is the best way to deploy TRTorch programs?
What you have already tried
The documentation says that compiled programs can continue to be used via the standard PyTorch API.
I have converted all my models to TRTorch.
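For context, here is a minimal sketch of what I mean by using compiled programs via the PyTorch API (the file name and input shape are placeholders):

```python
import torch
import trtorch  # noqa: F401  # loads the TensorRT runtime ops needed to deserialize the module

# The compiled program is an ordinary TorchScript module.
model = torch.jit.load("model_trt.ts").cuda().eval()

with torch.no_grad():
    output = model(torch.randn(1, 3, 224, 224, device="cuda"))  # placeholder input shape
```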
Environment
How you installed PyTorch (conda, pip, libtorch, source):
Additional context