Add pre-processing in PredictorTRT pipeline (#309)
* Add pre-processing transform in PredictorTRT

* Remove img_size parameters in PredictorTRT.warmup

* Fix device and dtype setting

* Fix inference pipeline

* Minor fixes

* Update tutorials

* Update README for TensorRT deploy pipeline

* Apply pre-commit

* Add Examples
zhiqwang authored Feb 11, 2022
1 parent 720cd32 commit c4fbbe1
Showing 5 changed files with 388 additions and 293 deletions.
15 changes: 15 additions & 0 deletions README.md
@@ -137,6 +137,21 @@ On the `ONNX Runtime` front you can use the [C++ example](deployment/onnxruntime

### Inference on TensorRT backend

The deployment pipeline for TensorRT is just as easy to use.

```python
import torch
from yolort.runtime import PredictorTRT

# Load the exported TensorRT engine
engine_path = "yolov5n6.engine"
device = torch.device("cuda")
y_runtime = PredictorTRT(engine_path, device=device)

# Perform inference on an image file
predictions = y_runtime.predict("bus.jpg")
```
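Since this commit also removes the `img_size` parameters from `PredictorTRT.warmup` (see the commit message above), warming up the engine before inference presumably reduces to a bare call. A minimal sketch, assuming `warmup` now takes no arguments:

```python
# Sketch only (assumption): after this commit, PredictorTRT.warmup
# no longer takes img_size parameters, so a bare call should suffice.
y_runtime.warmup()

# Subsequent predictions reuse the warmed-up engine.
predictions = y_runtime.predict("bus.jpg")
```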

On the `TensorRT` front you can use the [C++ example](deployment/tensorrt), and we also provide a [tutorial](https://zhiqwang.com/yolov5-rt-stack/notebooks/onnx-graphsurgeon-inference-tensorrt.html) for using `TensorRT`.

## 🎨 Model Graph Visualization
