
Future optimizations #112

Open
nathanielrindlaub opened this issue Apr 7, 2023 · 3 comments

Comments

@nathanielrindlaub
Member

nathanielrindlaub commented Apr 7, 2023

@rbavery had a few ideas for future optimizations of the MegaDetector v5 endpoint that I wanted to document:

  1. test compiling the model with NeuralMagic for increased inference speed
  2. explore using test-time augmentations (during inference, perform a few different random transforms/pre-processing steps on the fly, request inference on all versions of the image, and then average results across them) to boost model accuracy. This would come at the cost of potentially tripling (or more) our inference time, depending on how many augmentations we try and under what conditions, so we'd want to think it through a bit more and be sure the benefits outweigh the costs.
  3. use ONNX-compiled models across all endpoints for the sake of standardization (and perhaps some speed gains); a rough export sketch follows this list
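
A minimal sketch of what the ONNX export in item 3 might look like, assuming the MDv5a weights load through the standard YOLOv5 hub interface (the weights path, input size, and opset below are placeholder assumptions, not our actual config):

```python
import torch

# Placeholder path to the MDv5a checkpoint; autoshape=False gives us the raw
# nn.Module, which is what torch.onnx.export expects.
model = torch.hub.load("ultralytics/yolov5", "custom",
                       path="md_v5a.0.0.pt", autoshape=False)
model.eval()

# MDv5 is typically run at 1280x1280.
dummy = torch.zeros(1, 3, 1280, 1280)

torch.onnx.export(
    model,
    dummy,
    "md_v5a.onnx",
    input_names=["images"],
    output_names=["output"],
    opset_version=17,
    dynamic_axes={"images": {0: "batch"}, "output": {0: "batch"}},
)
```

In practice we'd probably just call YOLOv5's `export.py` with `--include onnx`, which handles the Detect head and dynamic axes for us; the snippet above is only meant to show the shape of the workflow.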

@rbavery - anything else to add here??

@nathanielrindlaub
Member Author

From Dan:

Sometimes, if we're still missing animals, but one or both models look close, try again using YOLOv5's test-time augmentation tools via this alternative (but compatible) MD inference script.
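
For context, YOLOv5's TTA can also be exercised directly through the torch hub API; a minimal sketch (the weights path, image, size, and confidence threshold are placeholders, and `augment=True` is YOLOv5's built-in test-time-augmentation flag):

```python
import torch

# Load MDv5a through the YOLOv5 hub interface (weights path is a placeholder).
model = torch.hub.load("ultralytics/yolov5", "custom", path="md_v5a.0.0.pt")
model.conf = 0.1  # keep low-confidence boxes; results get filtered downstream

# augment=True turns on YOLOv5's test-time augmentation: inference is run on
# scaled/flipped copies of the image and the detections are merged before NMS.
results = model("camera_trap_image.jpg", size=1280, augment=True)
results.print()
```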

@nathanielrindlaub
Member Author

Also according to Dan, "inference takes ~1.7x longer with TTA turned on". That's not as bad a hit as I was imagining, so it's very much worth evaluating.

@rbavery
Contributor

rbavery commented Nov 10, 2023

Just a heads up that I got to try running MDv5a compiled with TensorRT and it was blazing fast. Example here: https://github.com/rbavery/animal_detector/blob/master/mdv5app/torchscript_to_tensorrt.py

It sped up inference something like ~10x on my GPU compared to running the TorchScript model without TensorRT.

This might be the most cost-effective option for bulk inference without requiring a change in architecture. It still uses TorchServe and virtually the same handler code paths.
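
For reference, a minimal sketch of that compilation step, loosely following the linked torchscript_to_tensorrt.py (the weights path, input resolution, and fp16 precision are assumptions; it needs the torch_tensorrt package and a CUDA GPU):

```python
import torch
import torch_tensorrt

# Load the TorchScript-exported MDv5a model (path is a placeholder).
ts_model = torch.jit.load("md_v5a.0.0.torchscript.pt").eval().cuda()

# Compile with Torch-TensorRT; fp16 is where most of the speedup comes from.
trt_model = torch_tensorrt.compile(
    ts_model,
    inputs=[torch_tensorrt.Input((1, 3, 1280, 1280), dtype=torch.half)],
    enabled_precisions={torch.half},
)

# Save the compiled module so the existing TorchServe handler can load it
# like any other TorchScript model.
torch.jit.save(trt_model, "md_v5a_trt.torchscript.pt")
```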
