Add OpenVINO model inference when models are run on CPU #72

surajpaib · 2024-07-18T20:44:18Z

Following up here on some of the work I did at PW41 (https://projectweek.na-mic.org/PW41_2024_MIT/Projects/IntegrateCpuFriendlyAutoSegmentationAndCtUtilityModelsIntoMhub/)

The main idea would be to provide OpenVINO optimized models for each of the trained models provided in this repo. When run on CPU, these models would be fetched (which are half the size) and optimized inference would be run with the OpenVINO inference API.

Sending this PR in early so we can discuss how to best integrate this.

What I've done so far is:

Take every model url from Models.json and convert the model.pt to an OpenVINO model.xml and model.bin. These are for now repackaged among the zip files and hosted on my fork here: https://github.com/surajpaib/SlicerMONAIAuto3DSeg/releases/Models The conversion script is rather simple and could be added to a Github action or so if needed. https://gist.github.com/surajpaib/74600da3c2ad4e983f4d5022301bf568
I've edited the inference script in the Slicer extension and run the OpenVINO inference using these models (code changes in the PR)
Ran the testing suite from the slicer extension (disabling the GPU). You can see the results here: https://github.com/surajpaib/SlicerMONAIAuto3DSeg/releases/ModelsTestResults
These might not be one to one comparisons with the previous (as I use a different CPU) so maybe these need to be re-run on the machine used for the original benchmark. On my machine, I see a speedup of between 1.5x to 2x. You can refer to the PW page where I put up some comparisons.

surajpaib and others added 8 commits July 1, 2024 09:18

Add OpenVINO enabled inference

b9498e9

Update model path

520fe0b

Merge branch 'lassoan:main' into main

80bc162

Add global disable for quick testing

d5a9610

Update release paths to forked repo

dc05d43

Merge branch 'lassoan:main' into main

8836f78

Update brats

44bce83

Merge branch 'main' of https://github.com/surajpaib/SlicerMONAIAuto3DSeg

d236f45

surajpaib marked this pull request as ready for review July 23, 2024 15:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add OpenVINO model inference when models are run on CPU #72

Add OpenVINO model inference when models are run on CPU #72

surajpaib commented Jul 18, 2024

Add OpenVINO model inference when models are run on CPU #72

Are you sure you want to change the base?

Add OpenVINO model inference when models are run on CPU #72

Conversation

surajpaib commented Jul 18, 2024