Added support for setGraphExecutorOptimize with torchscript models. #904
Conversation
…required model warmup for different batch sizes.
@frankfliu I created this as a draft pull request since it is my first pull request. Please have a look, and you can mark it ready for review.
After doing some testing, there is currently a bug with multi-threaded inference: the same multi-second delay appears after a few inferences. It does not happen single-threaded. I have fixed it by running … I propose that we add another parameter to … I am going to continue investigating to see if I can figure out why the value is not respected when set in the PtSymbolBlock on the first inference, and only when using more than one thread.
I was able to resolve this by calling … I think the best course of action would be to add a parameter to …
… since this is not respected in a multi-threaded environment.
It might be a good idea to simply let the developer use the JNI function however they want, rather than constraining it one way or the other. This flexibility will be needed in some situations, for example if you want to load one set of models on one thread with optimization and another set of models on a second thread without optimization. A global approach is much less flexible, and the per-thread approach doesn't force any changes on existing usage. I just reverted my changes to PtSymbolBlock, as I think flexibility is the way to go.
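The mixed per-thread usage described above can be sketched as follows. This is a minimal illustration, not DJL's actual implementation: `JniUtils` here is a hypothetical stub that merely records the flag each thread sets, standing in for the real JNI binding, to show that two loader threads can choose opposite optimization settings independently.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

public class PerThreadOptimization {
    // Hypothetical stub standing in for the real JNI binding; it only
    // records the flag per calling thread, to illustrate that the
    // setting is scoped to the thread rather than being global.
    static class JniUtils {
        static final Map<Long, Boolean> flagByThread = new ConcurrentHashMap<>();

        static void setGraphExecutorOptimize(boolean enabled) {
            flagByThread.put(Thread.currentThread().getId(), enabled);
        }
    }

    // One loader thread keeps optimization on, the other turns it off.
    public static Map<Long, Boolean> loadWithMixedSettings() throws InterruptedException {
        Thread optimized = new Thread(() -> JniUtils.setGraphExecutorOptimize(true));
        Thread unoptimized = new Thread(() -> JniUtils.setGraphExecutorOptimize(false));
        optimized.start();
        unoptimized.start();
        optimized.join();
        unoptimized.join();
        return JniUtils.flagByThread;
    }

    public static void main(String[] args) throws InterruptedException {
        // Both settings coexist, one per thread.
        System.out.println(loadWithMixedSettings());
    }
}
```

Because the flag lives in per-thread state, neither thread's choice disturbs the other, which is the flexibility argued for above.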
Change-Id: I59b7ef2e2b24543d34a9c15e73add232ef55afc6
Codecov Report
@@            Coverage Diff             @@
##            master     #904     +/-  ##
============================================
- Coverage      70.34%   70.32%   -0.02%
  Complexity      5085     5085
============================================
  Files            501      501
  Lines          22432    22437       +5
  Branches        2332     2335       +3
============================================
  Hits           15779    15779
- Misses          5412     5417       +5
  Partials        1241     1241
Continue to review full report at Codecov.
Description
This PR adds support for torch::jit::setGraphExecutorOptimize, which allows the user to avoid the model "warmup" period during which TorchScript optimizes the model on the GPU.
Users can disable the torchscript optimization with the following code:
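The original snippet does not survive in this excerpt. A sketch of the intended call, assuming the method is exposed through a DJL `JniUtils`-style utility class (replaced here by a minimal stub so the example is self-contained; the real call would forward to torch::jit::setGraphExecutorOptimize through JNI):

```java
public class DisableOptimization {
    // Minimal stub standing in for the JNI utility class.
    static class JniUtils {
        // TorchScript enables graph-executor optimization by default.
        static boolean graphExecutorOptimize = true;

        static void setGraphExecutorOptimize(boolean enabled) {
            graphExecutorOptimize = enabled;
        }
    }

    // Call this on each thread that should skip the warmup optimization.
    public static boolean disable() {
        JniUtils.setGraphExecutorOptimize(false);
        return JniUtils.graphExecutorOptimize;
    }

    public static void main(String[] args) {
        System.out.println(disable()); // prints false
    }
}
```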
Since this feature is enabled by default in TorchScript, optimization is disabled only when the method above is called. Because JNI maintains a separate environment per thread, the method must be called on each thread that uses a model for which optimization should be disabled.
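The per-thread requirement can be demonstrated with a worker pool. This is a sketch under the same assumption as above: `JniUtils` is a hypothetical stub recording which threads applied the setting, mirroring the fact that the flag must be set on every inference thread, not once globally.

```java
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

public class ThreadedConfiguration {
    // Hypothetical stub: records which threads have applied the setting.
    static class JniUtils {
        static final Set<Long> configuredThreads = ConcurrentHashMap.newKeySet();

        static void setGraphExecutorOptimize(boolean enabled) {
            configuredThreads.add(Thread.currentThread().getId());
        }
    }

    // Runs one task per pool thread; each task disables optimization for
    // its own thread before doing inference. Returns how many distinct
    // threads ended up configured.
    public static int configureWorkers(int nThreads) throws InterruptedException {
        ExecutorService pool = Executors.newFixedThreadPool(nThreads);
        CountDownLatch ready = new CountDownLatch(nThreads);
        CountDownLatch go = new CountDownLatch(1);
        for (int i = 0; i < nThreads; i++) {
            pool.submit(() -> {
                ready.countDown();
                try {
                    go.await(); // hold until all tasks occupy distinct threads
                } catch (InterruptedException ignored) {
                    Thread.currentThread().interrupt();
                }
                // Each worker must make the call itself.
                JniUtils.setGraphExecutorOptimize(false);
            });
        }
        ready.await();
        go.countDown();
        pool.shutdown();
        pool.awaitTermination(5, TimeUnit.SECONDS);
        return JniUtils.configuredThreads.size();
    }

    public static void main(String[] args) throws InterruptedException {
        // All four worker threads configure themselves independently.
        System.out.println(configureWorkers(4)); // prints 4
    }
}
```

The latches only exist to make the demonstration deterministic: they ensure all tasks are pinned to distinct pool threads before any of them sets the flag.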
This change is backwards compatible and does not alter the usage of any existing code.