Intel OpenVINO backend #2332

dkurt · 2020-12-21T07:00:01Z

Pull Request Template

Description

Add an Intel OpenVINO backend for prediction. The backend optimizes only prediction, not training.

To enable OpenVINO, pass use_openvino=True when creating a model:

# examples/tox21/tox21_tf_progressive.py

model = dc.models.ProgressiveMultitaskClassifier(
    len(tox21_tasks),
    n_features,
    layer_sizes=[1000],
    dropouts=[.25],
    learning_rate=0.001,
    batch_size=50,
    use_openvino=True)
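
For readers unfamiliar with this pattern, here is a minimal sketch of how a constructor flag of this kind could route only prediction to an alternate engine while training stays on the native path. The names SketchModel and _FastBackend are hypothetical and this is not the PR's implementation; no OpenVINO or DeepChem API is used.

```python
from typing import Callable, List, Optional, Sequence


class _FastBackend:
    """Hypothetical stand-in for an optimized inference engine."""

    def __init__(self, forward: Callable[[Sequence[float]], List[float]]):
        self._forward = forward
        self.calls = 0  # counts how often the fast path was used

    def infer(self, batch: Sequence[float]) -> List[float]:
        self.calls += 1
        return self._forward(batch)


class SketchModel:
    """Toy one-parameter model: prediction is weight * x."""

    def __init__(self, use_fast_backend: bool = False):
        self.use_fast_backend = use_fast_backend
        self._backend: Optional[_FastBackend] = None  # built lazily
        self.weight = 1.0

    def _forward(self, batch: Sequence[float]) -> List[float]:
        return [self.weight * x for x in batch]

    def fit(self, batch: Sequence[float], targets: Sequence[float]) -> None:
        # Training always uses the native path; the flag has no effect here.
        self.weight = sum(targets) / sum(batch)
        self._backend = None  # drop any backend built from older weights

    def predict(self, batch: Sequence[float]) -> List[float]:
        if not self.use_fast_backend:
            return self._forward(batch)
        if self._backend is None:
            self._backend = _FastBackend(self._forward)
        return self._backend.infer(batch)
```

Building the backend lazily on the first predict call means a model that is only ever trained pays no conversion cost.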

Efficiency measurements (not final, working on improvement):

benchmark code

import argparse
import time

import numpy as np
import deepchem as dc
from deepchem.molnet import load_tox21

parser = argparse.ArgumentParser()
parser.add_argument('--use_openvino', action='store_true')
args = parser.parse_args()

# Fixed seed, for debugging only
np.random.seed(123)

# Load the Tox21 dataset
n_features = 1024
tox21_tasks, tox21_datasets, transformers = load_tox21()
train_dataset, valid_dataset, test_dataset = tox21_datasets

metric = dc.metrics.Metric(dc.metrics.roc_auc_score, np.mean)

# Build the model; it is not trained here because only evaluation time is measured
model = dc.models.ProgressiveMultitaskClassifier(
    len(tox21_tasks),
    n_features,
    layer_sizes=[1000],
    dropouts=[.25],
    learning_rate=0.001,
    batch_size=50,
    use_openvino=args.use_openvino)

print("Evaluating model")
# Time three evaluation passes over the training set
for i in range(3):
    start = time.time()
    train_scores = model.evaluate(train_dataset, [metric], transformers)
    print(time.time() - start)

| tox21_tf_progressive | use_openvino=False | use_openvino=True |
| --- | --- | --- |
| Intel® Core™ i7 8665UE | 7.75 sec | 6.03 sec (x1.28) |
| Intel® Xeon® Gold 6258R | 1.23 sec | 0.72 sec (x1.7) |
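
The speedup factors in parentheses are the baseline time divided by the OpenVINO time. A short sketch of that arithmetic, using only the timings reported above (the helper name speedup is introduced here for illustration):

```python
# Timings in seconds, copied from the measurements above: (baseline, OpenVINO).
timings = {
    "Intel Core i7 8665UE": (7.75, 6.03),
    "Intel Xeon Gold 6258R": (1.23, 0.72),
}


def speedup(baseline_s: float, optimized_s: float) -> float:
    """Ratio of baseline time to optimized time (greater than 1 means faster)."""
    return baseline_s / optimized_s


for cpu, (base, ov) in timings.items():
    saved = 100.0 * (base - ov) / base  # percentage of wall-clock time saved
    print(f"{cpu}: x{speedup(base, ov):.2f} speedup, {saved:.1f}% less time")
```

Small differences in the last digit relative to the reported factors can come from rounding of the published timings.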

Tested models:

ProgressiveMultitaskClassifier (tox21)

Type of change

Please check the option that is related to your PR.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
- In this case, we recommend to discuss your modification on GitHub issues before creating the PR
Documentations (modification for documents)

Checklist

rbharath

Thanks for the contribution! I have a few preliminary comments below. I'm not very familiar with OpenVINO so some general information about its use cases and more documentation and tests would be very helpful for us to be able to maintain this as a feature.

deepchem/models/openvino_model.py

dkurt · 2020-12-22T09:21:56Z

@rbharath, Hi! Thanks for the review! I fully agree with your comments and am going to resolve them soon.

codecov-io · 2020-12-23T12:37:40Z

Codecov Report

Merging #2332 (8471ded) into master (5099208) will increase coverage by 0.07%.
The diff coverage is 94.97%.

@@            Coverage Diff             @@
##           master    #2332      +/-   ##
==========================================
+ Coverage   85.00%   85.08%   +0.07%     
==========================================
  Files         292      294       +2     
  Lines       26018    26191     +173     
==========================================
+ Hits        22116    22284     +168     
- Misses       3902     3907       +5

| Impacted Files | Coverage Δ | |
| --- | --- | --- |
| deepchem/data/data_loader.py | `88.56% <ø> (ø)` | |
| deepchem/data/datasets.py | `86.91% <ø> (ø)` | |
| deepchem/feat/graph_data.py | `79.48% <ø> (ø)` | |
| deepchem/utils/openvino_model.py | `92.37% <92.37%> (ø)` | |
| deepchem/models/keras_model.py | `87.14% <100.00%> (+0.28%)` | ⬆️ |
| deepchem/models/tests/test_openvino.py | `100.00% <100.00%> (ø)` | |
| deepchem/models/torch_models/torch_model.py | `89.14% <100.00%> (+0.22%)` | ⬆️ |
| deepchem/rl/tests/test_rl_reload.py | `96.61% <0.00%> (+1.69%)` | ⬆️ |

... and 1 more

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 01ea79d...8471ded.

env.cpu.yml

dkurt · 2020-12-24T11:59:02Z

Failed test:

deepchem/models/tests/test_graph_models.py::test_dtnn_regression_model FAILED [ 44%]

I'm not sure this failure is caused by the changes in this PR.

rbharath

Looks better! Thanks for adding in some unit tests :). I've done a more detailed review pass now.

As a first comment, I think we need some more work on the docstrings to match the rest of the codebase.

A second comment is whether we should consider making OpenVINOModel a private class.

My other major comment is about maintainability. If OpenVINO is experimental, I want to make sure that we can maintain it. If you have bandwidth to commit to maintaining it, I think we can make this work :)

Also, tagging in @peastman @nd-02110114 who might be interested to follow along. Please feel free to chime in if you have thoughts!

deepchem/models/openvino_model.py

env.cpu.yml

deepchem/models/openvino_model.py

nissy-dev · 2020-12-29T15:49:55Z

> My other major comment is about maintainability.

I'm also concerned about this point.
It seems that OpenVINO is an engine optimized for computer vision, that is, image processing with models such as CNNs. However, most DNN models for materials science rarely deal with image processing, so I don't think our users will gain much benefit from it. Also, in chemistry, inference performance is rarely as critical as it is in image recognition or object recognition.

In addition, this PR depends on a PyPI package that you published personally.
So it seems hard for us to maintain, and we have little motivation to do so.

dkurt · 2020-12-29T16:17:40Z

@nd-02110114, Thanks for the feedback!

Although OpenVINO was initially designed for computer vision tasks, it is a general-purpose engine, and the PR description shows a benefit even for networks with no convolutions at all (a 1.7x improvement for tox21_tf_progressive, which consists of GEMM layers only).

> In addition, this PR depends on a PyPI package that you published personally.
> So it seems hard for us to maintain, and we have little motivation to do so.

I see your point. I'll try to switch to an official package.

nissy-dev · 2021-01-03T08:10:51Z

@dkurt I'm sorry for the late response 🙇‍♂️

> Although OpenVINO was initially designed for computer vision tasks, it is a general-purpose engine, and the PR description shows a benefit even for networks with no convolutions at all (a 1.7x improvement for tox21_tf_progressive, which consists of GEMM layers only).

I understand that inference performance for tox21_tf_progressive improved. However, as I mentioned, our users, such as chemists, rarely face situations that require this kind of inference speedup, so I think most of them would not see the benchmark improvements as a benefit. Generally, our users need faster training or preprocessing rather than faster inference. In my experience, the bottleneck during inference is mainly preprocessing. If you know of good use cases for OpenVINO in chemistry, I would like to hear about them.

rbharath · 2021-01-04T22:05:43Z

One possible application that comes to mind is if a user wants to run inference against a large chemical or materials library. For example, users sometimes want to run a graph conv model against a large database of compounds like Enamine REAL (~1 billion compounds). In that case, it might be useful to have inference speedups. Could OpenVINO help for this case? If so a small benchmark might really help establish the advantage.
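
As a rough illustration of that use case, a library too large to hold in memory can be screened by streaming it in fixed-size batches, which is where per-batch inference speed adds up. This is only a sketch under stated assumptions: chunked, screen_library, and the predict_batch callable are hypothetical names, and any real model or backend would be plugged in as predict_batch.

```python
from itertools import islice
from typing import Callable, Iterable, Iterator, List, Sequence, Tuple


def chunked(items: Iterable[str], size: int) -> Iterator[List[str]]:
    """Yield consecutive lists of at most `size` items from any iterable."""
    it = iter(items)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk


def screen_library(
    smiles_stream: Iterable[str],
    predict_batch: Callable[[Sequence[str]], Sequence[float]],
    batch_size: int = 1024,
    threshold: float = 0.5,
) -> Iterator[Tuple[str, float]]:
    """Yield (smiles, score) for compounds scoring at or above the threshold.

    Only one batch is held in memory at a time, so the stream can be
    arbitrarily long (for example, lines read lazily from a file).
    """
    for batch in chunked(smiles_stream, batch_size):
        scores = predict_batch(batch)
        for smiles, score in zip(batch, scores):
            if score >= threshold:
                yield smiles, score
```

With a toy scorer such as `lambda batch: [len(s) / 10 for s in batch]`, the function can be exercised without any model installed.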

@dkurt Seconding @nd-02110114 that it would be great to hear about any other use cases you have in mind :).

dkurt · 2021-01-05T14:40:24Z

@nd-02110114, @rbharath, Thanks for the comments! You're absolutely right that this kind of optimization pays off most on large volumes of data.

> For example, users sometimes want to run a graph conv model against a large database of compounds like Enamine REAL (~1 billion compounds)

Could you point us to a model so we can benchmark it?

rbharath · 2021-01-19T00:36:19Z

My apologies for the slow response here! This fell behind due to the DeepChem 2.4.0 release work.

As a suggestion for a benchmark model, could you try running dc.models.GraphConvModel with the OpenVINO backend to see the speed improvements? You could use any collection of molecules for this, but the ZINC15 dataset is available in MoleculeNet (and is quite large), so it might be a good benchmark at scale.
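
A backend comparison like this could be timed with a small harness along the following lines. This is a sketch, not part of the PR: time_call and compare are hypothetical helpers, and each backend is represented by a zero-argument callable (for instance, one that wraps a model's prediction over a fixed dataset).

```python
import statistics
import time
from typing import Callable, Dict


def time_call(fn: Callable[[], object], repeats: int = 5, warmup: int = 1) -> float:
    """Median wall-clock seconds for fn(), after discarding warm-up runs."""
    for _ in range(warmup):
        fn()  # warm-up absorbs one-time costs such as model conversion
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        samples.append(time.perf_counter() - start)
    return statistics.median(samples)


def compare(backends: Dict[str, Callable[[], object]], baseline: str) -> Dict[str, float]:
    """Return each backend's speedup relative to the named baseline."""
    medians = {name: time_call(fn) for name, fn in backends.items()}
    return {name: medians[baseline] / t for name, t in medians.items()}
```

Using the median over several runs, with a discarded warm-up pass, keeps a slow first call from dominating the reported number.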

rbharath · 2023-04-05T23:38:14Z

Closing this old PR for cleanup

dkurt marked this pull request as draft December 21, 2020 07:00

dkurt force-pushed the openvino branch 2 times, most recently from 4703992 to 83b9c42 Compare December 21, 2020 10:02

rbharath reviewed Dec 22, 2020

dkurt force-pushed the openvino branch from b3f6010 to f8553d9 Compare December 24, 2020 11:03

dkurt commented Dec 24, 2020

dkurt force-pushed the openvino branch 2 times, most recently from 35715ae to d5c5548 Compare December 24, 2020 18:50

dkurt marked this pull request as ready for review December 24, 2020 19:22

rbharath reviewed Dec 28, 2020

dkurt force-pushed the openvino branch from 39dd709 to 8471ded Compare December 29, 2020 08:08

dkurt mentioned this pull request Jan 12, 2021

[MO] pip packaging openvinotoolkit/openvino#3123

Merged

dkurt added 9 commits April 29, 2021 08:13

- Tox21 with OpenVINO (54c1303)
- Sync inference (e8d364c)
- Move OpenVINO impl to separate module (7c57d14)
- Enable OpenVINO tests (f47083a)
- Install MO package (f9f8234)
- OpenVINO async inference (c9d6d83)
- OpenVINO backend for PyTorch models (bfca196)
- Fix Torch import (b20155b)
- Fix docs for OpenVINO and do some refactoring (837c3c1)

dkurt and others added 3 commits April 29, 2021 08:15

- Use official OpenVINO (ff219df)
- Use MO from GitHub (c86860c)
- add cgcnn regression (#7) (2cb955a)

dkurt force-pushed the openvino branch from 9d471ed to 2cb955a Compare April 29, 2021 05:27

dkurt marked this pull request as draft April 29, 2021 06:15

rbharath closed this Apr 5, 2023
