
[C++] How to predict fast? #2118

Closed

bitnick10 opened this issue Oct 13, 2019 · 3 comments
@bitnick10 commented Oct 13, 2019

// One session_->Run() call per input tensor -- a billion sequential calls.
for (int i = 0; i < 1000 * 1000 * 1000; i++) {
    auto& input_tensor = input_tensor_vec[i];
    auto output_tensors = session_->Run(Ort::RunOptions{nullptr},
                                        input_node_names.data(), &input_tensor, 1,
                                        output_node_names.data(), 1);
}

This is really slow. Is there some way to make this faster (using C++)?

@hariharans29 (Member) commented

Your model must inherently support "batching", and then you can feed in a "batched" input.

For example, if your model's graph input (taking the image case) has shape [1, 3, 224, 224], it can only take one 3-channel image of height and width 224. But if its graph input looks like ["some_string", 3, 224, 224], i.e. the first (batch) dimension is symbolic rather than fixed at 1, it can handle a "batched" input of any size.
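
A minimal sketch of what the single batched call might look like, assuming a float NCHW image model and reusing the session_, input_node_names, and output_node_names variables from the snippet above; the batch size N and the data buffer are placeholders:

#include <onnxruntime_cxx_api.h>
#include <vector>

const int64_t N = 64;                        // batch size (assumption)
std::vector<int64_t> shape{N, 3, 224, 224};  // batched NCHW shape
std::vector<float> data(N * 3 * 224 * 224);  // fill with N preprocessed images

Ort::MemoryInfo mem_info =
    Ort::MemoryInfo::CreateCpu(OrtArenaAllocator, OrtMemTypeDefault);
Ort::Value batched_input = Ort::Value::CreateTensor<float>(
    mem_info, data.data(), data.size(), shape.data(), shape.size());

// One Run() for all N samples instead of N separate calls.
auto output_tensors = session_->Run(Ort::RunOptions{nullptr},
                                    input_node_names.data(), &batched_input, 1,
                                    output_node_names.data(), 1);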

Hope this helps.

@bitnick10 (Author) commented

import onnx

# Rewrite the first graph input's batch dimension as a symbolic (named)
# dimension so the model accepts any batch size.
mp = onnx.load_model('aa.onnx')
mp.graph.input[0].type.tensor_type.shape.dim[0].dim_param = 'None'
onnx.save(mp, 'aa_batch.onnx')
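
(Any non-empty string works as the symbolic name here; 'None' is just the label this snippet happens to use. Once dim_param is set, ONNX Runtime treats that dimension as dynamic.)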

[C++] After session_ loads aa_batch.onnx:
auto output_tensors = session_->Run(Ort::RunOptions{ nullptr }, input_node_names.data(), &input_tensor, 1, output_node_names.data(), 1);
It runs very fast! Thanks @hariharans29
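
(The speedup presumably comes from amortization: the fixed per-call overhead of session_->Run() is paid once per batch rather than once per sample, and the backend can execute the whole batch as larger, better-vectorized operations.)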

@cena001plus commented

(quotes @bitnick10's reply above)

Does it also speed things up to simply modify the model to have a dynamic input, without modifying the output node and without batching the input data? Hope to hear from you.
