-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Subgraph extraction in ONNX models #4107
Changes from all commits
0446814
0c8a4f7
4a512fc
49d2908
15bcb02
fe7b3b1
7e70fda
53667fa
ddcffa3
0078926
b2373e4
b8fb3bd
45d11e1
2d3e298
f2fdf5e
48b3cd4
92addf2
59d7ef4
6eaee42
0e7822f
985ef97
263cd4b
1f18183
9972ac0
a39cbd0
4b86f01
ad7100c
2a3744b
6980948
8d5f739
b337772
6c67ec8
0ae591d
ee6831c
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,164 @@ | ||
//***************************************************************************** | ||
// Copyright 2017-2021 Intel Corporation | ||
// | ||
// Licensed under the Apache License, Version 2.0 (the "License"); | ||
// you may not use this file except in compliance with the License. | ||
// You may obtain a copy of the License at | ||
// | ||
// http://www.apache.org/licenses/LICENSE-2.0 | ||
// | ||
// Unless required by applicable law or agreed to in writing, software | ||
// distributed under the License is distributed on an "AS IS" BASIS, | ||
// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. | ||
// See the License for the specific language governing permissions and | ||
// limitations under the License. | ||
//***************************************************************************** | ||
|
||
#pragma once | ||
|
||
#include <map> | ||
#include <set> | ||
#include <string> | ||
#include <unordered_map> | ||
#include <vector> | ||
|
||
namespace ONNX_NAMESPACE | ||
{ | ||
class GraphProto; | ||
class NodeProto; | ||
class ValueInfoProto; | ||
} // namespace ONNX_NAMESPACE | ||
|
||
namespace ngraph | ||
{ | ||
enum class EdgeType | ||
{ | ||
INPUT, | ||
OUTPUT | ||
}; | ||
|
||
template <EdgeType> | ||
struct Edge | ||
{ | ||
Edge() = delete; | ||
Edge(const int node_idx, std::string tensor_name) | ||
: m_node_idx{node_idx} | ||
, m_tensor_name{std::move(tensor_name)} | ||
{ | ||
} | ||
|
||
const int m_node_idx; | ||
const std::string m_tensor_name; | ||
}; | ||
namespace onnx_import | ||
{ | ||
/// \brief Defines an edge connected to an input of any node in the graph. | ||
/// It consists of a node index in the processed ONNX model and the input name. | ||
tomdol marked this conversation as resolved.
Show resolved
Hide resolved
|
||
/// The index should point to a node in the topological sort of the underlying graph | ||
/// which means it has to be in range: 0 <= node_idx < graph.node_size() | ||
/// | ||
/// For a node number 5, with 3 inputs: | ||
/// | ||
/// ----(in_A)----> +--------+ | ||
/// ----(in_B)----> | node 5 | ----(out)----> | ||
/// ----(in_C)----> +--------+ | ||
/// | ||
/// there are 3 possible valid instances of this struct: | ||
/// InputEdge(5, "in_A") | ||
/// InputEdge(5, "in_B") | ||
/// InputEdge(5, "in_C") | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Assuming that There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. As discussed offline, this will be addressed in a separate ticket: 47578 |
||
using InputEdge = Edge<EdgeType::INPUT>; | ||
|
||
/// \brief Defines an edge connected to an output of any node in the graph. | ||
/// It consists of a node index in the processed ONNX model and the output name. | ||
/// | ||
/// For a node number 5, with 2 outputs: | ||
/// | ||
/// +--------+ ----(out1)----> | ||
/// ----(in_A)----> | node 5 | | ||
/// +--------+ ----(out2)----> | ||
/// | ||
/// there are 2 possible valid instances of this struct: | ||
/// OutputEdge(5, "out1") | ||
/// OutputEdge(5, "out2") | ||
using OutputEdge = Edge<EdgeType::OUTPUT>; | ||
|
||
/// \brief Subgraph extraction helper structure | ||
struct SubgraphExtractor | ||
{ | ||
SubgraphExtractor(ONNX_NAMESPACE::GraphProto& graph); | ||
|
||
/// \brief Adds new inputs to the graph and connects them to the nodes indicated by | ||
/// the provided input edges. | ||
void add_new_inputs(const std::vector<InputEdge>& new_inputs); | ||
|
||
/// \brief Adds new outputs to the graph with the same name as the nodes pointed to | ||
/// by the input edges "new_outputs". | ||
void add_new_outputs(const std::vector<OutputEdge>& new_outputs); | ||
|
||
/// \brief Extracts the final subgraph by traversing the original model bottom-up | ||
/// starting at each of the provided output edges. The extracted subgraph | ||
/// contains all previously added inputs and potentially a subset of original | ||
/// model's inputs that contribute to the value calculated in the output tensors. | ||
/// In the end the underlying GraphProto is modified and obsolete elements | ||
/// are discarded after this method call has finished. | ||
/// | ||
/// \param subgraph_outputs A list of expected outputs of the extracted subgraph. | ||
void extract_subgraph(std::vector<OutputEdge> subgraph_outputs); | ||
|
||
/// \brief Represents a subgraph of an ONNX model by holding a subset of nodes, inputs, | ||
/// outputs and initializers of the original graph. Objects of this struct can be | ||
/// merged into other instances using the += operator to build a subgraph from | ||
/// smaller clusters. | ||
struct SubgraphComponents | ||
{ | ||
SubgraphComponents() = default; | ||
SubgraphComponents(const SubgraphComponents&) = delete; | ||
SubgraphComponents(SubgraphComponents&&) = default; | ||
SubgraphComponents& operator=(const SubgraphComponents&) = delete; | ||
SubgraphComponents& operator=(SubgraphComponents&&) = default; | ||
|
||
std::set<int> nodes; | ||
std::set<std::string> inputs; | ||
std::set<std::string> initializers; | ||
std::set<std::string> outputs; | ||
|
||
SubgraphComponents& operator+=(SubgraphComponents&& other) | ||
{ | ||
nodes.insert(other.nodes.begin(), other.nodes.end()); | ||
inputs.insert(other.inputs.begin(), other.inputs.end()); | ||
initializers.insert(other.initializers.begin(), other.initializers.end()); | ||
outputs.insert(other.outputs.begin(), other.outputs.end()); | ||
return *this; | ||
} | ||
}; | ||
|
||
private: | ||
ONNX_NAMESPACE::GraphProto& m_onnx_graph; | ||
|
||
// Graph traversal helper: node index -> node inputs (one-to-many) | ||
std::unordered_multimap<int, std::string> m_node_inputs; | ||
// Number of consumers of all tensors in the graph | ||
std::map<std::string, int> m_tensor_consumers; | ||
|
||
/// \brief Replaces the old input edge with a new one in the helper struct. | ||
/// This is used by the output contributors discovery. | ||
void replace_input_edge(const InputEdge& old_edge, const InputEdge& new_edge); | ||
|
||
/// \brief Returns a list of edges of each outputs of the graph "m_onnx_graph" | ||
std::vector<OutputEdge> all_output_edges() const; | ||
|
||
/// \brief Traverses the graph bottom-up and collects all nodes, inputs and initializers | ||
/// that contribute to an output designated by the provided output edge. | ||
/// A sum of such SubgraphComponents objects forms a target extracted subgraph. | ||
SubgraphComponents | ||
discover_output_contributors(const OutputEdge& output_edge, | ||
const SubgraphComponents& already_collected) const; | ||
|
||
/// \brief Modifies the underlying GraphProto object and discards all obsolete elements. | ||
/// | ||
/// \param subgraph An object describing the subgraph to be extracted (elems to be kept) | ||
void extract_subgraph_from_onnx_model(const SubgraphComponents& subgraph); | ||
}; | ||
} // namespace onnx_import | ||
} // namespace ngraph |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How can one obtain a valid value for m_node_idx? Briefly looking in the code I see that a valid value is really required but there is no way for user to deduce this index based on some user-visible name without parsing model proto file. Even Netron doesn't give this index.
Can we make it at least optional? In this case it would mean cutting by a tensor name and providing a single input for all the consumers.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Similarily to the comment below - TBD in 47578