[Relay][PASS] CombineParallelConv2D generating new kernel and causing performance issue #2827
Comments
I think this pass can also be considered part of the graph tuner.
@yzhliu Yes. Maybe we can add a call to this pass in autotvm before any other workload extraction/graph analysis is executed.
fixed by #2961
Is this issue solved? We still don't call CombineParallelConv2D before workload extraction.
CombineParallelConv2D doesn't always give better performance. We should extract workloads with and without this pass and then decide whether to apply it.
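A minimal sketch of that "measure both, then decide" idea, in plain Python. The `measure` callback is a hypothetical stand-in for whatever builds and times the model with or without the pass; it is not a TVM or AutoTVM API, and the latencies below are made-up placeholders rather than real results.

```python
from typing import Callable, Dict


def choose_variant(measure: Callable[[bool], float]) -> Dict[str, object]:
    """Decide whether to apply the pass from two latency measurements.

    `measure(apply_pass)` is a hypothetical hook supplied by the caller.
    It should build and time the model with (True) or without (False)
    the pass and return the latency in milliseconds.
    """
    baseline_ms = measure(False)
    combined_ms = measure(True)
    return {
        # Apply the pass only when it is strictly faster.
        "apply_pass": combined_ms < baseline_ms,
        "baseline_ms": baseline_ms,
        "combined_ms": combined_ms,
    }


# Placeholder numbers standing in for real measurements.
fake_latency_ms = {False: 4.0, True: 5.5}
decision = choose_variant(lambda apply_pass: fake_latency_ms[apply_pass])
```

With these placeholder numbers the combined variant is slower, so `decision["apply_pass"]` is `False` and the pass would be skipped.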
Is it possible to move this pass to a higher opt level? It doesn't look suitable to put it together with the other mandatory optimization passes in opt level 3.
@kevinthesun Yes we can move it to opt level 4 for now
Let me summarize the actions here:
Would like to hear your comments @kevinthesun @merrymercy @yzhliu @tqchen
@vinx13 Sounds good to me. For autotvm and graph tuning parts, we can update the tutorial to add one more parameter to indicate whether to apply this pass before tuning.
CombineParallelConv2D moved to opt level 4. We can have another thread to deal with integration with autotvm.
Since several developers have hit a similar issue, I am opening this issue to track the status of the discussion topic https://discuss.tvm.ai/t/relay-alter-op-layout-pass-regression/1870/7. The CombineParallelConv2D pass generates new workloads and in some cases causes a performance regression. Another issue is how this pass combines with AutoTVM: it might need to run before AutoTVM is launched.
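To illustrate why combining produces a new workload, here is a simplified sketch in plain Python. `ConvWorkload` and `combine_parallel` are hypothetical names for this illustration, not TVM types or functions, and the shapes are made up. The sketch assumes the combined convolution keeps every attribute of the branches except the output channel count, which becomes the sum over the branches.

```python
from collections import namedtuple

# Hypothetical, simplified description of a conv2d workload.
ConvWorkload = namedtuple(
    "ConvWorkload",
    "in_channels out_channels kernel stride padding height width",
)


def combine_parallel(branches):
    """Merge convolutions that share an input into one wider convolution."""
    first = branches[0]
    for branch in branches[1:]:
        # Branches may differ only in their number of output channels.
        if branch._replace(out_channels=first.out_channels) != first:
            raise ValueError("branches differ in more than out_channels")
    total_out = sum(branch.out_channels for branch in branches)
    return first._replace(out_channels=total_out)


# Three parallel 1x1 convolutions on the same 256-channel, 28x28 input.
branches = [
    ConvWorkload(256, 64, (1, 1), (1, 1), (0, 0), 28, 28),
    ConvWorkload(256, 96, (1, 1), (1, 1), (0, 0), 28, 28),
    ConvWorkload(256, 16, (1, 1), (1, 1), (0, 0), 28, 28),
]

# Workloads extracted (and tuned) before the pass ran.
tuned = set(branches)

combined = combine_parallel(branches)
needs_tuning = combined not in tuned
```

Here the combined convolution has 176 output channels, which matches none of the workloads extracted beforehand. Under these assumptions, any tuning records collected before the pass ran would not cover it, which is consistent with the suggestion to run the pass before workload extraction.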