[Strategy] Support for Int8 schedules - CUDA/x86 #5031
LGTM
@vinx13 Adding you as well, because I have padded the C dim for GPU using Legalize so that the DP4A schedules can be used. Otherwise, we would have to put a check in the strategy.
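To illustrate the channel padding mentioned above, here is a minimal NumPy sketch, not the PR's Legalize code. It assumes NCHW data, an OIHW kernel, and a 1x1 convolution for brevity; the helper names `pad_in_channels` and `conv2d_1x1_int32` are hypothetical. It zero-pads the input-channel dimension to a multiple of 4 (the width of a DP4A dot product) and checks that the result is unchanged.

```python
import numpy as np


def pad_in_channels(data, kernel, factor=4):
    """Zero-pad the input-channel dim of NCHW data and an OIHW kernel
    up to the next multiple of `factor` (hypothetical helper)."""
    pad = (-data.shape[1]) % factor
    if pad == 0:
        return data, kernel
    widths = ((0, 0), (0, pad), (0, 0), (0, 0))
    # np.pad defaults to constant zero padding, which adds nothing to the sum.
    return np.pad(data, widths), np.pad(kernel, widths)


def conv2d_1x1_int32(data, kernel):
    """1x1 convolution as a contraction over channels, accumulated in int32."""
    weights = kernel[:, :, 0, 0].astype(np.int32)
    return np.einsum("nchw,oc->nohw", data.astype(np.int32), weights)


rng = np.random.default_rng(0)
data = rng.integers(-128, 128, size=(1, 3, 4, 4), dtype=np.int8)
kernel = rng.integers(-128, 128, size=(8, 3, 1, 1), dtype=np.int8)

data_p, kernel_p = pad_in_channels(data, kernel)
reference = conv2d_1x1_int32(data, kernel)
padded = conv2d_1x1_int32(data_p, kernel_p)
```

Because the extra channels are zero in both operands, `padded` equals `reference`; the only cost is the extra multiply-accumulates on the padding.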
I think this line https://github.com/apache/incubator-tvm/pull/5031/files#diff-bf1d7b23844ba1082c770babaa524806R178 should pass both the final output (outs[0].op) and the conv output to _schedule_conv2d_NCHWc_int8. Otherwise len(s[output].op.axis) == 5 would always be true, right? Correct me if I'm wrong.
Could you add a few tests for conv2d_nchw_int8 in topi/tests/python/test_topi_conv2d_int8.py? Otherwise, LGTM.
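For readers unfamiliar with the 5-axis check in the comment above: the sketch below assumes that the NCHWc convolution produces a packed 5-D tensor while the final output of a 4-D NCHW workload is 4-D, which is why the two stages can differ in axis count. It uses plain NumPy, not TVM, and the helper names `pack_nchwc` and `unpack_nchw` are hypothetical.

```python
import numpy as np


def pack_nchwc(x, c_block):
    """Pack a 4-D NCHW array into a 5-D NCHW[c_block]c array."""
    n, c, h, w = x.shape
    assert c % c_block == 0, "channels must be a multiple of the block size"
    # Split C into (C // c_block, c_block), then move the block innermost.
    return x.reshape(n, c // c_block, c_block, h, w).transpose(0, 1, 3, 4, 2)


def unpack_nchw(x):
    """Undo pack_nchwc: 5-D NCHWc back to 4-D NCHW."""
    n, c_outer, h, w, c_inner = x.shape
    return x.transpose(0, 1, 4, 2, 3).reshape(n, c_outer * c_inner, h, w)


x = np.arange(2 * 8 * 3 * 3).reshape(2, 8, 3, 3)
packed = pack_nchwc(x, 4)      # 5 axes, like the packed conv stage
restored = unpack_nchw(packed)  # 4 axes, like the final NCHW output
```

Under that assumption, a schedule that only ever sees the packed stage would always observe five axes, so it needs the final output as well to tell the two cases apart.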
I think padding channels would be helpful. It would be good to have a comparison result (channel padding + int8 template vs. direct template).
Thanks @anijain2305 @kevinthesun @vinx13. This is now merged.
* [CUDA] Op strategy changes for Int8 schedules.
* Applying Haichen's suggestions.
* Make 4D output work for task extraction.
* Make x86 work.
* Fix lint.
* Lint fixes.
* Tests, comments, out channel a multiple of 4.
* Topi test.
Co-authored-by: Ubuntu <[email protected]>
The recently introduced op strategy currently has some issues with AutoTVM task extraction. This PR fixes them for x86 and CUDA.
@kevinthesun @icemelon9