-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OneDNN hardswish integration #30211
OneDNN hardswish integration #30211
Conversation
Thanks for your contribution! |
@jczaja Could you please take a look? |
96d1b9c
to
434cf2e
Compare
@jakpiase Please do profiling with following config, thanks
|
I profiled on my i9 machine, 10 warmup , 100 repeat
Performance improvement: |
fead798
to
843c86e
Compare
Sorry to inform you that d4e524e's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually. |
@lidanqing-intel Please verify the accuracy with enabling OneDNN hardswish. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
@luotao1 Could you please start your review? |
@jakpiase Juncai ask to cherry-pick this PR to release/2.0 |
@jakpiase |
* OneDNN hardswish integration (#30211) * keep only conv + hardswish in this PR Co-authored-by: jakpiase <[email protected]>
PR types
New features
PR changes
OPs
Describe
Added support for oneDNN hardswish activation function. Conv + activation and fc + activation fuse passes can now also fuse with hardswish activation.
Profiled on Intel(R) d on(R) Gold 6348H CPU @ 2.30GHz
warmup = 10, repeat = 100
CPU native
oneDNN without hardswish
oneDNN with hardswish
Total times:
oneDNN without hardswish / oneDNN with hardswish = 1.19
CPU native / oneDNN with hardswish = 2.76