-
Notifications
You must be signed in to change notification settings - Fork 787
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CpuGemmConv2d optimization affects performance on Apple M2/M2 Pro #1092
Comments
Hi @alvoron Thanks for reporting this. Would you please let us know how many inferences/iterations you are running? |
I run model 30 sec and calculate average exec time of each operation type. |
Hi @alvoron The mentioned patch should affect the start-up time, i.e. the first iteration only. I wonder if your runs configure() each time, or configure() only in the first iteration and run() in the remaining ones. |
OpenVINO uses oneDNN to call ACL's convolution. It seems oneDNN calls configure once via |
Hi @alvoron I ran ACL's benchmark_graph_mobilenet_v2 on a device with M2 but I could not see a significant performance degradation. See below the execution including the patch that you mentioned
And this is without the patch
6620.1732 us - AVG=6600.4505 us = 19.7227 us Would you please confirm if you experience the problem on other devices? |
With |
It seems the issue could be reproduced via ACL build command: onednn configure command (run in onednn root dir): benchdnn build command: The reproducer: On M2 Pro I've got @morgolock could you please try to repeat these steps? UPD:
|
Hi @alvoron Closing this as it was fixed in 24.02.1 Please reopen if you require further assistance. |
PR https://review.mlplatform.org/c/ml/ComputeLibrary/+/10526 makes CpuGemmConv2d slower on Apple M2 / M2 Pro.
The numbers below were collected on M2 Pro.
On
mobilenet-v2-1.0-224
CpuGemmConv2d takes 3.18 ms before the PR and 4.12 after the PR was merged.resnet-50-pytorch
- 16.37 ms before the PR; 19.67 ms after the PRSo, we have 20-30% performance degradation on CNN.
@sicong-li-arm @gunes-arm @aniraj01
The text was updated successfully, but these errors were encountered: