I see some differences in the configuration settings between the two builds. One of them has reordering set, and prefetching is enabled on main but not on the MLPerf branch.
To reproduce:
Input Model:
https://sharkpublic.blob.core.windows.net/sharkpublic/sai/sdxl-punet/punet.mlir
Input data:
wget https://sharkpublic.blob.core.windows.net/sharkpublic/sai/sdxl-punet/inference_input.0.bin
wget https://sharkpublic.blob.core.windows.net/sharkpublic/sai/sdxl-punet/inference_input.1.bin
wget https://sharkpublic.blob.core.windows.net/sharkpublic/sai/sdxl-punet/inference_input.2.bin
wget https://sharkpublic.blob.core.windows.net/sharkpublic/sai/sdxl-punet/inference_input.3.bin
wget https://sharkpublic.blob.core.windows.net/sharkpublic/sai/sdxl-punet/inference_input.4.bin
wget https://sharkpublic.blob.core.windows.net/sharkpublic/sai/sdxl-punet/inference_input.5.bin
wget https://sharkpublic.blob.core.windows.net/sharkpublic/sai/sdxl-punet/punet_weights.irpa
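For convenience, the downloads above could be scripted roughly as follows. This is only a sketch: the `punet_urls` function name and the `--fetch` switch are made up for illustration, and it assumes `bash` and `wget` are available. It fetches the model, the six input files, and the weights into the current directory.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Base URL taken from the links listed above.
base="https://sharkpublic.blob.core.windows.net/sharkpublic/sai/sdxl-punet"

# Hypothetical helper: print every artifact URL, one per line.
punet_urls() {
  echo "$base/punet.mlir"
  for i in 0 1 2 3 4 5; do
    echo "$base/inference_input.$i.bin"
  done
  echo "$base/punet_weights.irpa"
}

# Download only when invoked with --fetch, so the list can be inspected first.
if [ "${1:-}" = "--fetch" ]; then
  punet_urls | while read -r url; do
    # --continue resumes a partially downloaded file instead of restarting it.
    wget --continue "$url"
  done
fi
```

Running the script with no arguments downloads nothing; passing `--fetch` performs the downloads.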
I built IREE on main and used the TD script in https://github.com/nod-ai/sdxl-scripts/blob/shared/sdxl_on_main/int8-model/specs/attention_and_matmul_spec.mlir
Compilation command for IREE on main
Run Command :
For compilation on the MLPerf branch I used the same inputs and weights, but with:
IREE commit: https://github.com/iree-org/iree/tree/mlperf_v4.1_20240726
TD script: https://github.com/nod-ai/sdxl-scripts/blob/mlperf_v4.1_20240726/int8-model/specs/attention_and_matmul_spec.mlir
and the same run command.
There is a general slowdown in performance. The problem is most noticeable in the following three matmul dispatches:
main$async_matmul_156_matmul_like_*
main$async_matmul_143_matmul_like_*
main$async_matmul_158_matmul_like_*
Attached are the IR logs before and after strategy selection and lowering:
sdxl_mlperf_matmul_143.dump.mlir.txt
sdxl_mlperf_matmul_156.dump.mlir.txt
sdxl_mlperf_matmul_158.dump.mlir.txt
sdxl_tom_matmul_143.dump.mlir.txt
sdxl_tom_matmul_156.dump.mlir.txt
sdxl_tom_matmul_158.dump.mlir.txt
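One way to compare the attached dumps pairwise is a unified diff per dispatch. The sketch below assumes the six attachments have been saved under their original names in one directory and that the `tom` files correspond to the build on main; the `diff_dumps` function and the `matmul_<n>.diff` output names are made up for illustration.

```shell
#!/usr/bin/env bash

# Hypothetical helper: diff the MLPerf and tom dumps for each dispatch
# found in the given directory (defaults to the current directory).
diff_dumps() {
  dir="${1:-.}"
  for n in 143 156 158; do
    mlperf="$dir/sdxl_mlperf_matmul_${n}.dump.mlir.txt"
    tom="$dir/sdxl_tom_matmul_${n}.dump.mlir.txt"
    if [ -f "$mlperf" ] && [ -f "$tom" ]; then
      # diff exits non-zero when the files differ, which is expected here,
      # so the exit status is deliberately ignored.
      diff -u "$mlperf" "$tom" > "$dir/matmul_${n}.diff" || true
    else
      echo "skipping matmul_${n}: dump file(s) not found in $dir" >&2
    fi
  done
}

diff_dumps "${1:-.}"
```

Each resulting `matmul_<n>.diff` shows the MLPerf lines with `-` and the tom lines with `+`, which makes differences in the selected lowering configuration easier to spot.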