[GPU] In gemm_tile_kernel, use block read when the N and K byte sizes are 4-byte aligned. #11629
Job | Run time |
---|---|
26s | |
11m 50s | |
0s | |
0s | |
0s | |
0s | |
0s | |
0s | |
0s | |
1m 9s | |
4m 13s | |
1s | |
17m 39s |