Skip to content

Commit

Permalink
[GPU] Fix lws calculation for reorder_kernel_bfyx_to_blocked_format k…
Browse files Browse the repository at this point in the history
…ernel
  • Loading branch information
Lyamin-Roman committed Jul 31, 2024
1 parent 11c0189 commit 514e9fe
Showing 1 changed file with 1 addition and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -143,7 +143,7 @@ static std::vector<size_t> GetBestLwsFromGws(const reorder_params& params, const
std::vector<size_t> dims{ 0, 1, 2 };

// SLM size: elemsize * tile_width * tile_width * work_items <= 64K
const size_t elem_size = params.inputs[0].ElementSize();
const size_t elem_size = params.outputs[0].ElementSize();
const size_t max_local_mem_size = params.engineInfo.maxLocalMemSize;
const size_t max_work_group_size = params.engineInfo.maxWorkGroupSize;
size_t max_num_work_items = std::min(max_work_group_size, max_local_mem_size / (elem_size * tile_width * tile_size));
Expand Down

0 comments on commit 514e9fe

Please sign in to comment.