[FEA] Allow to customize cooperative group size (`tile_size`) for `static_map` #194

ttnghia · 2022-07-23T02:12:25Z

Currently, static_map internally sets a fixed number tile_size = 4. Such tile_size value is used when calling the insert or contains APIs. The value tile_size = 4 is not an optimal one, and may cause performance regression on some (if not most) systems as I have tested myself. For example, setting tile_size = 2 would double the performance when running on my system.

It would be great if we can have a way to specify tile_size upon constructing the static_map object, similar to when we construct a static_multimap.

The text was updated successfully, but these errors were encountered:

sleeepyjack · 2022-07-26T19:02:35Z

Probably a candidate for #110. We could also explore having dynamic CG sizes, e.g., CG=1 for when the table occupancy is low and then use a wider group once the table fills up.

sleeepyjack · 2022-07-26T19:19:28Z

@ttnghia I'm curious, what architecture did you run your benchmarks on?

ttnghia · 2022-07-26T19:20:47Z

I'm running on RTX Quadro 6000, SM75.

jrhemstad · 2022-07-27T02:06:22Z

@sleeepyjack I'm guessing it's a difference of GDDR vs HBM. Larger tile_size is better on HBM vs GDDR.

sleeepyjack · 2022-07-27T02:22:57Z

@jrhemstad I was thinking the same thing. A long time ago, I dreamed about having a compile time lookup table for choosing the optimal (default) CG size for a given architecture in WarpCore. Sounds wild, but hey, why not?

jrhemstad · 2022-07-27T12:36:02Z

@jrhemstad I was thinking the same thing. A long time ago, I dreamed about having a compile time lookup table for choosing the optimal (default) CG size for a given architecture in WarpCore. Sounds wild, but hey, why not?

That wouldn't be too hard. It would be similar to how CUB does its device specific tuning policies.

PointKernel · 2024-12-06T00:10:30Z

Completed in the new implementation

ttnghia added the type: feature request New feature request label Jul 23, 2022

PointKernel added the P1: Should have Necessary but not critical label Jul 25, 2022

PointKernel added this to the Refactor Open Address Data Structures milestone Oct 5, 2022

PointKernel closed this as completed Dec 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Allow to customize cooperative group size (`tile_size`) for `static_map` #194

[FEA] Allow to customize cooperative group size (`tile_size`) for `static_map` #194

ttnghia commented Jul 23, 2022 •

edited

Loading

sleeepyjack commented Jul 26, 2022

sleeepyjack commented Jul 26, 2022 •

edited

Loading

ttnghia commented Jul 26, 2022

jrhemstad commented Jul 27, 2022

sleeepyjack commented Jul 27, 2022 •

edited

Loading

jrhemstad commented Jul 27, 2022

PointKernel commented Dec 6, 2024

[FEA] Allow to customize cooperative group size (tile_size) for static_map #194

[FEA] Allow to customize cooperative group size (tile_size) for static_map #194

Comments

ttnghia commented Jul 23, 2022 • edited Loading

sleeepyjack commented Jul 26, 2022

sleeepyjack commented Jul 26, 2022 • edited Loading

ttnghia commented Jul 26, 2022

jrhemstad commented Jul 27, 2022

sleeepyjack commented Jul 27, 2022 • edited Loading

jrhemstad commented Jul 27, 2022

PointKernel commented Dec 6, 2024

[FEA] Allow to customize cooperative group size (`tile_size`) for `static_map` #194

[FEA] Allow to customize cooperative group size (`tile_size`) for `static_map` #194

ttnghia commented Jul 23, 2022 •

edited

Loading

sleeepyjack commented Jul 26, 2022 •

edited

Loading

sleeepyjack commented Jul 27, 2022 •

edited

Loading