Drop per-architecture properties which are actually per-compute-capability #293

eyalroz · 2022-01-14T11:01:31Z

Some of the properties we offer for the compute_architecture_t type used to be genuinely per-architecture, but with Volta vs. Turing, and now Ampere 8.0 vs. Ampere 8.6, they have effectively become per-compute-capability. Letting people rely on per-architecture values is confusing and possibly wrong, e.g. if another GPU is added whose values differ from the per-architecture ones.

* Device properties functions now properly support Ampere GPUs (8.0 and 8.6).
* Corrected some per-compute-capability device property values.
* Dropped the not-really-per-architecture values in favor of plain per-compute-capability values.
* Now explaining where I got the "max in flight threads per SM" value from.
* Comment tweaks.
* Exception description tweak for when an architecture is not known to us.
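To illustrate why a per-architecture value can mislead, here is a minimal sketch of a lookup keyed on the full compute capability. It is not the library's actual code: the `compute_capability` struct and both function names are hypothetical, made up for this example. The values are the maximum resident warps per multiprocessor as I understand them from NVIDIA's published per-compute-capability specifications; note how 7.0 vs. 7.5 and 8.0 vs. 8.6 differ even though each pair shares a major version.

```cpp
#include <optional>
#include <stdexcept>
#include <string>

// Hypothetical stand-in for a compute capability type (not the library's own).
struct compute_capability {
    unsigned major_number;
    unsigned minor_number;
};

// Maximum number of resident warps per multiprocessor, keyed on the full
// compute capability rather than on the architecture (major number) alone.
// Only the capabilities discussed in this issue are listed; anything else
// yields an empty optional.
inline std::optional<unsigned> max_resident_warps_per_sm(compute_capability cc)
{
    switch (cc.major_number * 10 + cc.minor_number) {
    case 70: return 64; // Volta
    case 75: return 32; // Turing
    case 80: return 64; // Ampere, 8.0
    case 86: return 48; // Ampere, 8.6
    default: return std::nullopt;
    }
}

// Variant which throws, with a descriptive message, when the compute
// capability is not known to this table.
inline unsigned checked_max_resident_warps_per_sm(compute_capability cc)
{
    auto result = max_resident_warps_per_sm(cc);
    if (!result) {
        throw std::invalid_argument(
            "No known value for compute capability "
            + std::to_string(cc.major_number) + "."
            + std::to_string(cc.minor_number));
    }
    return *result;
}
```

A table keyed only on the major version would have to pick a single value for 8.x, which would be wrong for either 8.0 or 8.6.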

eyalroz self-assigned this Jan 14, 2022

eyalroz added the task label Jan 14, 2022

eyalroz added the resolved-on-development label Jan 14, 2022

eyalroz closed this as completed Jan 14, 2022
