You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Some properties we offer for the compute_architecture_t type used to really be per-arch, but with Volta-vs-Turing, and now Ampere 8.0 vs Ampere 8.6, have really become per-compute-capability. It is confusing and possibly wrong to let people rely on per-architecture values, e.g. if another GPU is added that differs from the per-arch values.
The text was updated successfully, but these errors were encountered:
* Device properties functions now properly support Ampere GPUs (8.0 and 8.6);
* Corrected some device-properties per-compute-capability values.
* Dropped the not-really-per-architecture values in favor of simply per-compute-capability values.
* No explaining where I got the "max in flight threads per SM" value from.
* Comment tweaks
* Exception description tweak for when an architecture is not known to us.
* Device properties functions now properly support Ampere GPUs (8.0 and 8.6);
* Corrected some device-properties per-compute-capability values.
* Dropped the not-really-per-architecture values in favor of simply per-compute-capability values.
* No explaining where I got the "max in flight threads per SM" value from.
* Comment tweaks
* Exception description tweak for when an architecture is not known to us.
* Device properties functions now properly support Ampere GPUs (8.0 and 8.6);
* Corrected some device-properties per-compute-capability values.
* Dropped the not-really-per-architecture values in favor of simply per-compute-capability values.
* No explaining where I got the "max in flight threads per SM" value from.
* Comment tweaks
* Exception description tweak for when an architecture is not known to us.
Some properties we offer for the compute_architecture_t type used to really be per-arch, but with Volta-vs-Turing, and now Ampere 8.0 vs Ampere 8.6, have really become per-compute-capability. It is confusing and possibly wrong to let people rely on per-architecture values, e.g. if another GPU is added that differs from the per-arch values.
The text was updated successfully, but these errors were encountered: