Investigate outlier in GPU memory usage with 2.6 (train): Hermit + cvf_diet_responset2t
#9516
Labels
type:enhancement ✨
Additions of new features or changes to existing ones, should be doable in a single PR
Followup to #9433
As seen [here| https://docs.google.com/spreadsheets/d/14rIvQLxEdp1PJDqhM65n0__THKhvS5f4XPLoWz8SWSs/edit#gid=1230100797], there is a huge increase to GPU memory usage for train on Hermit with
cvf_diet_responset2t
. This is an outlier, and should be investigated.Definition of Done (not necessarily in this order)
nvidia-smi
logs to narrow down the increaseDepending on what we learn from investigation create the appropriate issue:
The text was updated successfully, but these errors were encountered: