Support for A100 #1
Comments
EDIT: changed |
@jimgao1 Thanks a ton for your help! I was able to compile Habitat successfully on a DGX-A100 system. Additionally, I'm able to run @geoffxy I got your training data for the MLPs and added in training data for Quadro RTX 6000 and was able to train the MLPs to completion. However, this is where things start breaking down. I tried to run a naive
The script finished all the 3 configs for
I was able to profile P.S: Do I need to run |
Hi Suhas, Division by 0:
Unable to record LSTM:
Thanks! |
I forked a copy of habitat to test and add support for A100s here -- https://github.com/suhasjs/habitat
I've appended specs for A100-SXM4-40GB in `analyzer/habitat/data/devices.yml`.
I've also added A100 to `analyzer/habitat/analysis/mlp/devices.csv`.
In addition, I've added A100 to the list of `DEVICES` in `experiments/run_experiment.py`.
I'm unable to run profiling/gather raw data on A100s with these changes.
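As a quick sanity check that the device name was registered in all three places listed above, here is a minimal sketch. The file paths come from this post; the helper `find_missing_registrations` is hypothetical and not part of Habitat. It only does a plain substring search, so it makes no assumption about the YAML or CSV schema.

```python
from pathlib import Path

# Paths as listed in this post. The helper below is a hypothetical
# convenience script, not something shipped with Habitat.
REGISTRATION_FILES = [
    "analyzer/habitat/data/devices.yml",
    "analyzer/habitat/analysis/mlp/devices.csv",
    "experiments/run_experiment.py",
]


def find_missing_registrations(repo_root, device_name, files=REGISTRATION_FILES):
    """Return the files under repo_root that are absent or never mention device_name."""
    missing = []
    for rel in files:
        path = Path(repo_root) / rel
        # Plain substring search: no assumption about each file's format.
        if not path.is_file() or device_name not in path.read_text():
            missing.append(rel)
    return missing
```

Running `find_missing_registrations(".", "A100")` from the repository root should return an empty list once all three files mention the device. An empty list only shows that the name is present, not that the specs are correct.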
I tried `CUDA_VISIBLE_DEVICES=0 bash experiments/gather_raw_data.sh A100` and it failed with errors. The errors are a result of `origin_wave_size` and `origin_occupancy` being set to zero in `calculate_wave_info()` in `resimplified_wave_scaling`.
Do I need to re-train the MLP predictors with A100 profiling data mixed in? I noticed there exists a file `analyzer/habitat/analysis/mlp/train.py`. Is this relevant?
Is this the right way to go about adding a new device type to Habitat? It would be great if you had a write-up on how to add a new device :) I would like to try out Habitat with a couple more device types.
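To illustrate why a zero occupancy breaks wave-based scaling, here is a minimal sketch. It is not Habitat's `calculate_wave_info()`: the function name `wave_count` and its parameters are hypothetical. It relies only on the general CUDA notion that a wave is the set of thread blocks resident on the GPU at once, so wave size is resident blocks per SM times the number of SMs. If occupancy comes back as zero, the wave size is zero and any division by it fails.

```python
import math


def wave_count(num_blocks, blocks_per_sm, num_sms):
    """Number of waves needed to run num_blocks thread blocks.

    Hypothetical illustration, not Habitat's implementation.
    wave size = resident blocks per SM (occupancy) * number of SMs.
    """
    wave_size = blocks_per_sm * num_sms
    if wave_size <= 0:
        # Fail with a readable message instead of a bare ZeroDivisionError.
        raise ValueError(
            f"wave size is {wave_size} "
            f"(blocks_per_sm={blocks_per_sm}, num_sms={num_sms}); "
            "check the specs registered for this device"
        )
    return math.ceil(num_blocks / wave_size)


# An A100 has 108 SMs; the occupancy of 2 blocks per SM is an arbitrary example.
print(wave_count(1000, 2, 108))  # → 5
```

With `blocks_per_sm=0` the sketch raises `ValueError` instead of dividing by zero, which points at the device specs as the place to look first.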