Request for AdamW8bit support on CPU (would help TorchTune) #1226
Comments
I don't recall seeing any optimizers implemented yet for CPU, but I may be mistaken. A paged optimizer doesn't make sense to me for CPU, but I can understand the request for AdamW8bit.
Thanks for pointing that out, @matthewdouglas! I've revised the description. @jianan-gu @Xia-Weiwen, please clarify if you had added
@sanchitintel Yes, we are going to do it. cc. @jianan-gu @jiqing-feng
@sanchitintel thanks for raising this. When is the next torchtune release foreseen? Hmm, the problem is that the device abstraction / dispatcher situation is still not stable. Things will change fundamentally in the next 3 weeks. Not sure if this can be done as a PR to
Feature request
Port AdamW8bit support for CPU from the multi-backend-refactor branch to the main branch.
Motivation
Public cloud providers' machines with GPUs are usually expensive while datacenter-grade CPUs are more readily available at lower prices. Towards the goal of making Deep Learning more accessible to developers & learners, the ability to finetune with AdamW8bit on CPU seems like a good milestone.
TorchTune is currently unable to support full fine-tuning on CPU with AdamW8bit because it uses bitsandbytes' AdamW8bit optimizer. #898 enabled AdamW8bit for CPU in the multi-backend-refactor branch, but the main branch doesn't have it.
It'd be great if we could enable AdamW8bit for CPU in the bitsandbytes main branch before TorchTune's next release (provided there would be a bitsandbytes release before that), so that users who'd install TorchTune would automatically end up installing a version of bitsandbytes that'd support AdamW8bit on CPU.
Thanks!
Your contribution
@jianan-gu could port over his code from the multi-backend-refactor branch to the main branch.
cc @mingfeima @ashokei @TimDettmers