We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.
You must be logged in to block users.
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
I'm Hanze Dong.
I work on machine learning research.
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
Python 8.3k 827
A recipe for online RLHF and online iterative DPO.
Python 468 51
Visualization of mean field and neural tangent kernel regime
Jupyter Notebook 20 2
Python 3