LORA training 🤍 #11
Comments
It's pretty simple. Once WSL2 is installed, create a folder for the training, such as "E:\HunyuanTrain". Then run the Ubuntu console and navigate to this folder: "cd /mnt/e/HunyuanTrain".
Info: Windows drives are typically mounted under /mnt. For example, the E: drive is accessible at /mnt/e.
Note that all paths need to be in Linux format, including the ones in hunyuan_video.toml and dataset.toml.
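If you have many Windows paths to convert, the drive-letter rule described above can be sketched in a few lines of Python. This is only an illustration: the helper name `windows_to_wsl` is made up for this example, and it assumes the default WSL setup where drives are mounted under /mnt.

```python
from pathlib import PureWindowsPath


def windows_to_wsl(path: str) -> str:
    """Convert a drive-letter Windows path to its /mnt/<drive>/... form.

    Assumes the default WSL mount point (/mnt); hypothetical helper.
    """
    p = PureWindowsPath(path)
    drive = p.drive  # e.g. "E:"
    if len(drive) != 2 or drive[1] != ":":
        raise ValueError(f"expected a drive-letter path, got {path!r}")
    # p.parts[0] is the drive/root part; the remaining parts are folder names.
    return "/".join(["/mnt", drive[0].lower(), *p.parts[1:]])


print(windows_to_wsl(r"E:\HunyuanTrain"))
```

The result can be pasted directly into dataset.toml or hunyuan_video.toml.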
Example dataset.toml:
[[directory]]
path = '/mnt/e/Dataset/img/1_faces'
num_repeats = 2
[[directory]]
path = '/mnt/e/Dataset/img/2_eyes'
num_repeats = 5
Example hunyuan_video.toml:
transformer_path = '/mnt/c/ComfyUI_windows_portable/ComfyUI/models/diffusion_models/hunyuan_video_720_fp8_e4m3fn.safetensors'
vae_path = '/mnt/c/ComfyUI_windows_portable/ComfyUI/models/vae/hunyuan_video_vae_bf16.safetensors'
llm_path = '/mnt/c/ComfyUI_windows_portable/ComfyUI/models/LLM/llava-llama-3-8b-text-encoder-tokenizer'
clip_path = '/mnt/c/ComfyUI_windows_portable/ComfyUI/models/clip/clip-vit-large-patch14'
Clone and install the diffusion-pipe repo. If it and its requirements install without issues, you are good to go. Once it is properly installed, all you have to do is edit dataset.toml and hunyuan_video.toml. Then navigate to your trainer path, activate the venv, and start training using the given command.
Where do we provide the trigger word? I don't see a provision for it anywhere in the config 🤔
The trigger word is defined by the .txt captions. For example, if you have photos of a dog and the captions include the word "dog", that word will serve as the trigger word. You can also alter the trigger word with special characters (e.g., "d0g") and place it at the beginning of each caption to achieve better accuracy relative to your dataset during inference. Of course, this can be automated.
(Attached video: 2024-12-25.19-42-38.mp4)
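As one possible way to automate this, here is a minimal Python sketch that puts a trigger word at the start of every caption file in a folder. It is an assumption-laden illustration, not the method shown in the video: the function name `prepend_trigger` is hypothetical, captions are assumed to be UTF-8 .txt files sitting directly in the dataset folder, and the trigger is joined to the existing caption with a comma.

```python
from pathlib import Path


def prepend_trigger(dataset_dir: str, trigger: str) -> int:
    """Prepend `trigger` to each .txt caption that does not already start with it.

    Returns the number of caption files that were modified.
    """
    changed = 0
    for caption_file in sorted(Path(dataset_dir).glob("*.txt")):
        text = caption_file.read_text(encoding="utf-8").strip()
        if text.startswith(trigger):
            continue  # already tagged, leave the file untouched
        new_text = f"{trigger}, {text}" if text else trigger
        caption_file.write_text(new_text, encoding="utf-8")
        changed += 1
    return changed


# Example (path taken from the dataset.toml above):
# prepend_trigger("/mnt/e/Dataset/img/1_faces", "d0g")
```

Running it twice is harmless, since captions that already begin with the trigger are skipped. Back up the captions first, because the files are rewritten in place.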
Is there any kind soul willing to write a detailed guide on how to do training on Windows for both Hunyuan and LTX?
I've been trying for three days, but unfortunately, I don't understand how to use WSL at all.