LORA training 🤍 #11

4lt3r3go opened this issue Dec 15, 2024 · 3 comments

4lt3r3go commented Dec 15, 2024

Is there any kind soul willing to write a detailed guide on how to do training on Windows for both Hunyuan and LTX?
I've been trying for three days, but unfortunately, I don't understand how to use WSL at all.


PGCRT commented Dec 20, 2024

It's pretty simple. Once WSL2 is installed, create a folder for the training, for example:

E:\HunyuanTrain

Run the Ubuntu console and navigate to this folder:

cd /mnt/e/HunyuanTrain

Info:
Windows drives are typically mounted under /mnt. For example, the E: drive is accessible at /mnt/e.
Note that all paths need to be in Linux format, including the paths inside hunyuan_video.toml and dataset.toml.
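If you're unsure how a Windows path maps into WSL, the wslpath utility that ships with WSL converts it for you:

wslpath 'E:\HunyuanTrain'
# prints: /mnt/e/HunyuanTrain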

Example dataset.toml:

[[directory]]
path = '/mnt/e/Dataset/img/1_faces'
num_repeats = 2

[[directory]]
path = '/mnt/e/Dataset/img/2_eyes'
num_repeats = 5

Example hunyuan_video.toml:

transformer_path = '/mnt/c/ComfyUI_windows_portable/ComfyUI/models/diffusion_models/hunyuan_video_720_fp8_e4m3fn.safetensors'
vae_path = '/mnt/c/ComfyUI_windows_portable/ComfyUI/models/vae/hunyuan_video_vae_bf16.safetensors'
llm_path = '/mnt/c/ComfyUI_windows_portable/ComfyUI/models/LLM/llava-llama-3-8b-text-encoder-tokenizer'
clip_path = '/mnt/c/ComfyUI_windows_portable/ComfyUI/models/clip/clip-vit-large-patch14'

Clone and install the diffusion-pipe repo. If you manage to install it and its requirements without issues, you are good to go.
If you get errors (mostly mismatched or missing dependencies), the easiest way to fix them is to copy-paste the terminal error into ChatGPT to troubleshoot it.
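A minimal install sketch, assuming the upstream tdrussell/diffusion-pipe repository and a Python venv (the URL and the --recurse-submodules flag are assumptions; check the repo's README for the exact steps):

cd /mnt/e/HunyuanTrain
git clone --recurse-submodules https://github.com/tdrussell/diffusion-pipe   # assumed repo URL
cd diffusion-pipe
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt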

Once it's properly installed, all you have to do is edit dataset.toml and hunyuan_video.toml.
Then start the training by reopening the Ubuntu console:

Navigate to your trainer path, example:
cd /mnt/e/HunyuanTrain/diffusion-pipe

Activate the venv:
source venv/bin/activate

Start training using the following command:
NCCL_P2P_DISABLE="1" NCCL_IB_DISABLE="1" deepspeed --num_gpus=1 train.py --deepspeed --config examples/hunyuan_video.toml
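To avoid retyping these steps every run, they can be wrapped in a small script (a hypothetical train.sh that only reuses the paths and command above):

#!/usr/bin/env bash
# Launch diffusion-pipe LoRA training for Hunyuan Video inside WSL2
cd /mnt/e/HunyuanTrain/diffusion-pipe
source venv/bin/activate
NCCL_P2P_DISABLE="1" NCCL_IB_DISABLE="1" deepspeed --num_gpus=1 train.py --deepspeed --config examples/hunyuan_video.toml

Run it with: bash train.sh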

@theycallmeloki

Where do we provide the trigger word? I don't see a provision for it anywhere in the config 🤔


PGCRT commented Dec 25, 2024

The trigger word is defined by the .txt captions. For example, if you have photos of a dog and the captions include the word "dog," that word will serve as the trigger word. You can also use special characters or modify the trigger word (e.g., "d0g"), placing it at the beginning of the caption, to get better accuracy relative to your dataset during inference.
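For example, a caption layout might look like this (file names are hypothetical; each image gets a .txt caption next to it with the same base name):

/mnt/e/Dataset/img/1_faces/photo001.png
/mnt/e/Dataset/img/1_faces/photo001.txt

with photo001.txt containing something like:

d0g, a close-up photo of a dog sitting in the grass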

Of course, this can be automated.

2024-12-25.19-42-38.mp4 (video attachment)

You can download the workflow here if you want:
workflow - 2024-12-25T194346 238 (attachment)
