Anyone had success in LORA training flux1? (not on an h100) #472
pinballelectronica started this conversation in General
Replies: 1 comment 1 reply
-
Nothing you're describing adds up for 2x RTX 4090s :D Even 8 GB GPUs can train the FLUX dev model, and from 16 GB up you can train at the best quality. A single RTX 4090 trains very fast, and 2x trains even faster. For LoRA, an RTX 4090 trains at a decent speed, and full fine-tuning works as well.
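For context, a rough back-of-the-envelope sketch of the VRAM needed just to hold the FLUX.1-dev transformer weights at common precisions (the ~12B parameter count and per-parameter byte sizes are assumptions; real training also needs activations, gradients, and optimizer state on top of this):

```python
# Rough VRAM to hold ~12B transformer weights at common precisions.
# Assumption: 12e9 parameters for the FLUX.1-dev transformer.
PARAMS = 12e9

bytes_per_param = {
    "fp32": 4.0,       # full precision
    "fp16/bf16": 2.0,  # half precision
    "fp8": 1.0,
    "nf4": 0.5,        # 4-bit NormalFloat (quantized)
}

for name, b in bytes_per_param.items():
    gb = PARAMS * b / 1024**3
    print(f"{name:10s} ~{gb:5.1f} GB")
```

This is why fp16/bf16 weights alone (~22 GB) crowd a 24 GB 4090, while 4-bit quantization leaves plenty of headroom, at the cost of the quality loss mentioned below.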
-
Curious what settings people have used to actually get a practical training speed on a consumer GPU: floats, optimizers, ranks, alphas. My go-to optimizer is Prodigy, but given its ramp-up time it may not be ideal here. My 2x 4090s are no match for float16 across the board: 95 steps a NIGHT at very conservative values :) I hear nf4 produces very poor output, so I'm trying to find a sweet spot. Thank you.
This image took ~7 minutes at FP16.