I've tried Florence2 for a training and it went good, was setting up another Lora today but this tim
I've tried Florence2 for a training and it went good, was setting up another Lora today but this time I actually read the captions first and noticed Florence got some key details very wrong, which could possibly mess up the Lora a bit so I dunno. I wonder if Llava is better or there's a free local Captioning tool that is a lot more consistent

