yes, continue training if you don't see model sanity degradation, you can get better results compare

yes, continue training if you don't see model sanity degradation, you can get better results compared to lora at 18-20k steps or more, if your use case require perfect resemblance at the cost of lower model sanity
Was this page helpful?