when training lora did you notice much difference between adafactor and adamw8bit?

when training lora did you notice much difference between adafactor and adamw8bit?
Was this page helpful?