Is there any point in modifying LR with adafactor? 22 hours of training and 4400 steps later, I have
Is there any point in modifying LR with adafactor? 22 hours of training and 4400 steps later, I have the impression that my final samples are still undertrained.
