I associated bigger with faster. He said his checkpoint was taking too much time to train, but I ass
I associated bigger with faster. He said his checkpoint was taking too much time to train, but I assumed if I use 0.0004 instead of 0.000004, I train faster.








