actually i tihnk training with higher batch sizes may be ok, but looks like learning rate is still b

actually i tihnk training with higher batch sizes may be ok, but looks like learning rate is still better to be kept at 2e-6
Was this page helpful?