Does anyone use Adafactort for concept training? If so, with what parameters? I found two completely
Does anyone use Adafactort for concept training? If so, with what parameters? I found two completely different arguments on the net, so I'm getting lost in the woods.




