How do you think the results would turn out if we mixed TEST 3 and TEST? E.g.: Mem Attention=default, optimizer=Lion, LR=2e-7?