@Dr. Furkan Gözükara have you tested xformers/gradient checkpointing as well as Memory Efficient Attention? I am running without gradient checkpointing and xformers, but I still have memory efficient attention on. Not sure if it decreases quality?






