xformers reduces training quality, better to set cross attention optimization to default - Software Engineering Courses (SECourses)