since we cant train the text encoder right now, it makes no sense to caption the training dataset wi

since we cant train the text encoder right now, it makes no sense to caption the training dataset with tags like ohwx right? what should be used instead?
Was this page helpful?