I could be wrong, but my assumption is that the class images and the training images are both noised with the same noise scheduler during training, so it shouldn't matter how the class images were generated. That's just an assumption, though.
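
To make that assumption concrete, here is a rough sketch in plain Python of what I mean by "the same noise scheduler". It is not taken from any actual training script: the linear beta schedule, the `make_alpha_bars` and `add_noise` helpers, and the tiny fake "images" are all hypothetical stand-ins. The only point is that the forward-noising step never looks at where an image came from.

```python
import math
import random


def make_alpha_bars(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Cumulative products of (1 - beta_t) for an assumed linear beta schedule."""
    alpha_bars = []
    running = 1.0
    for t in range(num_steps):
        beta = beta_start + (beta_end - beta_start) * t / (num_steps - 1)
        running *= 1.0 - beta
        alpha_bars.append(running)
    return alpha_bars


def add_noise(pixels, noise, t, alpha_bars):
    """Forward diffusion: x_t = sqrt(a_bar_t) * x_0 + sqrt(1 - a_bar_t) * eps."""
    signal_scale = math.sqrt(alpha_bars[t])
    noise_scale = math.sqrt(1.0 - alpha_bars[t])
    return [signal_scale * x + noise_scale * e for x, e in zip(pixels, noise)]


rng = random.Random(0)
alpha_bars = make_alpha_bars()

# Hypothetical stand-ins for real images: flat lists of values in [-1, 1].
instance_image = [rng.uniform(-1, 1) for _ in range(8)]  # "training" image
class_image = [rng.uniform(-1, 1) for _ in range(8)]     # "class" image

# Both kinds of image go through the exact same noising function.
batch = [instance_image, class_image]
noisy_batch = []
for image in batch:
    t = rng.randrange(len(alpha_bars))           # random timestep per image
    noise = [rng.gauss(0.0, 1.0) for _ in image]  # Gaussian noise, same shape
    noisy_batch.append(add_noise(image, noise, t, alpha_bars))
```

If training really works like this, the origin of a class image only matters through its pixel content, not through how it is noised.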