For the last few days I have been trying to run a training job on 8x A100 SXM GPUs, which I usually have no problem provisioning on Runpod. Since the AWS outage, however, there never seem to be more than 2 or 3 GPUs available at any one time in EUR-IS-1.
Is there a problem with the GPUs? Why have they been so scarce recently?