Hi guys, my team is interested in using RunPod for multinode training. We are looking for 24-96 a100s for larger scale model training. Do you guys currently support this?
We have a cluster feature that supports multiple nodes currently in progress. Feel free to open a support ticket, and we’ll keep you updated as soon as it’s ready.