Who to message to increase 80GB serverless endpoint to 3 GPUs/worker instead of 2?
Whenever I try to increase my serverless endpoint to more than 2x GPU/worker for the 80GB cards, it is grayed out. 8x 48GB cards does not fit 405b models but during testing 4x 80GB cards do.
Continue the conversation
Join the Discord to ask follow-up questions and connect with the community
R
Runpod
We're a community of enthusiasts, engineers, and enterprises, all sharing insights on AI, Machine Learning and GPUs!