serverless multi-gpu

Hi, in the serverless endpoint console I'm seeing that you can't have a serverless multi-gpu endpoint except for 2x A40? Is this correct? So essentially the serverless product is only for smaller models?
Solution:
we are slowly allowing more as we get more available capacity
Jump to solution
5 Replies
ashleyk
ashleyk14mo ago
No description
ashleyk
ashleyk14mo ago
Its supported by the 48GB tier which includes A40 and A6000, not just A40.
asherisaac
asherisaacOP14mo ago
Thanks. Why can I only select 2 GPU's per instance in the 48gb tier? There is no option to do 4x 48gb or whatever for serverless?
ashleyk
ashleyk14mo ago
To prevent someone from using all available capacity and leaving no capacity for the other customers.
Solution
flash-singh
flash-singh14mo ago
we are slowly allowing more as we get more available capacity

Did you find this page helpful?