serverless multi-gpu

Hi, in the serverless endpoint console I'm seeing that you can't have a serverless multi-GPU endpoint except for 2x A40. Is this correct? So essentially the serverless product is only for smaller models?
Solution:
we are slowly allowing more as we get more available capacity
5 Replies
ashleyk · 3mo ago
It's supported by the 48 GB tier, which includes the A40 and A6000, not just the A40.
asherisaac · 3mo ago
Thanks. Why can I only select 2 GPUs per instance in the 48 GB tier? Is there no option to do 4x 48 GB or similar for serverless?
ashleyk · 3mo ago
To prevent one customer from using all available capacity and leaving none for other customers.
Solution
flash-singh · 3mo ago
we are slowly allowing more as we get more available capacity