DeepSeek R1 Serverless for coding

I'm interested in running an FP16 DeepSeek R1 and I am wondering if Serverless is the way to go or if a Pod would be better. I need this for 2-3 hours at a time and I would like a 'dedicated' access to this environment. Which DeepSeek R1 model should I pick (GGUF?) and how should I configure the deployment tool in Serverless to get it to run on an H100? Thanks in advance for any help.
6 Replies
lsdvaibhavvvv
lsdvaibhavvvv9mo ago
Hi i am also trying to host a deepseek r1 on serverless. It fails at the endpoint level
<MarDev/>
<MarDev/>9mo ago
Yo bro, did you found a solution ?
MindDragon
MindDragonOP9mo ago
I don't think that anyone did. I'm trying a full pod of 4080's (5x) ... idk what else to do
lsdvaibhavvvv
lsdvaibhavvvv9mo ago
Here is the detail error
No description
lsdvaibhavvvv
lsdvaibhavvvv9mo ago
Now i am trying to host using docker container, and would manually do the needful on the server side. Let's see.
Unknown User
Unknown User9mo ago
Message Not Public
Sign In & Join Server To View

Did you find this page helpful?