How do I run a quantized model on Runpod serverless? I'd like to run the 4/8-bit version of this model:
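
For reference, here is a minimal sketch of one way this could work, assuming the Runpod Python serverless SDK and on-the-fly quantization with transformers + bitsandbytes; `MODEL_ID` is a placeholder for whichever model the question refers to, and the handler payload shape (`job["input"]["prompt"]`) is an assumption:

```python
# Minimal Runpod serverless handler sketch: loads a causal LM quantized
# to 4-bit with bitsandbytes (swap in load_in_8bit=True for the 8-bit
# variant). MODEL_ID is a placeholder, not a specific recommendation.
import runpod
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

MODEL_ID = "your-org/your-model"  # placeholder for the model in question

# 4-bit NF4 quantization; for 8-bit use BitsAndBytesConfig(load_in_8bit=True)
quant_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

# Load once at module import so warm invocations reuse the weights.
tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    quantization_config=quant_config,
    device_map="auto",
)

def handler(job):
    """Runpod calls this per request; job["input"] carries the payload."""
    prompt = job["input"]["prompt"]
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(**inputs, max_new_tokens=256)
    return {"text": tokenizer.decode(output[0], skip_special_tokens=True)}

runpod.serverless.start({"handler": handler})
```

This would be packaged into a Docker image (with torch, transformers, bitsandbytes, and runpod installed) and deployed as a Runpod serverless endpoint; an alternative is to pre-quantize the weights (e.g. GPTQ/AWQ) so cold starts skip the quantization step.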