vLLM Inconsistently Hangs at NCCL Initialization - Runpod
Runpod • 11mo ago • 2 replies
maple
Hi, I am trying to run vLLM on 2x A40 GPUs, and it sometimes hangs at NCCL initialization. This happens inconsistently: on some pods it works fine. But on a pod where it hangs, repeated attempts always hang.
CUDA 12.4.1
python 3.10
vllm 0.7.3
command:
vllm serve unsloth/Meta-Llama-3.1-8B --tensor-parallel-size 2
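To narrow down where the hang happens, one option is to rerun the same command with NCCL's verbose logging turned on. The sketch below is not from the original post: `nccl_debug_run` is a hypothetical helper name, and the choice of the `INIT` and `P2P` logging subsystems is an assumption about what is most relevant here. `NCCL_DEBUG` and `NCCL_DEBUG_SUBSYS` are standard NCCL environment variables.

```shell
# Hypothetical helper (not from the original post): run any command with
# verbose NCCL logging enabled, so the log shows how far initialization gets
# before the process stalls.
nccl_debug_run() {
  # NCCL_DEBUG=INFO prints NCCL version and setup messages.
  # NCCL_DEBUG_SUBSYS limits the output to the chosen subsystems
  # (INIT and P2P are an assumption about what matters for this hang).
  env NCCL_DEBUG=INFO NCCL_DEBUG_SUBSYS=INIT,P2P "$@"
}

# Usage with the command from the post (left commented out on purpose):
# nccl_debug_run vllm serve unsloth/Meta-Llama-3.1-8B --tensor-parallel-size 2
```

Comparing the last NCCL log lines from a pod that hangs with those from a pod that works may show which step differs. If the log stops around peer-to-peer setup, rerunning with `NCCL_P2P_DISABLE=1` is a common experiment, though whether it applies to this case is an assumption.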