R
Runpod16mo ago
Laikh

Unable to start pod with MI300x

Observing "hang" when starting pod with 8xMI300x, screenshot attached. Any ideas on how to fix this?
No description
3 Replies
yhlong00000
yhlong0000016mo ago
I am able to run with 8xMI300X using official templates, i am wondering if something related to your image?
Unknown User
Unknown User16mo ago
Message Not Public
Sign In & Join Server To View
Laikh
LaikhOP16mo ago
Gotcha -- this was the image that was used: https://hub.docker.com/r/eliovp/rocm6.1.2_py3.10_torch2.5_vllm0.5_bkc Using the official rocm pytorch images from runpod seems to work. Thanks!

Did you find this page helpful?