Ulibach
RunPod
•Created by Ulibach on 5/17/2025 in #⚡|serverless
Slow model loading times with vLLM
I deployed a vLLM worker (version 0.8.5) from the web UI and attached a network storage.
The model is a fine-tuned Gemma 3.
INFO 05-17 20:09:56 [loader.py:458] Loading weights took 113.32 seconds
INFO 05-17 20:09:56 [model_runner.py:1140] Model loading took 23.3141 GiB and 160.792180 seconds
Is this normal? Total loading time is about 160 s.
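For reference, the two log lines can be turned into a rough effective load rate. The sketch below parses them and divides the reported size by the weight-loading time. It assumes the 23.3141 GiB figure (which vLLM reports as memory taken by the model) is close to the amount of data read from storage, which may not hold exactly. Under that assumption the rate comes out to roughly 0.2 GiB/s, with about 47 s of the total spent outside weight loading.

```python
import re

# The two log lines from the post, copied verbatim.
LOG_LINES = [
    "INFO 05-17 20:09:56 [loader.py:458] Loading weights took 113.32 seconds",
    "INFO 05-17 20:09:56 [model_runner.py:1140] Model loading took 23.3141 GiB and 160.792180 seconds",
]


def parse_load_stats(lines):
    """Extract (weights_seconds, size_gib, total_seconds) from vLLM log lines."""
    weights_s = size_gib = total_s = None
    for line in lines:
        m = re.search(r"Loading weights took ([\d.]+) seconds", line)
        if m:
            weights_s = float(m.group(1))
        m = re.search(r"Model loading took ([\d.]+) GiB and ([\d.]+) seconds", line)
        if m:
            size_gib = float(m.group(1))
            total_s = float(m.group(2))
    return weights_s, size_gib, total_s


weights_s, size_gib, total_s = parse_load_stats(LOG_LINES)

# Assumption: reported model size approximates the bytes read from storage.
throughput_mib_s = size_gib * 1024 / weights_s
# Time in model loading that was not spent reading weights.
other_s = total_s - weights_s

print(f"effective weight load rate: {throughput_mib_s:.1f} MiB/s")
print(f"time outside weight loading: {other_s:.1f} s")
```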
Could this be a disk I/O issue?
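One way to check the disk I/O theory is to time a plain sequential read of one weight file from the attached storage and compare the rate with what the vLLM logs imply. This is only a minimal sketch: `measure_read_throughput` is a hypothetical helper, the file path is passed on the command line because the actual mount point and file names are not known here, and a second run on the same file may be served from the OS page cache and look much faster than the storage really is.

```python
import sys
import time


def measure_read_throughput(path, chunk_size=16 * 1024 * 1024):
    """Read a file sequentially and return (bytes_read, elapsed_seconds)."""
    total = 0
    start = time.perf_counter()
    # buffering=0 avoids Python-level buffering; the OS page cache still applies.
    with open(path, "rb", buffering=0) as f:
        while True:
            chunk = f.read(chunk_size)
            if not chunk:
                break
            total += len(chunk)
    elapsed = time.perf_counter() - start
    return total, elapsed


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python read_test.py <path to one weight file on the network storage>
    nbytes, seconds = measure_read_throughput(sys.argv[1])
    mib = nbytes / (1024 * 1024)
    rate = mib / seconds if seconds > 0 else float("inf")
    print(f"read {mib:.1f} MiB in {seconds:.2f} s ({rate:.1f} MiB/s)")
```

If the measured rate is close to the rate implied by the logs, the storage is the likely bottleneck; if it is much higher, the time is probably going elsewhere.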