vllm seems not use GPU - Runpod