Why is 125M from facebook loading into VLLM quickdeploy even though another model is specified? - Runpod