Runpod, 2y ago
JanE

same GPU, different machine -> different speed

The image shows two YOLO object detection runs with an identical setup (same batch size, image size, and number of epochs) on two different Runpod pods. The GPU was an RTX 4090 in both cases.


slow machine
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 535.129.03 Driver Version: 535.129.03 CUDA Version: 12.2 |
|-----------------------------------------+----------------------+----------------------+
|=========================================+======================+======================|
| 0 NVIDIA GeForce RTX 4090 On | 00000000:A1:00.0 Off | Off |


fast machine
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.54.15 Driver Version: 550.54.15 CUDA Version: 12.4 |
|-----------------------------------------+------------------------+----------------------+
| 0 NVIDIA GeForce RTX 4090 On | 00000000:01:00.0 Off | Off |


Training was 30% faster on the fast machine, and its power consumption was lower.

(1) Is this only due to the driver being newer?
(2) Would the effect be the same for an older GPU, like the A100?
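
To narrow down question (1), it may help to compare more than the driver version. Below is a minimal Python sketch, not a definitive diagnosis: it parses the two nvidia-smi header lines quoted above and builds a query command for clocks, power draw, temperature, and PCIe link state that could be run on both pods during training. The helper names (`parse_header`, `diff`, `query_command`) are hypothetical. The field names are ones I believe `nvidia-smi --query-gpu` accepts; `nvidia-smi --help-query-gpu` lists what a given driver supports.

```python
import re

# Header lines copied from the two nvidia-smi outputs in this post.
SLOW = "| NVIDIA-SMI 535.129.03 Driver Version: 535.129.03 CUDA Version: 12.2 |"
FAST = "| NVIDIA-SMI 550.54.15 Driver Version: 550.54.15 CUDA Version: 12.4 |"

# Properties worth logging on both pods while training is running.
QUERY_FIELDS = [
    "name",
    "driver_version",
    "clocks.sm",
    "clocks.mem",
    "power.draw",
    "temperature.gpu",
    "pcie.link.gen.current",
    "pcie.link.width.current",
]


def parse_header(line):
    """Extract the driver and CUDA versions from an nvidia-smi header line."""
    match = re.search(r"Driver Version:\s*(\S+)\s+CUDA Version:\s*(\S+)", line)
    if match is None:
        raise ValueError("not an nvidia-smi header line: " + line)
    return {"driver": match.group(1), "cuda": match.group(2)}


def diff(a, b):
    """Return only the keys whose values differ, as (a_value, b_value) pairs."""
    return {key: (a[key], b[key]) for key in a if a[key] != b[key]}


def query_command(fields=QUERY_FIELDS):
    """Build the nvidia-smi command line (as an argv list) for the given fields."""
    return ["nvidia-smi", "--query-gpu=" + ",".join(fields), "--format=csv,noheader"]


if __name__ == "__main__":
    print(diff(parse_header(SLOW), parse_header(FAST)))
    print(" ".join(query_command()))
```

If the clocks, temperature, and PCIe link readings match on both pods, the difference may instead come from the host (CPU, disk, data loading) or from the driver itself. This sketch alone cannot separate those causes.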
image.png