How does multi-GPU training work anyway?
Each GPU has a copy of the model being trained and performs training steps independently, so how do we not end up with two separate models?
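
To make the question concrete, here is a toy sketch of what I understand synchronous data-parallel training to do. I'm assuming the gradients are averaged across GPUs before every weight update; plain Python lists stand in for the GPUs, and the names `grad`, `all_reduce_mean`, and `train` are made up for illustration, not taken from any library.

```python
# Toy simulation of synchronous data-parallel training on two "GPUs".
# Plain Python stands in for real devices; all names are illustrative.

def grad(w, shard):
    # Gradient of the mean squared error for the model y = w * x on one shard.
    return sum(2 * (w * x - y) * x for x, y in shard) / len(shard)

def all_reduce_mean(values):
    # Stand-in for an all-reduce collective: every replica gets the same average.
    mean = sum(values) / len(values)
    return [mean for _ in values]

def train(shards, w0=0.0, lr=0.05, steps=50):
    # Every replica starts from the same initial weight.
    weights = [w0 for _ in shards]
    for _ in range(steps):
        # Each replica computes a gradient on its own shard, so these differ.
        local = [grad(w, s) for w, s in zip(weights, shards)]
        # After averaging, every replica holds the identical gradient.
        synced = all_reduce_mean(local)
        # Same starting weight plus same gradient gives the same new weight.
        weights = [w - lr * g for w, g in zip(weights, synced)]
    return weights

# Two shards of data drawn from y = 2 * x, one shard per replica.
shards = [[(1.0, 2.0), (2.0, 4.0)], [(3.0, 6.0), (4.0, 8.0)]]
weights = train(shards)
print(weights)
```

If that assumption holds, the replicas never drift apart: only the gradient computation is independent, while the update itself is identical everywhere, so there is effectively one model stored in several places.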