Test multi-gpu setup #10
Open
Labels
eval: Changes related to the eval pipeline
good first issue: Good for newcomers
training: Changes related to training pipeline
Description
Spin up an instance on the cloud with GPU and test that the training/eval pipeline works with multiple GPUs.
Start by changing --num_gpus=1 to 2. Understand and attempt to fix any basic errors encountered; otherwise close the issue and report the errors.
Multi-GPU training would allow us to:
Checklist before resolving issue
[...] main and fixed any conflicts
[...] dry_run.sh on a CUDA device successfully
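Before changing --num_gpus on a fresh cloud instance, it may help to confirm that the machine actually exposes as many GPUs as you plan to request. The following is a minimal sketch, not part of this repository: the function names `count_visible_gpus` and `check_gpu_request` are hypothetical, and it assumes an NVIDIA driver install where `nvidia-smi -L` prints one line starting with "GPU " per device.

```python
import shutil
import subprocess


def count_visible_gpus(nvidia_smi_output=None):
    """Count GPUs from `nvidia-smi -L` output (one 'GPU N: ...' line per device).

    Pass the output text directly to parse it; when omitted, the function
    tries to run `nvidia-smi -L` itself and returns 0 if that is not possible.
    """
    if nvidia_smi_output is None:
        if shutil.which("nvidia-smi") is None:
            return 0  # no NVIDIA driver tools on PATH
        result = subprocess.run(
            ["nvidia-smi", "-L"], capture_output=True, text=True, check=False
        )
        if result.returncode != 0:
            return 0  # driver present but not usable
        nvidia_smi_output = result.stdout
    return sum(
        1 for line in nvidia_smi_output.splitlines() if line.startswith("GPU ")
    )


def check_gpu_request(requested, available):
    """Return an error message if `requested` GPUs cannot be satisfied, else None."""
    if requested < 1:
        raise ValueError("requested must be at least 1")
    if requested > available:
        return (
            f"--num_gpus={requested} requested but only "
            f"{available} GPU(s) are visible"
        )
    return None
```

Calling `check_gpu_request(2, count_visible_gpus())` before launching the run returns a message when fewer than two devices are visible, which helps separate instance setup problems from errors in the training/eval pipeline itself.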