
force same gpu #398

Merged
mike-ferguson merged 1 commit into main from same_gpu_load on Mar 19, 2026

Conversation

@mike-ferguson
Member

Fix multi-GPU device mismatch in HuggingfaceSubject

Problem

When running with device_map='auto' on multi-GPU machines, inputs were always sent to cuda:0, while the embedding layer could be placed on another GPU (e.g. cuda:3), causing:

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:3 and cuda:0!

Small models (e.g. DistilGPT-2) were most affected, because for them device_map='auto' is more likely to place the embedding layer on a non-zero device.

Solution

  1. Use the embedding layer's device for inputs instead of assuming cuda:0: self.device = self.basemodel.get_input_embeddings().weight.device
  2. In estimate_reading_times, move actual_tokens to predicted_logits.device before F.cross_entropy, since logits can reside on a different GPU with device_map='auto'.
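The two fixes above can be sketched as follows. This is a minimal illustration, not the actual HuggingfaceSubject implementation: the class and method names follow the PR text, but the constructor signature and the body of estimate_reading_times are assumptions.

```python
import torch
import torch.nn.functional as F


class HuggingfaceSubject:
    def __init__(self, basemodel):
        self.basemodel = basemodel
        # Fix 1: take the device of the embedding layer instead of
        # assuming cuda:0 -- with device_map='auto' the embeddings may
        # have been sharded onto any GPU (e.g. cuda:3).
        self.device = basemodel.get_input_embeddings().weight.device

    def estimate_reading_times(self, predicted_logits, actual_tokens):
        # Fix 2: with device_map='auto' the logits can live on a
        # different GPU than the target tokens, so move the targets to
        # the logits' device before computing the loss.
        actual_tokens = actual_tokens.to(predicted_logits.device)
        return F.cross_entropy(predicted_logits, actual_tokens,
                               reduction='none')
```

Inputs built with `inputs.to(self.device)` then land on whichever GPU actually holds the embedding layer, so the forward pass no longer mixes devices.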

Impact

  • Fixes multi-GPU runs for all model sizes when device_map='auto' is used.
  • No behavior change for single-GPU use.
  • Batch jobs typically use one GPU per job, so they were unaffected.

@KartikP KartikP added the OOM label Mar 18, 2026
@mike-ferguson mike-ferguson merged commit 151976f into main Mar 19, 2026
17 of 18 checks passed
@mschrimpf mschrimpf deleted the same_gpu_load branch March 25, 2026 16:16