Skip to content

add 5 large models#399

Merged
mike-ferguson merged 2 commits intomainfrom
add_5-large_models
Mar 19, 2026
Merged

add 5 large models#399
mike-ferguson merged 2 commits intomainfrom
add_5-large_models

Conversation

@mike-ferguson
Copy link
Copy Markdown
Member

Adds five new Brain-Score language model plugins:

  • Mistral-Small-24B (mistralai/Mistral-Small-24B) — 40 layers, hidden size 5120, layer mapping model.layers.39
  • Falcon-40B (tiiuae/falcon-40b) — 60 layers, hidden size 8192, layer mapping transformer.h.59
  • MPT-30B (mosaicml/mpt-30b) — 48 layers, hidden size 7168, layer mapping transformer.blocks.47
  • OPT-30B (facebook/opt-30b) — 48 layers, hidden size 7168, layer mapping model.decoder.layers.47
  • Qwen2.5-14B (Qwen/Qwen2.5-14B) — 48 layers, hidden size 5120, layer mapping model.layers.47

All models are registered via HuggingfaceSubject with the last transformer layer mapped to the language system, following the existing plugin pattern (e.g. Mistral-7B, Falcon-7B).
Each plugin includes tests for model loading, neural representation shape, and next-word prediction.

Test plan

  • pytest brainscore_language/models/mistral_small_24b/test.py -m memory_intense — all tests pass, neuroid dim = 5120
  • pytest brainscore_language/models/falcon40b/test.py -m memory_intense — all tests pass, neuroid dim = 8192
  • pytest brainscore_language/models/mpt30b/test.py -m memory_intense — all tests pass, neuroid dim = 7168
  • pytest brainscore_language/models/opt30b/test.py -m memory_intense — all tests pass, neuroid dim = 7168
  • pytest brainscore_language/models/qwen2_5_14b/test.py -m memory_intense — all tests pass, neuroid dim = 5120

@mike-ferguson mike-ferguson added the submission_prepared Attached to a PR is metadata and layer mapping is successful. label Mar 19, 2026
@mike-ferguson mike-ferguson merged commit 74aa89b into main Mar 19, 2026
10 of 11 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

submission_prepared Attached to a PR is metadata and layer mapping is successful.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant