Encoding models in functional magnetic resonance imaging: the Voxelwise Encoding Model framework

This repository contains the code and analysis scripts to reproduce the figures from the guide on the Voxelwise Encoding Model (VEM) framework:

Visconti di Oleggio Castello*, M., Deniz*, F., Dupré la Tour, T., & Gallant, J. L. (2025). Encoding models in functional magnetic resonance imaging: the Voxelwise Encoding Model framework. PsyArXiv. https://doi.org/10.31234/osf.io/nt2jq_v2

*equal contribution

Overview

This codebase implements a basic VEM analysis on public data. It demonstrates how to fit banded ridge regression models that use motion energy and WordNet semantic features to predict neural responses across different brain regions. It is heavily based on the Voxelwise Encoding Model tutorials.
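To make the idea of banded ridge regression concrete, here is a minimal NumPy sketch on synthetic data. It is not the code used in this repository, which relies on himalaya. The function names `fit_banded_ridge` and `r2_score`, the toy feature spaces, and the candidate penalty grid are all assumptions made for illustration. The key point is that each feature space gets its own regularization strength, chosen here with a simple held-out split.

```python
import itertools
import numpy as np


def fit_banded_ridge(Xs, Y, alphas):
    """Closed-form ridge with one penalty per feature space (band).

    Xs: list of (n_samples, n_features_i) arrays, one per feature space.
    Y: (n_samples, n_targets) array of responses.
    alphas: one non-negative penalty per feature space.
    """
    X = np.hstack(Xs)
    # Diagonal penalty: alpha_i repeated for every column of feature space i.
    penalty = np.concatenate(
        [np.full(Xi.shape[1], alpha) for Xi, alpha in zip(Xs, alphas)]
    )
    return np.linalg.solve(X.T @ X + np.diag(penalty), X.T @ Y)


def r2_score(Y_true, Y_pred):
    """Coefficient of determination, computed separately for each target."""
    ss_res = ((Y_true - Y_pred) ** 2).sum(axis=0)
    ss_tot = ((Y_true - Y_true.mean(axis=0)) ** 2).sum(axis=0)
    return 1.0 - ss_res / ss_tot


rng = np.random.default_rng(0)
n_train, n_val, n_targets = 200, 100, 5
# Two toy feature spaces standing in for motion energy and WordNet features.
X_a = rng.standard_normal((n_train + n_val, 20))
X_b = rng.standard_normal((n_train + n_val, 30))
# Only the first feature space drives the simulated responses.
W_a = rng.standard_normal((20, n_targets))
Y = X_a @ W_a + rng.standard_normal((n_train + n_val, n_targets))

train, val = slice(0, n_train), slice(n_train, None)
candidates = [0.1, 10.0, 1000.0]  # hypothetical penalty grid

best_score, best_alphas = -np.inf, None
for alphas in itertools.product(candidates, repeat=2):
    W = fit_banded_ridge([X_a[train], X_b[train]], Y[train], alphas)
    Y_pred = np.hstack([X_a[val], X_b[val]]) @ W
    score = r2_score(Y[val], Y_pred).mean()
    if score > best_score:
        best_score, best_alphas = score, alphas

print("selected penalties (space A, space B):", best_alphas)
print("mean validation R2:", round(float(best_score), 3))
```

In this toy setup only the first feature space carries signal, so a strong penalty on it hurts prediction while the second space can be shrunk freely. An exhaustive grid like this scales poorly with the number of feature spaces, which is one reason to use a dedicated library for real data.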

Installation

Prerequisites

Python 3.10+
CUDA-compatible GPU (recommended for model fitting; fitting on CPU is possible but will take much longer)
uv - Modern Python package manager
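The prerequisites above can be checked with a short script. This is only a sketch: the function name `check_prerequisites` and the labels it reports are hypothetical. The GPU check relies on torch, so it is only meaningful once the dependencies are installed and the script is run inside the project environment (for example with `uv run python`); before that it simply reports the GPU as missing.

```python
import shutil
import sys


def check_prerequisites():
    """Return a dict describing which prerequisites appear to be satisfied."""
    status = {
        "python>=3.10": sys.version_info >= (3, 10),
        "uv on PATH": shutil.which("uv") is not None,
    }
    try:
        import torch  # only available after the dependencies are installed

        status["CUDA GPU visible to torch"] = torch.cuda.is_available()
    except ImportError:
        status["CUDA GPU visible to torch"] = False
    return status


for name, ok in check_prerequisites().items():
    print(f"{name}: {'ok' if ok else 'missing'}")
```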

Setup

Clone this repository:

```
git clone https://github.com/gallantlab/vem-review.git
cd vem-review
```

Install uv (if not already installed):

```
curl -LsSf https://astral.sh/uv/install.sh | sh
```

Install the package and dependencies:

```
uv sync
```

This will automatically create a virtual environment and install all required dependencies, including Jupyter for running notebooks.

Dependencies

The project requires several scientific computing and neuroimaging libraries:

Core dependencies:

numpy, scipy, scikit-learn
matplotlib (visualization)
h5py (data storage)
torch (GPU acceleration)

Neuroimaging libraries:

pycortex (cortical surface visualization)
himalaya (efficient ridge regression models)
pymoten (motion energy features)
voxelwise_tutorials (utilities)

Data management:

datalad (automatic data downloading)

All dependencies are defined in pyproject.toml and installed automatically by uv sync.

Data

The project uses experimental data from the Gallant Lab's short clips dataset. The necessary data are downloaded automatically from https://gin.g-node.org/gallantlab/shortclips when you run the analysis scripts. The first run of a script downloads approximately 5 GB of data.

Usage

Basic Workflow

Fit encoding models:
```
uv run python scripts/01_fit-banded-ridge.py S01
```
This script fits banded ridge regression models with motion energy and WordNet features for subject S01. Replace S01 with other subjects (S02, S03, S04, S05) as needed.
Generate visualizations:
```
uv run python scripts/02_plot-banded-ridge.py S01
```
This script creates cortical surface visualizations and analysis plots for the fitted models. These are the plots shown in the figures of the paper.
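To run both steps for every subject, the two commands above can be wrapped in a small driver. The following is a sketch, not part of the repository: the helper names `build_commands` and `run_all` are hypothetical, and the subject list simply repeats the IDs mentioned above. By default it only prints the commands; pass `dry_run=False` to execute them.

```python
import subprocess

SUBJECTS = ["S01", "S02", "S03", "S04", "S05"]
SCRIPTS = ["scripts/01_fit-banded-ridge.py", "scripts/02_plot-banded-ridge.py"]


def build_commands(subjects=SUBJECTS, scripts=SCRIPTS):
    """Return one `uv run python <script> <subject>` command per step."""
    return [
        ["uv", "run", "python", script, subject]
        for subject in subjects
        for script in scripts
    ]


def run_all(dry_run=True):
    """Print each command and, unless dry_run is set, execute it in order."""
    for command in build_commands():
        print(" ".join(command))
        if not dry_run:
            # check=True stops the loop as soon as one step fails.
            subprocess.run(command, check=True)


if __name__ == "__main__":
    run_all(dry_run=True)
```

Fitting comes before plotting for each subject, so a failed fit stops the run before the plotting script looks for results that do not exist.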

Jupyter Notebooks

The notebooks/ directory contains the simulations used in Boxes 3 and 4 of the paper.

example-fir-model.ipynb: Shows how finite impulse response (FIR) models work
simulate-noise-ceiling.ipynb: Explains how the normalized correlation coefficient can account for different levels of noise
utils.py: Utility functions for notebook analyses
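As a rough illustration of the FIR idea covered in the first notebook, the sketch below builds time-delayed copies of a feature matrix. A regression fit on the delayed matrix learns a separate weight for each feature at each delay, so the shape of the hemodynamic response is estimated from the data rather than assumed. The function name `make_delayed` and the chosen delays are hypothetical; the notebook itself may be organized differently.

```python
import numpy as np


def make_delayed(X, delays):
    """Concatenate time-shifted copies of X, one copy per delay.

    X: (n_timepoints, n_features) feature matrix sampled at the fMRI TR.
    delays: non-negative integer shifts, in samples.
    """
    n_timepoints, n_features = X.shape
    delayed = np.zeros((n_timepoints, n_features * len(delays)))
    for i, delay in enumerate(delays):
        columns = slice(i * n_features, (i + 1) * n_features)
        # Row t of this copy holds the features from time t - delay;
        # the first `delay` rows stay zero because no earlier data exist.
        delayed[delay:, columns] = X[: n_timepoints - delay]
    return delayed


X = np.arange(1.0, 6.0).reshape(-1, 1)  # one feature, five time points
X_delayed = make_delayed(X, delays=[1, 2])
print(X_delayed)
```

Each original feature becomes `len(delays)` columns, so the number of regression weights grows with the number of delays.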

To run notebooks with the proper environment:

```
uv run jupyter notebook notebooks/
```
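For a quick feel of what the noise-ceiling notebook addresses before opening it, the sketch below simulates a model that predicts the stimulus-driven signal perfectly and compares raw and normalized correlations at several noise levels. It is a simplified illustration under strong assumptions: the signal has unit variance and the noise variance is known, whereas with real data the ceiling has to be estimated, typically from repeated presentations. It is not the estimator used in the notebook.

```python
import numpy as np

rng = np.random.default_rng(0)
n_timepoints = 5000
signal = rng.standard_normal(n_timepoints)  # stimulus-driven response, variance ~1
prediction = signal  # an idealized model that predicts the signal perfectly

results = []
for noise_sd in [0.5, 1.0, 2.0]:
    measured = signal + noise_sd * rng.standard_normal(n_timepoints)
    raw_r = np.corrcoef(prediction, measured)[0, 1]
    # Best correlation any model can reach at this noise level, assuming
    # unit signal variance and a known noise variance.
    ceiling = 1.0 / np.sqrt(1.0 + noise_sd**2)
    results.append((noise_sd, raw_r, raw_r / ceiling))
    print(f"noise sd={noise_sd}: raw r={raw_r:.2f}, normalized r={raw_r / ceiling:.2f}")
```

The raw correlation drops as noise increases even though the model is unchanged, while the normalized value stays close to 1, which makes scores comparable across voxels with different noise levels.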

Output

Running the analysis scripts will generate:

```
tutorial-data/          # Created by scripts, contains downloaded data and results
├── shortclips/         # Downloaded experimental data
│   ├── features/       # Motion energy and WordNet features
│   ├── mappers/        # Cortical surface mappers per subject
│   ├── responses/      # fMRI responses per subject
│   ├── stimuli/        # Training and test stimuli
│   └── utils/          # WordNet categories and graph structure
├── results/            # Fitted model results
│   └── {subject}_bandedridge.hdf  # Cross-validation scores, model weights
└── figures/            # Generated visualizations per subject
    └── {subject}/
        ├── {subject}_ev.png                    # Explained variance visualization
        ├── {subject}_joint_r2_scores.png       # Joint model R² scores
        ├── {subject}_split_r2_cvscores.png     # Split model cross-validation scores
        ├── {subject}_split_r2_scores.png       # Split model R² scores
        ├── {subject}_wordnet_flatmap_pc1.png   # WordNet PC1 flatmap visualization
        ├── {subject}_wordnet_flatmap_pc234.png # WordNet PC2-4 flatmap visualization
        └── {subject}_wordnet_graph_*.png       # WordNet semantic graphs for specific ROIs
```
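To confirm that a run produced everything listed above, the layout can be turned into a simple check. This is a sketch with hypothetical helper names (`expected_outputs`, `missing_outputs`); it assumes the scripts were run from the repository root and skips the `wordnet_graph_*` files because their exact names depend on the ROI.

```python
from pathlib import Path


def expected_outputs(subject, root="tutorial-data"):
    """List the fixed-name files the scripts are documented to create."""
    root = Path(root)
    figures = root / "figures" / subject
    names = [
        "ev", "joint_r2_scores", "split_r2_cvscores", "split_r2_scores",
        "wordnet_flatmap_pc1", "wordnet_flatmap_pc234",
    ]
    paths = [root / "results" / f"{subject}_bandedridge.hdf"]
    paths += [figures / f"{subject}_{name}.png" for name in names]
    return paths


def missing_outputs(subject, root="tutorial-data"):
    """Return the expected files that do not exist yet."""
    return [path for path in expected_outputs(subject, root) if not path.exists()]


for path in missing_outputs("S01"):
    print("missing:", path)
```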

Citation

If you use this code in your research, please cite:

```
@article{vem-review,
  title={Encoding models in functional magnetic resonance imaging: the Voxelwise Encoding Model framework},
  author={{Visconti di Oleggio Castello}, Matteo and Deniz, Fatma and {Dupré la Tour}, Tom and Gallant, Jack L.},
  journal={PsyArXiv},
  year={2025},
  doi={10.31234/osf.io/nt2jq_v2},
  url={https://doi.org/10.31234/osf.io/nt2jq_v2}
}
```

License

This project is licensed under the BSD 3-Clause License - see the LICENSE.md file for details.

Support

For questions or issues, please open an issue on the GitHub repository.

Acknowledgments

This work builds upon these existing neuroimaging analysis tools:

Himalaya for ridge regression
Pycortex for cortical visualization
Voxelwise Tutorials for analysis utilities
