remove unnecessary parallelize usage in sepal #1120

Open

selmanozleyen wants to merge 10 commits into scverse:main from selmanozleyen:feat/remove-parallelize-minimal

Conversation

@selmanozleyen
Member

selmanozleyen commented Feb 20, 2026

Hi,

From the rsc sepal implementation I know we can easily get rid of parallelize here. I also have a longer-term plan to remove parallelize entirely. I benchmarked with this script: https://gist.github.com/selmanozleyen/c4b0e4780243fd4621f68bc2d78cae5c

Results, prettified with AI:

Benchmark results (502 genes, 2688 cells, visium_hne_adata, 3 runs, 8 cores):

| variant | mean | median | min | max |
| --- | --- | --- | --- | --- |
| main, n_jobs=1 (sequential) | 134.9s | 134.4s | 134.3s | 136.0s |
| main, n_jobs=6 (joblib/loky) | 40.5s | 40.2s | 37.9s | 43.5s |
| this PR, prange (no pre-alloc) | 37.7s | 37.6s | 36.9s | 38.6s |
| this PR, prange + pre-alloc buffers | 31.9s | 31.3s | 30.9s | 33.6s |

Motivation

This PR replaces the internal parallelize + joblib.Parallel machinery in
sepal() with Numba's native prange threading, removing the need for
multi-process orchestration (subprocess spawning, pickling, IPC).

Key changes

  1. prange over genes -- The outer gene loop is parallelized with
     numba.prange inside a @njit(parallel=True) function, replacing
     joblib.Parallel(n_jobs=..., backend="loky").

  2. Pre-allocated per-thread workspace buffers (commit e011ddc) -- The rationale
     is explained under "Allocation hoisting" in the Numba parallel diagnostics
     docs: https://numba.readthedocs.io/en/stable/user/parallel.html#diagnostics

@codecov

codecov bot commented Feb 20, 2026

Codecov Report

❌ Patch coverage is 32.35294% with 23 lines in your changes missing coverage. Please review.
✅ Project coverage is 66.16%. Comparing base (6dc2a91) to head (357966a).

Files with missing lines Patch % Lines
src/squidpy/gr/_sepal.py 32.35% 23 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1120      +/-   ##
==========================================
- Coverage   66.36%   66.16%   -0.21%     
==========================================
  Files          44       44              
  Lines        7132     7137       +5     
  Branches     1212     1209       -3     
==========================================
- Hits         4733     4722      -11     
- Misses       1923     1941      +18     
+ Partials      476      474       -2     
Files with missing lines Coverage Δ
src/squidpy/gr/_sepal.py 45.07% <32.35%> (-9.68%) ⬇️

@flying-sheep
Member

flying-sheep left a comment
Nice, very straightforward.

The only potential issue I see is that you densify vals as one, whereas the other version densified at most n_jobs columns of vals at a time.

Are there use cases for this where that matters?

```python
sat_shape = sat.shape[0]
n_threads = get_num_threads()

# Pre-allocate per-thread workspace to avoid allocator contention
```
@flying-sheep
Member

flying-sheep commented Feb 20, 2026

I wonder if there isn’t a way to tell numba to do the pre-allocation instead of doing it manually.

@selmanozleyen
Member Author

I am also wondering the same, but I couldn't find a way to do it without calling get_num_threads().

@selmanozleyen
Member Author

@Intron7 do you know anything about this?

@timtreis
Member

> I also have a longer term plan to remove parallelize.

Is that plan fully based on prange? How would you e.g. replace a case like https://github.com/scverse/squidpy/pull/982/changes#diff-bd3d3c041f3b69cca0f8b6ece0e25425eaf6329c636038e53062b1fcf1108285R342 ?

@selmanozleyen
Member Author

> Is that plan fully based on prange? How would you f.e. replace a case like https://github.com/scverse/squidpy/pull/982/changes#diff-bd3d3c041f3b69cca0f8b6ece0e25425eaf6329c636038e53062b1fcf1108285R342 ?

I would need extra time to review that one, but I am confident we can find a way that uses all the cores without multi-process and serialization overhead.

@selmanozleyen
Member Author

> The only potential issue I see is that you densify vals as one, whereas the other version densified at most n_jobs columns of vals at a time.
>
> Are there use cases for this where that matters?

It matters for potential OOMs. Densifying in batches is doable, but densifying up front is better for speed, and I don't think we will see inputs that cause memory issues, since sepal works only with spot-based datasets. Is that correct @timtreis ?

@timtreis
Member

IIRC sepal works in any situation where the obs are arranged in a regular grid or pattern. That'd include Visium with about 4k spots, yes, but also Visium HD, which depending on the bin size you choose could have more than a million obs. Not sure that'd actually be useful, but it could be a potential input.

@selmanozleyen
Member Author

OK, I added a sparse_batch_size, which defaults to 128. Given that 100m cells per gene is around 400mb, I think this is a reasonable default.

Here are the results:

```
Running sepal() 3 time(s)...
  run 1: 45.700s
  run 2: 44.678s
  run 3: 43.301s

Results (3 runs):
  mean:   44.560s
  median: 44.678s
  min:    43.301s
  max:    45.700s
```
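For readers following along, the batched densification idea can be sketched like this. This is a hypothetical helper, not the code in this PR; the name `iter_dense_batches` and the default of 128 are just taken from the discussion above:

```python
import numpy as np
from scipy import sparse


def iter_dense_batches(vals, batch_size=128):
    """Yield (start, dense_block) pairs, densifying at most
    `batch_size` columns (genes) at a time to bound peak memory."""
    n_genes = vals.shape[1]
    for start in range(0, n_genes, batch_size):
        stop = min(start + batch_size, n_genes)
        block = vals[:, start:stop]
        if sparse.issparse(block):
            # Only this slice is ever dense at once.
            block = block.toarray()
        yield start, np.ascontiguousarray(block)
```

Each dense block can then be handed to the njit-compiled kernel, so peak memory is bounded by `n_cells * batch_size` values regardless of the total gene count.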
