p2p: batch publish data column sidecars #16183
A relatively small change to optimize network send order. Without this, network writes tend to prioritize sending data for one column to all peers before sending data for later columns (e.g. for two columns and 4 peers per column it would send A,A,A,A,B,B,B,B). With batch publishing we can change the write order to round-robin across columns (e.g. A,B,A,B,A,B,A,B). In cases where the process is sending at a rate over the network limit, this approach allows at least some copies of each column to propagate through the network. In early simulations with a 50 Mbps bandwidth limit for the publisher, this improved dissemination by ~20-30%.
@kasey Kurtosis testing looks good: Here is the Kurtosis config file: Here are the logs showing batch publishing of data columns (I added these log lines locally, haven't committed them):
if seen[topic] == 0 {
	t.Errorf("Expected topic %s to be sent at least once", topic)
}
}
The above test checks the new behavior by verifying the following invariants:
- all expected topics are seen at least once
- no topic is published to twice until all expected topics have had one publish
I think we should also have a test case making sure that each peer is given each expected message. An example bug that the current test assertions would miss is "only publish each message to 1 peer".
beacon-chain/p2p/pubsub.go (Outdated)
case <-ctx.Done():
	return errors.Wrapf(ctx.Err(), "unable to find requisite number of peers for topic %s, 0 peers found to publish to", topic)
default:
	time.Sleep(100 * time.Millisecond)
select will enter the default case immediately if ctx.Done() is not ready, at which point the goroutine sleeps without checking for context cancellation, then re-enters the loop, where we may call AddToBatch even though the context was canceled during the sleep. This is why we should generally prefer a concurrent context check until the deadline hits, using time.After, like this:
for {
	if len(topicHandle.ListPeers()) > 0 || flags.Get().MinimumSyncPeers == 0 {
		return topicHandle.AddToBatch(ctx, batch, data, opts...)
	}
	select {
	case <-ctx.Done():
		return errors.Wrapf(ctx.Err(), "unable to find requisite number of peers for topic %s, 0 peers found to publish to", topic)
	case <-time.After(100 * time.Millisecond):
		// after 100ms, re-enter the for loop
	}
}
beacon-chain/p2p/pubsub.go (Outdated)
func (s *Service) addToBatch(ctx context.Context, batch *pubsub.MessageBatch, topic string, data []byte, opts ...pubsub.PubOpt) error {
	topicHandle, err := s.JoinTopic(topic)
	if err != nil {
		return err
Please wrap errors.
beacon-chain/p2p/broadcaster.go (Outdated)
// method to broadcast or batch messages to other peers in our gossip mesh. If
// batch is non-nil the message is added to the batch WITHOUT publishing. The
// caller MUST publish the batch after all messages have been added to the batch.
func (s *Service) broadcastOrBatchObject(ctx context.Context, batch *pubsub.MessageBatch, obj ssz.Marshaler, topic string) error {
I understand you created this broadcastOrBatchObject to avoid some code duplication between broadcastObject and batchObject.
However, in this precise case, I think it would be better to:
- remove broadcastOrBatchObject
- create a subfunction running the common code
- call this subfunction in both broadcastObject and batchObject.
(Also, the span p2p.broadcastObject is now wrong.)
beacon-chain/p2p/broadcaster.go (Outdated)
// Broadcast the data column sidecar to the network.
-	if err := s.broadcastObject(ctx, sidecar, topic); err != nil {
+	if err := s.batchObject(ctx, &messageBatch, sidecar, topic); err != nil {
Before your PR:
If, for a given subnet, no peers were found, the sidecars corresponding to the other subnets were still broadcast.
After your PR:
No sidecar will be broadcast at all until the required number of peers (1) is found in every subnet for which sidecars must be broadcast.
@nalepae I agree. I have two thoughts on this:
- With pro-active peer discovery and connectivity for subnets in https://github.com/OffchainLabs/prysm/pull/16036/files, not having peers for a given subnet will be less of a problem.
- We could periodically batch publish here and empty the batch if timeouts are hit, i.e. if it's been, say, 1s since the last publish, publish the batch right away and continue batching.
What about first splitting into two categories: sidecars with enough peers and sidecars without enough peers?
For sidecars with enough peers, let's keep the new mechanism (batch publishing).
For sidecars without enough peers, let's start looking for new peers (one goroutine per sidecar) and publish as soon as there are enough peers (in the same goroutine)?
Done. Testing on Kurtosis now.
@nalepae Can you please do one more review?
Co-authored-by: Preston Van Loon <pvanloon@offchainlabs.com>
.gitignore (Outdated)
execution/

# AI assistant files
CLAUDE.md
Actually, we currently don't have a CLAUDE.md file for Prysm. I think:
- we should have one
- we should NOT ignore it.
If you want a personal CLAUDE.md file, it's better to use CLAUDE.local.md (that one is ignored by git).
NIT: No newline at the end of the file.
Removed the CLAUDE.md file for now.
Generally prefer folks to use .git/info/exclude rather than edit the repo gitignore.
@nalepae Kurtosis testing looks good:
Looks okay so far on Hoodi, nothing fishy:
Co-authored-by: Manu NALEPA <enalepa@offchainlabs.com>




What type of PR is this?
Feature
What does this PR do? Why is it needed?
This PR takes @MarcoPolo's PR at #16130 to completion with tests.
The description on his PR:
"""
A relatively small change to optimize network send order.
Without this, network writes tend to prioritize sending data for one column to all peers before sending data for later columns (e.g. for two columns and 4 peers per column it would send A,A,A,A,B,B,B,B). With batch publishing we can change the write order to round-robin across columns (e.g. A,B,A,B,A,B,A,B).
In cases where the process is sending at a rate over the network limit, this approach allows at least some copies of each column to propagate through the network. In early simulations with a 50 Mbps bandwidth limit for the publisher, this improved dissemination by ~20-30%.
"""
See the issue for some more context.
Which issues(s) does this PR fix?
Fixes #16129
Other notes for review
Acknowledgements